-
- Use the dataset given in the readme
- Use [Google Colab](https://research.google.com/colaboratory/) to perform the EDA
- Do not copy-paste code from other places without giving proper references
…
-
Using the relevant dataset, a machine learning model has been trained to detect whether a person has heart disease. The model has been trained using two classification techniques:…
-
A recipe on elementary techniques to explore categorical data
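
As a minimal sketch of what such a recipe might start with, the snippet below builds a one-way frequency table and a two-way cross-tabulation using only the Python standard library. The function names `frequency_table` and `cross_tab` and the sample column names are hypothetical and chosen only for illustration; the recipe itself may use different tools.

```python
from collections import Counter


def frequency_table(values):
    """Return (level, count, proportion) tuples, most frequent level first."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(level, n, n / total) for level, n in counts.most_common()]


def cross_tab(rows, row_key, col_key):
    """Count co-occurrences of two categorical fields across dict records."""
    return dict(Counter((r[row_key], r[col_key]) for r in rows))


# Hypothetical sample records, for illustration only.
records = [
    {"sex": "F", "smoker": "no"},
    {"sex": "M", "smoker": "yes"},
    {"sex": "F", "smoker": "no"},
    {"sex": "F", "smoker": "yes"},
]

sex_freq = frequency_table(r["sex"] for r in records)
sex_by_smoker = cross_tab(records, "sex", "smoker")
```

Proportions alongside raw counts make it easier to spot rare levels that may need to be merged or handled separately.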
-
# MA plot
[Wiki page](https://en.wikipedia.org/wiki/MA_plot)
[Interpretation of differential gene expression results of RNA-seq data: review and integration](https://dx.doi.org/10.1093%2Fbib%2Fbby…
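
For orientation, here is a small sketch of the quantities an MA plot displays: for two paired measurements per gene, M is the log2 ratio and A is the mean of the two log2 values. The function name `ma_values` and the `pseudocount` parameter are assumptions made for this example (a pseudocount is one common way to avoid taking the log of zero counts); they do not come from the linked pages.

```python
import math


def ma_values(x, y, pseudocount=1.0):
    """Compute M (log2 ratio) and A (mean log2 level) for paired values.

    x and y are equal-length sequences of non-negative intensities or counts.
    The pseudocount is an illustrative assumption to keep log2 defined at zero.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    m, a = [], []
    for xi, yi in zip(x, y):
        lx = math.log2(xi + pseudocount)
        ly = math.log2(yi + pseudocount)
        m.append(lx - ly)        # M: difference of log2 values
        a.append((lx + ly) / 2)  # A: average of log2 values
    return m, a


# Hypothetical counts for two samples, for illustration only.
m, a = ma_values([3, 7], [1, 7])
```

Plotting `a` on the horizontal axis against `m` on the vertical axis gives the MA plot; genes with no change between the two samples sit near M = 0.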
-
Create an exploratory data analysis notebook `notebooks/01-exploratory_data_analysis.ipynb`, containing the following text sections:
- [x] **Data description**: information about the number of in…
-
The business case is very clear and detailed—great job on this! The data has been loaded, but it hasn’t been cleaned yet, and there’s no clear indication of who will be working on which parts of the p…
-
- [x] prepare data dictionary
- [ ] manage code in github
- [ ] review dictionary and meanings with Fiona
- [ ] basic data cleaning
- [ ] prepare summary statistics including missingness for all data
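
The summary statistics item in the checklist above could be sketched roughly as follows. This is a standard-library illustration under stated assumptions: records are dicts, a value counts as missing when it is `None`, an empty string, or an absent key, and the name `summarize` and the sample columns are hypothetical.

```python
import statistics


def summarize(rows, columns):
    """Per-column counts, missingness, and basic numeric statistics."""
    summary = {}
    for col in columns:
        values = [row.get(col) for row in rows]  # absent keys become None
        present = [v for v in values if v is not None and v != ""]
        n_missing = len(values) - len(present)
        stats = {
            "n": len(values),
            "n_missing": n_missing,
            "pct_missing": 100 * n_missing / len(values) if values else 0.0,
        }
        numeric = [
            v for v in present
            if isinstance(v, (int, float)) and not isinstance(v, bool)
        ]
        # Only report numeric statistics when every present value is numeric.
        if numeric and len(numeric) == len(present):
            stats["mean"] = statistics.mean(numeric)
            stats["median"] = statistics.median(numeric)
            stats["min"] = min(numeric)
            stats["max"] = max(numeric)
        summary[col] = stats
    return summary


# Hypothetical records, for illustration only.
rows = [
    {"age": 40, "sex": "F"},
    {"age": None, "sex": "M"},
    {"age": 60, "sex": ""},
    {"age": 50},
]
report = summarize(rows, ["age", "sex"])
```

Reporting the missing percentage next to each statistic keeps it visible how many observations each mean or median actually rests on.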
-
Work on summary statistics tables using the **select**, **mutate**, and **group_by** commands from the tidyverse (dplyr) package.