Closed ailithewing closed 1 year ago
@catavallejos @nathansam @hwarden162 @Alanocallaghan Please add any additional things that I've missed.
K-means: set a seed for the heatmap code chunk starting with library("pheatmap")
(this might already be covered by the coloured blocks to-do)
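A minimal sketch of the kind of change meant here, assuming the chunk contains a random step such as k-means initialisation. The matrix `mat` and the seed value 42 are hypothetical stand-ins, not taken from the episode. In the lesson itself the `set.seed()` call would go at the top of the chunk that starts with library("pheatmap").

```r
# Hypothetical stand-in data; the episode's own matrix would be used instead.
set.seed(42)
mat <- matrix(rnorm(200), nrow = 20)

# Resetting the seed immediately before the random step makes it repeatable:
# both runs start from the same random number state.
set.seed(42)
clusters_a <- kmeans(mat, centers = 3)$cluster

set.seed(42)
clusters_b <- kmeans(mat, centers = 3)$cluster

# Compare the two sets of cluster assignments.
identical(clusters_a, clusters_b)
```

Without the repeated `set.seed()` call the two runs may pick different starting centres, so cluster labels (and any heatmap ordering derived from them) can change between renders.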
Challenge 1 in episode 1: not sure about question 4. Is this a good example of high-dimensional data, given that it is one observation with so many features?
- Predicting probability of a patient's cancer progressing using gene expression data from 20,000 genes, as well as data associated with general patient health (age, weight, BMI, blood pressure) and cancer growth (tumour size, localised spread, blood test results).
Changing that challenge from a singular patient to plural patients would also be good, to avoid implying high precision from generic prediction models (i.e. precision medicine hype).
The current uniqueness/communality explanations contradict Wikipedia, I think: https://en.wikipedia.org/wiki/Factor_analysis#Terminology
One way of reducing the number of dependency packages is to move all the data wrangling code to a data package and then just remotes::install_github it.
Glossary still open, but covered by issue #89
A list of proposed changes following the May delivery of HDS
These are in addition to the changes in the pull request ailith_delivery3 and to the changes that Hannes made that have yet to be pushed to the main course materials.
Throughout
Intro
pairs() (from Emma's review in #39)
Regression with many features (many outcomes)
pairs() (from Emma's review in #39)
Regression in high-dimensional settings where we introduce the methylation data and the two different types of problems. However, this is outside the scope for this round of changes. Creating this separate episode would also address some of Emma's concerns.
dream() from variancePartition, which is similar to limma but can handle grouping (random effects)
Regularisation
pairs() (from Emma's review in #39)
PCA
pairs() (from Emma's review in #39)
FA
pairs() (from Emma's review in #39)
K means
pairs() (from Emma's review in #39)
Hierarchical clusters
pairs() (from Emma's review in #39)
Other
dependencies.csv can be reduced (see #34)