ramhiser / datamicroarray

A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.
104 stars 42 forks source link

Add data sets from this PLOS ONE paper #13

Closed ramhiser closed 9 years ago

ramhiser commented 11 years ago

This PLOS ONE paper provides a table of 19 2-population microarray data sets. I have gathered the majority of these data sets into the datamicroarray package, but below I have listed several the papers that I missed. (The data set number provided in Table 2 is given in square brackets.)

  1. Hodges (2006) HD Caudate [2]
  2. Hodges (2006) HD Cerebellum [4]
  3. Okada (2003) Liver Cancer [10]
  4. Beer (2002) Lung Cancer [12]
  5. Iizuka (2003) Liver Cancer [13]
  6. Dhanasekaran (2001) Prostate Cancer [14]
  7. Gruvberger (2001) Breast Cancer [15]
  8. Berchuck (2005) Ovarian Cancer [17]
  9. Zapala (2005) Neural Tissue [18]
ramhiser commented 11 years ago

For the Zapala (2005) data set: notice that the two classes in the PLOS ONE paper are brain regions versus body regions. Biologically, this is of interest as noted in the 3rd paragraph of the Zapala (2005) paper.