Closed jaclyn-taroni closed 1 year ago
LGTM. I might include MiSeq, depending on how many samples are out there. I know it was a pretty popular instrument, especially for smaller organisms. So if we expect to add new yeast data, I would keep it in there.
I checked the number of RNA-seq and transcriptomic samples assayed on Illumina MiSeq. There were some yeast, yes, but the majority were human and mouse. I spot-checked some random human and mouse experiments, and it wasn't obvious to me that we shouldn't support them, so I added that platform in https://github.com/AlexsLemonade/refinebio/pull/3281/commits/8bc04bb7193e85b6986186a270b3bf9e2d471d52.
👍🏼 It wouldn't surprise me if there were a lot of pilot studies in there. Or people tired of waiting for the core.
Issue Number
Closes #3280
Purpose/Implementation Notes
Here I'm adding newer (and some older) Illumina instrument models to our list of supported RNA-seq platforms.
Methods
I am requesting review from both David and Josh because of these methods.
My approach was to snag a TSV from the European Nucleotide Archive of human or mouse RNA-seq data with raw reads that were generated on an Illumina platform:
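For readers who want to reproduce a similar pull, here is a minimal Python sketch of how such a request could be assembled against the ENA Portal API search endpoint. It is not the query used for this PR. The function name `build_ena_query_url`, the choice of fields, and the exact filter expression are assumptions made for illustration.

```python
from urllib.parse import urlencode

# ENA Portal API search endpoint.
ENA_SEARCH = "https://www.ebi.ac.uk/ena/portal/api/search"


def build_ena_query_url(tax_ids=(9606, 10090)):
    """Build a URL asking ENA for Illumina RNA-seq runs as TSV.

    Hypothetical helper: the filter below (human = 9606, mouse = 10090,
    library_strategy RNA-Seq, instrument_platform ILLUMINA) is an assumed
    approximation of the selection described above, not the original query.
    """
    organisms = " OR ".join(f"tax_eq({tax_id})" for tax_id in tax_ids)
    query = (
        f"({organisms}) "
        'AND library_strategy="RNA-Seq" '
        'AND instrument_platform="ILLUMINA"'
    )
    params = {
        "result": "read_run",
        "query": query,
        # fastq_ftp is requested so runs without raw reads can be filtered out later.
        "fields": "run_accession,scientific_name,instrument_model,fastq_ftp",
        "format": "tsv",
    }
    return f"{ENA_SEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_ena_query_url())
```

The printed URL can be fetched with any HTTP client and saved as a TSV for the later steps.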
Then over to R:
Then we can look at what platforms are in the public data but not yet in our supported list of platforms with:
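The comparison itself is a set difference between the instrument models seen in the ENA export and the supported list. A minimal Python sketch of that idea follows. It is not the R code used for this PR, and it assumes the TSV has an `instrument_model` column. The helper name `unsupported_platforms` and the toy rows (including the placeholder run IDs) are made up for illustration.

```python
import csv
import io


def unsupported_platforms(ena_tsv_text, supported):
    """Return instrument models present in the TSV but absent from `supported`.

    Hypothetical helper; assumes a tab-separated table with an
    `instrument_model` column.
    """
    reader = csv.DictReader(io.StringIO(ena_tsv_text), delimiter="\t")
    observed = {
        row["instrument_model"].strip()
        for row in reader
        if row.get("instrument_model")
    }
    # Set difference: observed in public data, not yet supported.
    return sorted(observed - set(supported))


# Toy input with placeholder run IDs, not real ENA records.
example_tsv = (
    "run_accession\tinstrument_model\n"
    "RUN1\tIllumina HiSeq 2500\n"
    "RUN2\tIllumina NovaSeq 6000\n"
    "RUN3\tNextSeq 2000\n"
)
supported = ["Illumina HiSeq 2500"]

print(unsupported_platforms(example_tsv, supported))
# → ['Illumina NovaSeq 6000', 'NextSeq 2000']
```

Exact string matching is used here on purpose, since the supported list is compared against platform names as they appear in the public metadata.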
I included the platforms that Illumina currently positions for transcriptome sequencing: Illumina NovaSeq 6000, NextSeq 1000, Illumina NovaSeq X, NextSeq 2000.
Somewhat obviously, I will not be including “unspecified.” There are also a few benchtop sequencers that I didn’t include at this point (MiSeq, iSeq 100, MiniSeq).
For the HiSeq X platforms, I spot-checked a few experiments with this methodology (example using HiSeq X Five):
It seemed reasonable to add them.
Types of changes
What types of changes does your code introduce?
To my knowledge
Functional tests
N/A
Checklist
Put an x in the boxes that apply.