FDA-ARGOS / data.argosdb

MIT License
3 stars 7 forks source link

Mazumder - Select ngs data for Candida Auris, provide justification, and add data to ngsQC_HIVE #264

Closed steph-sing closed 1 year ago

steph-sing commented 1 year ago

Data Selected via this Protocol and justification: Fungi_Data_selection_protocol_SS1

  1. Candida auris genome assembly selected: https://www.ncbi.nlm.nih.gov/assembly/GCA_002759435.2/

  2. This NGS data is associated with the selected genome assembly, which is associated with the last outbreak between 2015-2018. Data was collected by the CDC and others:

  3. NGS Data associated with the most recent outbreak, also collected by the CDC, is in the following BioProject and has the following data:

@JingyueWu

steph-sing commented 1 year ago

@JingyueWu prioritize #3 in the list above

steph-sing commented 1 year ago

here is another reference genome cited by the group and under the same BioProject: https://www.ncbi.nlm.nih.gov/assembly/GCF_003013715.1/ @penningtonea can you please look into the this more?

JingyueWu commented 1 year ago

@steph-sing Status update (as of 5:30 pm April 11):

As instructed to prioritize 3), I have added the 117 SRAs into the QC Organism Tracking. Since there are 117 total, which is a lot, I highlighted 10 SRAs in the same color for now. That way, it is easier for me to keep track of them as I run ngsQC on them.

This afternoon, I finished running 20 SRAs (highlighted in green and pink on QC Organism Tracking). Their corresponding ngsQC metrics were entered on ngsQC_HIVE (starting from row 2198) from V1.3 folder.

I will continue this tomorrow. I foresee no blockers, but the parts that took the longest time were 1) to get those SRRs into HIVE, and 2) each time after ngsDataGrabber script was run, HIVE automatically logs me out.

JingyueWu commented 1 year ago

Status update: all 117 SRR's in 3) have been ngsQC'd and their metrics were recorded here, starting from row 2198.

In addition, the new entries were recorded on QC Organism Tracking (second tab)

Jgergely11 commented 1 year ago

@JingyueWu Please resolve the issues outlined below

penningtonea commented 1 year ago

here is another reference genome cited by the group and under the same BioProject: https://www.ncbi.nlm.nih.gov/assembly/GCF_003013715.1/ @penningtonea can you please look into the this more?

Would you like me to QC this assembly and compare to GCA_002759435.2?

JingyueWu commented 1 year ago

@Jgergely11 Fixed. Please review.