genophenoenvo / terraref-datasets

Repository for code and small datasets derived from the TERRA REF program
MIT License

Add to cultivar list comparison across seasons and traits #59

Closed. MagicMilly closed this issue 4 years ago

MagicMilly commented 4 years ago

Per feedback from Arun and Ishita, add to the cultivar list which cultivars are available in each combination of seasons (e.g. seasons 4 & 6, all seasons), and which cultivars are available for the pertinent traits.

MagicMilly commented 4 years ago

Sent @rossarun and @ishitadebnath this link with germplasmNames available for MAC Seasons 4 and 6. Would like to add the KSU cultivars to a table like this.

MagicMilly commented 4 years ago

Have added the KSU cultivars to a table with all cultivars and seasons, uploaded to Google Drive, and notified researchers.

rbartelme commented 4 years ago

@MagicMilly it would be good to touch base with @kshefchek to see what cultivars are in the VCF file. Then we'll have all the information that we need for the models. Ultimately, we'll want all cultivars across the three seasons that have genomic data.
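The selection described above can be sketched with plain set operations. This is a minimal Python sketch, assuming the per-season cultivar names and the VCF sample names have already been loaded into lists. The function name, the `in_every_season` flag, and all cultivar names below are hypothetical placeholders, not real TERRA REF data. Since "across the three seasons" could mean either "in any season" or "in every season", the flag covers both readings.

```python
def cultivars_with_genomic_data(season_lists, vcf_samples, in_every_season=False):
    """Return a sorted list of cultivars that also appear in the VCF samples.

    season_lists: dict mapping a season label to a list of cultivar names.
    vcf_samples: list of sample names taken from the VCF file.
    in_every_season: if True, keep only cultivars present in all seasons;
                     otherwise keep cultivars present in at least one season.
    """
    season_sets = [set(names) for names in season_lists.values()]
    if not season_sets:
        return []
    if in_every_season:
        pool = set.intersection(*season_sets)
    else:
        pool = set.union(*season_sets)
    return sorted(pool & set(vcf_samples))


# Hypothetical placeholder names, for illustration only.
seasons = {
    "season_4": ["cultivar_A", "cultivar_B", "cultivar_C"],
    "season_6": ["cultivar_B", "cultivar_C", "cultivar_D"],
    "ksu": ["cultivar_C", "cultivar_E"],
}
vcf_samples = ["cultivar_B", "cultivar_C", "cultivar_E"]

any_season = cultivars_with_genomic_data(seasons, vcf_samples)
all_seasons = cultivars_with_genomic_data(seasons, vcf_samples, in_every_season=True)
print(any_season)
print(all_seasons)
```

Proprietary cultivars without genomic data drop out automatically, because they never appear in the VCF sample list.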

MagicMilly commented 4 years ago

@rbartelme Thank you for that suggestion! Would be important to have that information in addition to the lists that also have proprietary cultivars mixed in. I'll talk to Kent. 👍

MagicMilly commented 4 years ago

Ran into duplicates in the SQL query results (when adding KSU cultivars), and I have individual lists that still need to be merged before this ticket is complete. May need to work with @rbartelme on R code, since that seems easier than Pandas or SQL.
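One way to merge the individual lists without carrying duplicates forward is to key a dictionary on the cultivar name and record which sources each name appears in. The sketch below uses plain Python rather than R, Pandas, or SQL. It assumes the duplicates come from repeated rows or stray whitespace; the actual cause in the query results is not stated here. The function names, source labels, and cultivar names are hypothetical.

```python
from collections import defaultdict


def build_lookup(lists_by_source):
    """Map each cultivar name to the set of sources it appears in."""
    sources_by_cultivar = defaultdict(set)
    for source, names in lists_by_source.items():
        for name in names:
            key = name.strip()  # assumption: whitespace differences cause some duplicates
            if key:
                # Adding to a set makes repeated rows collapse to one entry.
                sources_by_cultivar[key].add(source)
    return sources_by_cultivar


def to_rows(lookup, sources):
    """Build one row per cultivar with a True/False column per source."""
    return [
        {"cultivar": name, **{s: (s in lookup[name]) for s in sources}}
        for name in sorted(lookup)
    ]


# Hypothetical placeholder lists, including deliberate duplicates.
lists = {
    "season_4": ["cultivar_A", "cultivar_B", "cultivar_B "],
    "season_6": ["cultivar_B"],
    "ksu": ["cultivar_C", "cultivar_C"],
}
rows = to_rows(build_lookup(lists), ["season_4", "season_6", "ksu"])
for row in rows:
    print(row)
```

Each cultivar ends up on exactly one row, so the table can be written out as a CSV without a separate de-duplication pass.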

MagicMilly commented 4 years ago

Fixed the lookup table to remove duplicates and include cultivars in season 4, season 6, KSU, and those with genomic data available from the VCF file (thank you @kshefchek). Added to Google Drive.

[Image: Screen Shot 2020-05-22 at 7.56.17 AM]

Next step: push the notebooks, code, and SQL script used to create this table.