farhat-lab / gentb-site

The genTB project, the Django site, variant calling and prediciton pipeline, and mapping pipeline with hooks to two ravens
https://gentb.hms.harvard.edu
Other
8 stars 11 forks source link

Old data settings value #67

Closed doctormo closed 7 years ago

doctormo commented 7 years ago

These items are in the settings column and need translating.

mahafarhat commented 7 years ago

Ok can you please send me the name, source and setting field info for each of these.

I think we will need to rename the "Source" field to a field called "LaboratorySource", and move all the RIVM/MSLI/CDC etc "setting" data from gtbdr to this laboratory field.

For the wgsmtb data, erase the source data as it is (current a researcher's name) and replace with the following info for each line of data:

Name: LaboratorySource borowski\d: UCSF CDC-\d+: CDC \d\d-R\d+: MSLI
H37Ra: NCBI K-\d: FZB X122: Stellenbosch R1207: Stellenbosch R\d[3]\w?: Stellenbosch w-148: PHRI/NCBI c: PHRI/NCBI M\d{2-3}\w?: Stellenbosch haarlem: PHRI/NCBI MT00\d\d: BCCDC HN878: NCBI MTB210: NCBI K\d{2}\d?: RIVM \d{4-5}_\d{2}: FZB T\d{2}: UCSF GM0981: UCSF \d{2}\d{4}: UCSF M4100: UCSF

If not listed above leave LaboratorySource blank

Please also make the following corrections to the data: M4100 country is: South Korea, setting is: isolated in SF lineage 3A, city is NULL

If there is a reference eg. agerton 1997 JAMA; Maus, AAC; Sacchettini, Ioerger BMC Genomics 2010 etc in the current course field please add this to the setting field using comma as a separator between the current info and this new info, and then replace source with laboratorysource data as I detail above.

thank you!