Closed doctormo closed 7 years ago
Ok can you please send me the name, source and setting field info for each of these.
I think we will need to rename the "Source" field to a field called "LaboratorySource", and move all the RIVM/MSLI/CDC etc "setting" data from gtbdr to this laboratory field.
For the wgsmtb data, erase the source data as it is (current a researcher's name) and replace with the following info for each line of data:
Name: LaboratorySource
borowski\d: UCSF
CDC-\d+: CDC
\d\d-R\d+: MSLI
H37Ra: NCBI
K-\d: FZB
X122: Stellenbosch
R1207: Stellenbosch
R\d[3]\w?: Stellenbosch
w-148: PHRI/NCBI
c: PHRI/NCBI
M\d{2-3}\w?: Stellenbosch
haarlem: PHRI/NCBI
MT00\d\d: BCCDC
HN878: NCBI
MTB210: NCBI
K\d{2}\d?: RIVM
\d{4-5}_\d{2}: FZB
T\d{2}: UCSF
GM0981: UCSF
\d{2}\d{4}: UCSF
M4100: UCSF
If not listed above leave LaboratorySource blank
Please also make the following corrections to the data: M4100 country is: South Korea, setting is: isolated in SF lineage 3A, city is NULL
If there is a reference eg. agerton 1997 JAMA; Maus, AAC; Sacchettini, Ioerger BMC Genomics 2010 etc in the current course field please add this to the setting field using comma as a separator between the current info and this new info, and then replace source with laboratorysource data as I detail above.
thank you!
These items are in the settings column and need translating.