monarch-initiative / dipper

Data Ingestion Pipeline for Monarch
https://dipper.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
57 stars 26 forks source link

add worm strains from cgc #168

Open nlwashington opened 9 years ago

nlwashington commented 9 years ago

here's the list of strains: http://www.cbs.umn.edu/sites/default/files/public/files/celelist2.txt

there's the elementary pulling in the strains. but we'll want to also parse them into their constituent parts.

we've taken a stab at a parser in perl (cgc_strain_parser.pl)...could be checked in here, but we might consider rewriting this in python, if it makes it easier.

nlwashington commented 9 years ago

also note that the cgc is here: http://www.cbs.umn.edu/research/resources/cgc/strains which links to this: http://www.cbs.umn.edu/sites/default/files/public/downloads/elegans_list.txt this is the current one.