fhcrc / deenurp

16S rRNA gene sequence curation and phylogenetic reference set creation
GNU General Public License v3.0
4 stars 3 forks source link

pubmed_ids in seq_info.csv file #23

Closed crosenth closed 9 years ago

crosenth commented 9 years ago

Would like a way to know if Genbank records have pubmed_ids or not. Since multiple pubmed_ids can exist per record I will create two files that can tie back to the seq_info.csv file, genbank.csv and references.csv. Genbank.csv contains most of the attributes parsed by BioPython into csv format and references.csv will contain the multiple references associated with each Genbank record.

crosenth commented 9 years ago

See subcommand gb2csv.py