topdownproteomics / sdk

Software solution for common top-down proteomics tasks
http://www.topdownproteomics.org/
MIT License
9 stars, 4 forks

UniProt XML, FASTA parsers? #107

Open acesnik opened 1 year ago

acesnik commented 1 year ago

Would it be within scope to add parsers for UniProt XML / FASTA here?

rfellers commented 1 year ago

I'd be up for that. We have them internally, but the code isn't great and could use a refresh. Does mzLib have these parsers?

acesnik commented 1 year ago

mzLib has some. They're not perfect either (for example, they don't create protein isoforms from XML), but they might be a place to start.

https://github.com/smith-chem-wisc/mzLib/blob/master/mzLib/UsefulProteomicsDatabases/ProteinDbLoader.cs