bhattlab / SmORFinder

A command line tool to identify and annotate small proteins in microbial sequencing datasets.
MIT License
14 stars 4 forks source link

smorfam link to Sberro et al. 2019 #10

Open bsiranosian opened 2 years ago

bsiranosian commented 2 years ago

The "smorfam" column of the output tsv file contains names like "smorfam00946"

How do these families compare to the data in Sberro et al. 2019? Looking at Table S3 from that paper, the first column contains a "family ID" that does not map to the smORFfinder output.

It's also not just the integer order, since the length of the sequences in Table S3 and the hmm file are different.

durrantmm commented 2 years ago

Hi Ben. There's no direct mapping between them. It's just for your own reference, and can be used to look up families on DBSmORF. I'd recommend searching the Sberro et al. sequences using the HMMs if you'd like to map them to each other.