Closed guillermomarco closed 8 years ago
Hi,
You actually have to run pfam and provide the domain table output file as the parameter value to --retain_pfam_hits. The documentation from transdecoder.github.io is:
Search the peptides for protein domains using Pfam. This requires hmmer3 and Pfam databases to be installed.
hmmscan --cpu 8 --domtblout pfam.domtblout /path/to/Pfam-A.hmm transdecoder_dir/longest_orfs.pep Just as with the blast search, if you have access to a computing grid, consider using HPC GridRunner.
Integrating the Blast and Pfam search results into coding region selection The outputs generated above can be leveraged by TransDecoder to ensure that those peptides with blast hits or domain hits are retained in the set of reported likely coding regions. Run TransDecoder.Predict like so:
TransDecoder.Predict -t target_transcripts.fasta --retain_pfam_hits pfam.domtblout --retain_blastp_hits blastp.outfmt6 The final coding region predictions will now include both those regions that have sequence characteristics consistent with coding regions in addition to those that have demonstrated blast homology or pfam domain content.
Hi I'm trying to use PFAM
--retain_pfam_hits
option with TransDecoder.Predict I'm getting no error or results at all regarding PFAM.I've downloaded both
Pfam-A.hmm.gz
andPfam-B.hmm.gz
and joined them using the following commands (found them in an old SourceForge post):I've tried passing as input both
Pfam-AB.hmm
orPfam-AB.hmm.bin
and none of them working. I'm getting no errors nor PFAM in results.