UCLOrengoGroup / cath-tools

Protein structure comparison tools such as SSAP and SNAP
http://cath-tools.readthedocs.io
GNU General Public License v3.0

Enable CRH to filter out hmmsearch output hits covering little of the HMM #41

Closed: tonyelewis closed this 7 years ago

tonyelewis commented 7 years ago

Jon requests a CRH feature to filter out hmmsearch output hits that cover little of the HMM.

This would apply only to input read from hmmsearch output files.

It would compare 100.0 * ( hmm_to + 1 - hmm_from ) / hmm_length against some threshold (where hmm_length comes from the [M=296] part of the file) and filter out hits that fail the threshold.
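As a rough illustration of that comparison (not the cath-tools implementation), here is a minimal Python sketch. The function names, the regex-based extraction of the [M=...] value, and the choice of >= for passing the threshold are all assumptions made for the example.

```python
import re

# Hypothetical helpers for illustration only; none of these names come from cath-tools.
_HMM_LENGTH_RE = re.compile(r"\[M=(\d+)\]")


def parse_hmm_length(line):
    """Return the HMM length from a line containing '[M=<n>]', or None if absent."""
    match = _HMM_LENGTH_RE.search(line)
    return int(match.group(1)) if match else None


def hmm_coverage_percent(hmm_from, hmm_to, hmm_length):
    """Percentage of the HMM covered by a hit, using inclusive 1-based positions."""
    return 100.0 * (hmm_to + 1 - hmm_from) / hmm_length


def passes_coverage(hmm_from, hmm_to, hmm_length, min_coverage_percent):
    """Assumption: a hit is kept when its coverage is at least the threshold."""
    return hmm_coverage_percent(hmm_from, hmm_to, hmm_length) >= min_coverage_percent
```

For example, with an HMM of length 296, a hit spanning HMM positions 10 to 83 covers 74 of the 296 positions, so it would be filtered out by a (hypothetical) 50% threshold.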

Jon has given me an example data file (hslu.hmmsearch) and ID (dc_4dc3ac2c5e4a70703d5d9cf0ba2f0ac9) that I can try to use to build the feature and that I can then add as a test case.

tonyelewis commented 7 years ago

Jon and I have discussed this a bit further. We propose:

tonyelewis commented 7 years ago

This has been done in 62fc29fc1fad5344f3365a40472bcb88aead8be2 (with extra tests added in d92ad056734d3ffbbc68b9b81ccf0611dd87bdd7).

Thanks to 93c9b8e31143e57a5d4ded75b3ea7d02c85a3134, the hidden options can now be seen using the (hidden) option --hidden-help.