steineggerlab / ufcg

UFCG: Universal Fungal Core Genes
https://ufcg.steineggerlab.com
GNU General Public License v3.0
29 stars 0 forks source link

[Done] Add sensitivity option for `profile` module #19

Closed endixk closed 11 months ago

endixk commented 1 year ago

Next update will include a new feature for the profile module that allows for the customization of profiling sensitivity.

Current algorithm validates the AUGUSTUS search result using 3-step fallback search, which uses 80% coverage first, 50% coverage upon failure, and 0% coverage (e-value only) upon failure.

A sensitivity option will be included that can tweak with this algorithm. In detail:

Lowering sensitivity will significantly reduce the amount of false positive gene calls from the result, however, will become prone to miss some true positive genes as a trade-off. (e.g. failing to call genes with subtle evidences)

BrigidaGallone commented 1 year ago

Hi Daniel,

Great to hear and I'm looking forward to testing it. I am actually dealing with a percentage of hits (on average ~8%) wrongly assigned as "Duplicated". Is the copy-level assignment currently based on this 3-step search?

endixk commented 1 year ago

Hey Brigida,

Unfortunately adjusting the sensitivity won't affect the number of copies, since it takes next step only if there was no hit found (i.e. lowering sensitivity won't change duplicated calls to singular calls).

Instead, we are also planning to set different coverage thresholds for each gene with empirical values, which will improve the copy-level assignment. It will be super helpful if you could share more details around these false duplicate assignments (e.g. which genes or genomes are more susceptible).

BrigidaGallone commented 1 year ago

Thanks for your reply and sure, let me put together a proper test case and I will get back to you.

endixk commented 11 months ago

This feature was implemented at release v1.0.5.