genome-nexus / genome-nexus-annotation-pipeline

Library and tool for annotating MAF files using Genome Nexus Webserver API
MIT License

Account for Genome Nexus Timeout issue - check if all the variants in the MAF are annotated #206

Open rmadupuri opened 2 years ago

rmadupuri commented 2 years ago

Issue: Sometimes when annotating a large MAF, some variants do not get a protein change, or the annotation request times out. Re-running the annotator on those failed records produces the amino acid change.

Solution: It would be nice to add a flag to the annotate subcommand, or to make this the default behaviour, so that if not all records are successfully annotated on the first attempt, the annotator is re-run on the remaining unannotated records until a further attempt produces no newly annotated records.

Logic:

Refer to the Python implementation here for more details: https://github.com/cBioPortal/datahub-study-curation-tools/tree/master/GN-annotation-wrapper
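
For illustration, the retry behaviour proposed in the Solution could look roughly like the sketch below. This is not the code from the linked wrapper or from the pipeline itself. The names `annotate_until_stable`, `is_annotated` and `max_attempts` are hypothetical, `annotate` stands in for whatever actually calls the annotator (assumed to return one output record per input record, in the same order), and a record is assumed to count as annotated when its `HGVSp_Short` column is non-empty.

```python
from typing import Callable, Dict, List, Tuple

Record = Dict[str, str]  # one MAF row, keyed by column name


def is_annotated(record: Record) -> bool:
    # Assumption: a non-empty HGVSp_Short column means the record was annotated.
    return bool(record.get("HGVSp_Short", "").strip())


def annotate_until_stable(
    records: List[Record],
    annotate: Callable[[List[Record]], List[Record]],
    max_attempts: int = 10,  # hypothetical safety cap, not part of the proposal
) -> Tuple[List[Record], List[Record]]:
    """Re-run `annotate` on unannotated records until an attempt yields nothing new.

    Returns (all records in their original order, records still unannotated).
    """
    results = list(records)
    pending = list(range(len(results)))  # indices still waiting for an annotation
    attempts = 0
    while pending and attempts < max_attempts:
        attempts += 1
        output = annotate([results[i] for i in pending])
        still_pending = []
        for index, record in zip(pending, output):
            if is_annotated(record):
                results[index] = record
            else:
                still_pending.append(index)
        made_progress = len(still_pending) < len(pending)
        pending = still_pending
        if not made_progress:
            break  # this attempt produced no newly annotated records
    return results, [results[i] for i in pending]
```

Stopping on "no new annotated records" rather than on "everything annotated" matters because some variants may never receive a protein change, and without that condition the loop would not terminate for them.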