Closed SheaCheng2000 closed 1 year ago
Hi Shea,
According to the log file, you seemed to be using the FAVOR Full Database to annotate the GDS file. However, you should use the FAVOR Essential Database to annotate the GDS file in Step 2 of FAVORannotator.
In addition, it seems that some of the dimensions were not matched. For example, the genotype
field in your GDS file indicates there are 1,650,425 variants in your data, however the position
field indicates there are 1,611,114 variants in your data. These discrepancies should be fixed before running FAVORannotator.
Hope this helps.
Best, Xihao
Hi Xihao!
Thanks for your advice! I have turned to the essential DB now. And I think the inconsistency of genotype and position may be the QC problem. I will try to figure it out.
Shea
You're welcome, Shea.
Best, Xihao
Hi xihao!
Just a feedback ) The inconsistency of genotype and position was caused by the QC problem. I forgot to split multiallelic variants😢.
Thanks! Shea
Thanks so much for letting me know, Shea.
Best, Xihao
Hi,
Recently I was trying to use our in-house data in STAARpipeline for WGS burden test. When I ran the Annotate.R, the log showed errors like this:
However, it did not stop executing the job, and generated the output "Anno_chr1_STAARpipeline.csv".
I wondered how these errors happen (is it because of the VCF format?) and whether these errors would affect the final burden result.
Attached is the log file. slurm-59206.log
Looking forward to your reply! Thanks a lot!
Shea