jzook / testHG008curation

This repository is primarily for the Genome in a Bottle Consortium to curate structural variants in HG008
https://www.nist.gov/programs-projects/cancer-genome-bottle
1 stars 0 forks source link

[chr12:16630818-16630993][INS][SVLEN=175bp] #236

Open jzook opened 1 month ago

jzook commented 1 month ago

chr12 16630818 Minda_24 N <INS> . PASS SVLEN=175;SVTYPE=INS;SUPP_VEC=ONT_severus_INS11657,ONT_Sniffles2.INS.D0MB,ONT_i_63,ONT_ID_20376_1,PB_severus_INS10178,PB_Sniffles2.INS.C5MB,PB_i_149

https://v2.genomeribbon.com/?session=https://42basepairs.com/download/s3/giab-data/ribbon-json/ribbon-hg008-cov-bedpe.json&locus=chr12:16630818#ribbon

Faizal-Eeman commented 1 month ago
Screenshot 2024-10-14 at 11 19 39 AM Screenshot 2024-10-13 at 11 42 54 PM
Faizal-Eeman commented 1 month ago
Screenshot 2024-10-11 at 6 32 54 PM Screenshot 2024-10-11 at 6 38 56 PM
Faizal-Eeman commented 1 month ago
Screenshot 2024-10-14 at 7 58 40 PM
jo-mc commented 1 month ago

Hi, nice work showing all the data.

If we look at the blat results, we need to consider the size field along with %. Here the top two matches are the only significant as they match ~148bp which is close to the length of the insert less the poly A tail. The remainder match only ~20-30bp. There are some characteristics of line-1 insertion due to target site duplication "AAAGATCAT" and poly -A tail. Also it is interesting that this is the third insert from the same region being intron 3 of Long non coding RNA, LINC01500.

Faizal-Eeman commented 1 month ago

If we look at the blat results, we need to consider the size field along with %. Here the top two matches are the only significant as they match ~148bp which is close to the length of the insert less the poly A tail. The remainder match only ~20-30bp.

Thanks @jo-mc I missed the size match. Also, replaced the previous image which showed the full BLAT results with just the top segment of the results.