Open HLHsieh opened 3 months ago
Hi Hsin,
It is difficult to say what exactly happened from the log only, but I see that the repeat is very long, ~25k bp and the error denotes something about the size of an array. Can you please share the repeat information? I'll try to genotype a sample at this locus.
Best, Helia
Hi Helia,
I was trying to analyze the same repeat across several datasets, but only one dataset encountered an issue. Here is the repeat information:
chr4 190066141 190092504 3300 8 D4Z4
I have also attached the file that is causing the issue for your reference (https://buckeyemailosu-my.sharepoint.com/:f:/g/personal/hsieh_332_buckeyemail_osu_edu/EuD_CyvayNxNlKdmZGlDQdsBgQoBni_UxuZqVm93mzSIzg?e=D1YGAA).
Thank you for your assistance.
Many thanks, Hsin-Lun
Hi Hsin,
I apologize for the late reply. Would that be possible for you to upload the bam file with reads aligning to this region only? current bam files are a bit large to download.
Best, Helia
Hi Helia,
I apologize for missing your reply. I have uploaded the requested file: https://buckeyemailosu-my.sharepoint.com/:f:/g/personal/hsieh_332_buckeyemail_osu_edu/EuD_CyvayNxNlKdmZGlDQdsBgQoBni_UxuZqVm93mzSIzg?e=KJbvMZ
Additionally, regarding the TR region BED file, I have a few questions. Should the NUM_COPIES be an integer? Also, how should I consider a TR motif? For instance, I want to test a VNTR with 4 copies, but the motif lengths are not exactly the same in the reference genome, ranging from 46 to 50 bp due to variants.
Thank you, Hsin
Hi Hsin,
Sorry for the delayed reply. I will look into the files asap.
For the region BED, NUM_COPIES doesn't need to be an integer. The TR motif sequence doesn't affect the final output in normal setting.
Best, Helia
Hi Helia,
Thank you again for your suggestions. I applied LongTR to twenty sets of data, and only one set encountered the following issue:
I have no idea how to fix the error. Any suggestions would be appreciated. PS, I still use the previous version.
Best, Hsin