Open ptranvan opened 2 months ago
Hi,
Thank you for point this out. We must have altered the T2T GTF file in TEtranscripts recently (as it probably evolved since the initial release).
I would definitely work with T2T_CHM13_v2_rmsk_TE.gtf.locInd.locations
file then, since that was generated concurrently with the prebuilt index.
Thanks.
We are currently in the process of rebuilding the TElocal index (will probably take days), so if you want to use one that is consistent with the TEtranscripts GTF, we can let you know when this is done. Otherwise, the genomic positions should largely match (even if the names do not).
Thanks.
Hi,
Thanks for quick response. I am interested in specific regions and I have extracted TEs from the gtf using bedtools. Therefore I will wait for the new index so it's easier to get back to the data
Hi,
Are the .gtf and the prebuilt_indices are somehow related ?
I download both and I got different annotations:
https://labshare.cshl.edu/shares/mhammelllab/www-data/TEtranscripts/TE_GTF/T2T-CHM13v2_rmsk_TE.gtf.gz
https://labshare.cshl.edu/shares/mhammelllab/www-data/TElocal/annotation_tables/T2T_CHM13_v2_rmsk_TE.gtf.locInd.locations.gz
In the .gtf, 'AluYh7_dup533' is in chr3 and in the prebuilt_indices, it's in chr7.
Also, 'L1HS_dup1655' seems to be present in the gtf but not in the prebuilt_indices: