mhammell-laboratory / TEsmall

A pipeline for profiling TE-derived small RNAs
GNU General Public License v3.0
6 stars 5 forks source link

How to obtain all annotation files and sequence files (tDNA) of a custom genome? #18

Open LIUXING-bio opened 11 months ago

olivertam commented 11 months ago

Hi,

Thank you for your interest in the software.

There are several files that you would need to generate, and organized in a folder structure similar to that found here

The `annotation subfolder should contain:

The `sequence subfolder should contain:

All of this should be in a folder named after your custom genome build (e.g. for human T2T build, we called the folder T2Tv2), which should be able to be called from TEsmall (as of version 2.0.5) using the custom genome name (as long as it's located in the genomes subfolder of the folder indicated by --dbfolder.

I understand that this is a lot of information, and we could provide some help with your custom genome. However, we can't guarantee how easy/hard it is given the varying style of annotations.

Please don't hesitate to reach out if you encounter major issues.

Thanks.