mhammell-laboratory / TElocal

A package for quantifying transposable elements at a locus level for RNAseq datasets.
GNU General Public License v3.0
21 stars 8 forks source link

TElocal index #18

Closed olivertam closed 3 months ago

olivertam commented 2 years ago

Dear Oliver Tam,

Thank you for providing very useful software. TEtranscripts and TElocal are the same, but, in order to run TElocal, we have to provide, indexed GTF files.

Could you show me some example how to generate this file, I do not have any ideas about this.

I can also use TEcount instead of TEtranscripts, and generate only count table alone that can be used for differential analysis.

the only difference is that TEtrnscripts will peroform differentiall analsis,

TElocal use indexed GTF file.

with regards

Ramky

image

Originally posted by @Ramkyeri in https://github.com/mhammell-laboratory/TEtranscripts/issues/33#issuecomment-987579062

olivertam commented 2 years ago

Hi

You can generate the pre-built index using the TE GTF that was previously created (the first one). As mentioned previously, you make the prebuilt index using the TElocal_indexer script (requires TElocal to be installed). The command line arguments are as follows:

$ TElocal_indexer --afile TE.gtf --itype TE

It is advisable to provide around 30Gb of RAM for index building. Like your previous TEtranscripts run, the index building will take a long time (~30 hours, if I recall correctly), but once completed, TElocal will just load the index for its run, which will substantially improve run-time.

Just a note: TElocal functions similarly to TEcount, in that it would not perform the differential analysis, but generate the count tables that you can then combine into a single count table to run your differential analysis. Please also note that TEtranscripts requires DESeq2 installed in order to perform differential analysis.

Please let us know if you have other questions.

Thanks

Ramkyeri commented 2 years ago

Dear Oliver Tam,

Thank you so much for your time and for showing me an example.

Yes, you are correct, it 30 hours to build an index. Thank you for remembering.

with regards

Ramky

Ramkyeri commented 2 years ago

Dear Oliver Tam,

while running the TElocal_indexer --afile TE.gtf --itype TE , it got the error, I have also installed TElocal .

image

with regards

Ramky

olivertam commented 2 years ago

Hi Ramky,

You will need to first make the script an executable:

$ chmod a+x TElocal_indexer

You can then run it from the directory where the script is located:

$ ./TElocal_indexer --afile TE.gtf --itype TE

Or copy the script to a folder containing your other binaries/executabes (e.g. where you TElocal is installed). This will then allow you run it from anywhere on your system

$ TElocal_indexer --afile TE.gtf --itype TE

Thanks.

Ramkyeri commented 2 years ago

Dear Oliver Tam,

Thank you for sharing the script.

with regards

Ramky

Ramkyeri commented 2 years ago

Dear Oliver Tam,

I do not why it still shows the error, I executed the file, using py extension.

image

With regards

Ramky

Ramkyeri commented 2 years ago

Dear Oliver Tam,

I think it is correct, I used this ./TElocal_indexer.py --afile TE.gtf --itype TE (.py)

image

with regards

Ramky

olivertam commented 2 years ago

Hi Ramky,

Yes, it appears the script was saved with the .py extension, and thus your solution is correct.

Thanks

Ramkyeri commented 2 years ago

Dear Oliver Tam,

Thank you for confirming this.

with regards Ramky

Ramkyeri commented 2 years ago

Dear Oliver Tam,

this is just for the update, TE indexing is completed, it took more than 50 hours. The output also says that this can be used only in TElocal.

TElocal do not require whether it is a control sample or treatment sample,

Can we run multiple BAM files, it shows error to me.

with regards

image

image

olivertam commented 2 years ago

Hi Ramky,

As previously mentioned, TElocal operates like TEcount, and takes only one sample at a time (agnostic to whether treatment or control). You will need to combine the outputs of various TElocal runs to perform your own differential analysis.

Thanks.

Ramkyeri commented 2 years ago

Dear Oliver Tam,

many thanks for your reply.

with regards

Ramky

Ramkyeri commented 2 years ago

Dear Oliver Tam,

Greetings! I would like to update the analysis. I used TE2.gtf fod indexing. it did not break the program in both TElocal_indexer.py and TElocal. with regards Ramky

image

olivertam commented 2 years ago

Thank you for the update.

PinpinSui commented 1 year ago

Hi Oliver Tam,

I install TElocal with the commond pip install TElocal in conda env. Would you like to tell me where can I find TElocal_indexer? Thank you,

Pinpin

olivertam commented 1 year ago

Hi Pinpin,

The TElocal_indexer is still in beta, hence we didn't include with TElocal. You can obtain it here

Thanks.

ChiaraGTH commented 1 year ago

Hi Oliver Tam, would it be possible to have a library where the .locInd files, of the various species, can be downloaded directly? The ones that are there are not up to date, and building index files takes a long time. Thanks.

olivertam commented 1 year ago

Hi,

Thank you for your interest in the software Currently, the .locInd index has to be built separately for each genome version. If you have a list of genomes that you would like us to build, let us know and we can try to help.

Thanks.

pabisk commented 11 months ago

Could you add a progress indicator to the indexing script? Since it can take multiple days to run it would be good to know whether it is working or getting stuck and how much time is left until it finishes.

olivertam commented 11 months ago

Hi,

We are currently in the process of improving the TElocal software to significantly speed up the TE index building process, and thus we are envisioning that the current indexing script will be deprecated. Therefore, we have no current plans to update the indexing script.

Thanks.

watamine commented 11 months ago

Dear Oliver Tam,

Thank you for your software. I installed TElocal and used TElocal_indexer. I started 7/20 but now 7/25 building index doesn't finish.

image

Why is it happened ? I used TEtranscripts index file for --afile TE.gtf

I would greatly appreciate it if you could let me know

watamine

olivertam commented 11 months ago

Hi watamine,

Thank you for your interest in the software. Depending on the size of the TE GTF file, it can take quite a while to build that index (days). If it is still not done after a week, please let us know and we can try to troubleshoot.

Apologies.

watamine commented 11 months ago

Hi Oliver Tam,

I see, Thank you. I am going to wait for it.

I will contact you if anything happens

Thanks.

watamine commented 11 months ago

Hi Oliver Tam,

image

TElocal_indexer finished ! Thank you for your advice.

Thanks.