Xinglab / rMATS-long

Other
23 stars 2 forks source link

About rmats_long.py #26

Open border-info-nt opened 1 week ago

border-info-nt commented 1 week ago

Hello,

I've completed the analysis with ESPRESSO and am now planning to proceed with rMATS-long analysis. However, I generated the .esp and updated.gtf files individually for each sample, so I’m unable to perform a multi-sample analysis. Is it acceptable to merge these files, or should I start the analysis from scratch?

For exmple

How can I output [gs689_1 gs689_2 gs689_3 pc3e_1 pc3e_2 pc3e_3] in espresso like this? Or should I merge the results output one sample at a time?

EricKutschera commented 1 week ago

Ideally the espresso run would include all of your samples and the .esp from espresso would then have all the sample names as columns. When you run espresso you can list the sample names along with the input files in the samples.tsv file described in the README: https://github.com/Xinglab/espresso/tree/v1.5.0?tab=readme-ov-file#basic-usage

It's not easy to merge the results from multiple espresso runs since the novel isoforms detected would have different IDs (ESPRESSO:chr1:2:3). Also there may be novel isoforms which have slightly different transcript start or end coordinates between runs but are otherwise the same