WGLab / LIQA

Long-read Isoform Quantification and Analysis
Other
37 stars 12 forks source link

reproduce figure 6 in your manuscript #27

Closed HaojiaWu closed 1 year ago

HaojiaWu commented 1 year ago

Hi team, Excellent work. I want to reproduce the figure 6 in your manuscript: https://genomebiology.biomedcentral.com/articles/10.1186/s13059-021-02399-8#Sec11 Would you provide more detail how to generate the figure 6? More specifically:

  1. How did you run LIQA on short-read Illumina data?
  2. How did you get the stats for Figure 6A?
  3. What software/tool you would recommend to generate Figure 6B? I want to generate the same graph using my own short-read and long-read data. Many thanks in advance!
huyustats commented 1 year ago

Hi,

Thanks for your interesting in using LIQA.

I used PennSeq and Penndiff to analyze short reads. The stats of DE or non-DE were calculated based on p-values at 0.05 significance level. I used IGV plot to generate the Figure. Thanks!

HaojiaWu commented 1 year ago

Thanks that's very helpful! I find that the quantify step for long read data (Nanopore) is very slow. I have a bam file with 11GB in size and it ran about 24 hours without completion. Do you have any suggestions for how to improve the run time? Here is the command line I used.

liqa -task quantify -refgene GRCh38.refgene -bam bp1_sort.bam -out bp1_isoform_expression_estimates -max_distance 10 -f_weight 1

I like your tool a lot. Thanks for developing!

huyustats commented 1 year ago

Hi, One reason might be the EM algorithm or KM estimator for each isoform. We are under updating this step (debugging gene-missing problem etc.) and thanks for your feedback. Thanks!

HaojiaWu commented 1 year ago

Thanks! look forward to the update. I am thinking if I should split the bam file into 50 chunks (my server has 56 cores) and run quantify on each chunk in parallel to speed up the analysis. Then I will merge the results from each chunk after done. Do you think it will affect the outcome?

huyustats commented 1 year ago

It should not affect the outcome. Thank you!