Questions on Last-train in case of multiple samples

mcfrith / last-rna

MIT License

48 stars 6 forks source link

Hello!

I am analyzing multiple nanopore human WGS data.

I noticed that the training results are slightly different between samples.

I also tried merging FASTA from multiple samples and running last-train with the merged FASTA.

This also gave a slightly different result compared to the results from individual samples.

Q1 : Is it better to get the training results from merged FASTA than using the training results for individual samples for the matched samples?

Q2 : If the training result from merged FASTA is the better option, is there any saturation point where increasing the number of samples does not affect the training results significantly?

With regards Jinyoung

mcfrith / last-rna

Questions on Last-train in case of multiple samples #6