nextgenusfs / funannotate

Eukaryotic Genome Annotation Pipeline
http://funannotate.readthedocs.io
BSD 2-Clause "Simplified" License
301 stars 82 forks source link

Use several bam files for gene prediction #880

Open lalalagartija opened 1 year ago

lalalagartija commented 1 year ago

Hi, I wonder how to pass several bam files to provide evidence for gene prediction. I would like to avoid to use samtools merge not to create chimeras. I am running trinity first on each sample. I will then merge the outputs of trinity into one file.

nextgenusfs commented 1 year ago

The bamfile input for funannotate predict is only used to generate intron/exon hints files for augustus. It likely will not matter if merged or not.

nextgenusfs commented 1 year ago

The pipeline is setup in funannotate train to pass multiple pairs of Illumina files which get cleaned up and passed to Trinity. If you let funannotate run Trinity all the input files for predict will be generated for you.... https://funannotate.readthedocs.io/en/latest/tutorials.html#genome-assembly-and-rna-seq

lalalagartija commented 1 year ago

I prefer to control the mapping and the Trinity assembly (genome alignment based) before the prediction. Merging the final bam would be a good option after what you said. Thanks !