urol-e5 / timeseries_molecular


Align RNA-seq time series data (Apul) #1

Closed sr320 closed 1 month ago

sr320 commented 1 month ago

Take the time series RNA-seq data from A. pulchra and align it to the "new" genome using HISAT2.

Genome: https://gannet.fish.washington.edu/seashell/bu-github/deep-dive-expression/D-Apul/data/Apulcra-genome.fa

Genome index (.fai): https://gannet.fish.washington.edu/seashell/bu-github/deep-dive-expression/D-Apul/data/Apulcra-genome.fa.fai

Gff: https://gannet.fish.washington.edu/seashell/bu-github/deep-dive-expression/D-Apul/data/Apulcra-genome.gff
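For reference, a minimal sketch of what the requested alignment could look like for one sample, written as a Python helper that only assembles the command lines (it does not run them). The sample file names, the index base name, paired-end input, and the thread count are assumptions for illustration; only the genome file name comes from the link above. `--dta` is the HISAT2 option that tailors alignments for downstream transcript assemblers.

```python
import shlex
from pathlib import Path


def hisat2_commands(genome_fasta, index_base, r1, r2, out_bam, threads=4):
    """Return shell command strings for: index build, alignment, sort, BAM index.

    All file names passed in are placeholders; paired-end reads are assumed.
    """
    # Intermediate SAM sits next to the final BAM (x.sorted.bam -> x.sorted.sam)
    sam = str(Path(out_bam).with_suffix(".sam"))
    commands = [
        # Build the HISAT2 index from the genome FASTA
        ["hisat2-build", "-p", str(threads), genome_fasta, index_base],
        # Align paired reads; --dta reports alignments suited to transcript assemblers
        ["hisat2", "-p", str(threads), "--dta", "-x", index_base,
         "-1", r1, "-2", r2, "-S", sam],
        # Coordinate-sort the alignments and write a BAM
        ["samtools", "sort", "-@", str(threads), "-o", out_bam, sam],
        # Index the sorted BAM
        ["samtools", "index", out_bam],
    ]
    return [shlex.join(c) for c in commands]


# Hypothetical sample name, for illustration only
for cmd in hisat2_commands(
    "Apulcra-genome.fa", "Apulcra-genome",
    "sampleA_R1.fastq.gz", "sampleA_R2.fastq.gz", "sampleA.sorted.bam",
):
    print(cmd)
```

The index only needs to be built once; the remaining three commands would be repeated per sample.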

sr320 commented 1 month ago

Note: A. pulchra is ACR in https://github.com/urol-e5/timeseries_molecular/blob/main/data/rna_metadata.csv (colony ID).
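A rough sketch of pulling just the A. pulchra samples out of the metadata by colony ID. The column names (`SampleID`, `ColonyID`) and the example rows are hypothetical; check them against the actual header of rna_metadata.csv before using this.

```python
import csv
import io


def apul_rows(csv_text, colony_column="ColonyID", prefix="ACR"):
    """Return metadata rows whose colony ID starts with the given prefix.

    colony_column is an assumed column name, not taken from the real file.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader if row[colony_column].strip().startswith(prefix)]


# Made-up rows standing in for the real metadata file
example = """SampleID,ColonyID
s1,ACR-1
s2,POC-2
s3,ACR-3
"""

for row in apul_rows(example):
    print(row["SampleID"], row["ColonyID"])
```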

kubu4 commented 1 month ago

Which repo should code go in? deep-dive-expression or timeseries_molecular?

sr320 commented 1 month ago

timeseries_molecular - the repo this issue is in - thanks!

kubu4 commented 1 month ago

Also, is this repo going to be multi-species, like deep-dive? I'm asking so that we can get things organized before we start adding too many files to the existing directory structure.

sr320 commented 1 month ago

yes will be multi species.

kubu4 commented 1 month ago

Alignments complete.

Full output from alignments (excluding BAMs) is here:

https://github.com/urol-e5/timeseries_molecular/tree/main/D-Apul/output/02.20-D-Apul-RNAseq-alignment-HiSat2

Will be adding/updating README later today.

There's a subdirectory for each sample, containing individual alignments and alignment info, as well as Ballgown-formatted table files.
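For anyone wondering where Ballgown-formatted tables typically come from: StringTie writes them (`*.ctab` files) when run with `-B`, placing them in the same directory as the output GTF. The sketch below only assembles such a command for one sample directory; the sample name, file names, and thread count are placeholders, and this is not necessarily the exact command used for this run.

```python
import shlex
from pathlib import Path


def stringtie_command(bam, gff, sample_dir, threads=4):
    """Build a StringTie command that writes a GTF and Ballgown tables into sample_dir."""
    sample_dir = Path(sample_dir)
    # Name the GTF after the sample directory (an illustrative convention)
    out_gtf = sample_dir / (sample_dir.name + ".gtf")
    return shlex.join([
        "stringtie", bam,
        "-p", str(threads),
        "-e",            # estimate expression only for transcripts in the reference annotation
        "-B",            # write Ballgown table files (*.ctab) next to the output GTF
        "-G", gff,       # reference annotation
        "-o", str(out_gtf),
    ])


# Hypothetical sample, for illustration only
print(stringtie_command("sampleA/sampleA.sorted.bam", "Apulcra-genome.gff", "sampleA"))
```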

MultiQC alignment report (GitHub):

https://github.com/urol-e5/timeseries_molecular/blob/main/D-Apul/output/02.20-D-Apul-RNAseq-alignment-HiSat2/multiqc_report.html

DESeq2 count matrices (raw):

StringTie GTF (raw):

Apulchra-genome.stringtie.gtf

Currently rsync-ing the repo so the BAMs are accessible. Will report back when that's complete.

kubu4 commented 1 month ago

rsync-ing complete.

All files available here:

https://gannet.fish.washington.edu/Atumefaciens/gitrepos/urol-e5/timeseries_molecular/D-Apul/output/02.20-D-Apul-RNAseq-alignment-HiSat2/