sr320 / ceabigr

Workshop on genomic data integration with an emphasis on epigenetic data (FHL 2022)

Function enrichment for change max # transcript & predominant isoform #94

Closed sr320 closed 4 months ago

sr320 commented 4 months ago

isoform done.......

still need list of genes in males and females where there was a change in max number of transcripts

kubu4 commented 4 months ago

Females vs. Males:

https://github.com/sr320/ceabigr/blob/main/output/34-transcript-counts/diffs.max.transcripts_per_gene.females.vs.males.csv

Females Controls vs. Exposed:

https://github.com/sr320/ceabigr/blob/main/output/34-transcript-counts/diffs.max.transcripts_per_gene.controls_females.vs.exposed_females.csv

Males Controls vs. Exposed:

https://github.com/sr320/ceabigr/blob/main/output/34-transcript-counts/diffs.max.transcripts_per_gene.controls_males.vs.exposed_males.csv

These aren't the "final" lists. These still require filtering.

Here's the filtering method (genes are rows):

So, for the first file linked above (diffs.max.transcripts_per_gene.females.vs.males.csv), here's how the filtering criteria work:

I can happily do the filtering and produce just the desired list(s). Just let me know.

EDITED: Fixed typo identified in this comment

sr320 commented 4 months ago

At the end we want one table for males, one for females

x        Control MaxT #   Exposed MaxT #   Control Predom Isoform   Exposed Predom Isoform
Gene 1
Gene 2
Gene 3

kubu4 commented 4 months ago

Yep, working on the combined table, per #93 .

sr320 commented 4 months ago

If Column 3 < 0, then males have higher number of max transcripts for a given gene. If Column 3 < 0, then females have a higher number of max transcripts for a given gene.

????

Seems same?

kubu4 commented 4 months ago

Seems same?

Yep, that's a typo.

Should be: If Column 3 > 0, then females have a higher number of max transcripts for a given gene.

Will edit previous post.

kubu4 commented 4 months ago

But, basically, using the filename as a guide, it's always LEFT minus RIGHT. E.g.

diffs.max.transcripts_per_gene.females.vs.males.csv

This is comparing females.vs.males. So, the calculation is done FEMALES minus MALES.

Females will be Column 1 and males will be Column 2 in that file.
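
A minimal sketch of that LEFT minus RIGHT sign convention in Python, for illustration only (this is not the code used in this repo). The header names (`gene`, `females`, `males`, `difference`) and the three example rows are hypothetical; the only things taken from this thread are that genes are rows and that the difference column is FEMALES minus MALES.

```python
import csv
import io

# Hypothetical rows; the real file's header names and values may differ.
# Assumed layout: gene ID, females max transcripts, males max transcripts,
# difference (females minus males, i.e. LEFT minus RIGHT in the filename).
EXAMPLE_CSV = """gene,females,males,difference
gene_A,3,1,2
gene_B,1,4,-3
gene_C,2,2,0
"""


def split_by_direction(csv_text):
    """Group gene IDs by the sign of the LEFT-minus-RIGHT difference."""
    higher_left, higher_right, unchanged = [], [], []
    for row in csv.DictReader(io.StringIO(csv_text)):
        diff = float(row["difference"])
        if diff > 0:
            # Positive difference: LEFT group (females here) is higher.
            higher_left.append(row["gene"])
        elif diff < 0:
            # Negative difference: RIGHT group (males here) is higher.
            higher_right.append(row["gene"])
        else:
            unchanged.append(row["gene"])
    return higher_left, higher_right, unchanged


females_higher, males_higher, no_change = split_by_direction(EXAMPLE_CSV)
print("Females higher:", females_higher)
print("Males higher:", males_higher)
print("No change:", no_change)
```

The same function would apply to the controls vs. exposed files, with "left" meaning controls and "right" meaning exposed, as long as the difference column follows the same convention.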

sr320 commented 4 months ago

done. enrichment added to text for max transcripts (done regardless of direction).

yaaminiv commented 4 months ago

@sr320 where's the code for the enrichment?

yaaminiv commented 4 months ago

Did functional enrichment with topGO for predominant isoforms.

Next step is examining methylation changes for genes with sig BPGO and repeating similar analysis for change in max transcripts

yaaminiv commented 4 months ago

Did functional enrichment with topGO for changes in the maximum transcript expressed.

Females