sr320 / paper-pano-go

Draft manuscript describing Panopea gonad transcriptome
2 stars 7 forks source link

Compile list of genes and gene sets to look for in transcriptome #8

Closed sr320 closed 8 years ago

sr320 commented 9 years ago

We talked about looking for important genes in the transcriptome.

There are several sources for these. Maybe we she compile these in a central location?

Possibly in this thread or create a wiki page..

mdelrio1 commented 9 years ago

I'll work on the annotation and look for some important genes too.

sr320 commented 9 years ago

This is a useful resource http://figshare.com/articles/Crassostrea_gigas_male_gonad_transcriptional_data_comparison/1004464

It includes fasta file that allows (using blast) to identify genes used in Dheilly et al study "Gametogenesis in the Pacific Oyster Crassostrea gigas: A Microarrays-Based Analysis Identifies Sex and Stage Specific Genes" - also includes corresponding table for sex-specific genes and those differentially expressed through gametogenesis.

mdelrio1 commented 9 years ago

I have downloaded the files and be able to reproduce the box plot. I'm checking the fasta file.

sr320 commented 9 years ago

@brentners Can you provide papers and / or genes you went over in your presentation we should look for?

brentners commented 9 years ago

pmarg-dmrt

pmarg-fem1-like

pmarg-foxl2 CgDsx CgSoxH

CgFoxL2

/Crassostrea hongkongensis /transcriptome

–Tong et al. 2015

/Pinctada margaritifera /gonad transcriptome

–Teaniniuraitemoana et al. 2014, 2015

/Crassostrea gigas /gonad transcriptome

–Zhang et al. 2014

On 11/11/15 6:30 AM, Steven Roberts wrote:

@brentners https://github.com/brentners Can you provide papers and / or genes you went over in your presentation we should look for?

— Reply to this email directly or view it on GitHub https://github.com/sr320/paper-pano-go/issues/8#issuecomment-155797533.

Brent Vadopalas Principal Research Scientist University of Washington School of Aquatic & Fishery Sciences 1122 NE Boat St. Seattle WA 98105 206.685.4689

lacroix54 commented 9 years ago

Hi @sr320 ,

I could not find those genes in the file Geoduck-transcriptome-v2-GO-Slim.csv my search was with the next line (for example):

carmen@carmen-pc:~/Desktop/Geoduck$ grep -i "CgFoxL2" Geoduck-transcriptome-v2-GO-Slim.csv |head |wc 0 0 0

sr320 commented 9 years ago

Curious how you looked for them. Maybe the were not top hits but possibly still homologs are present? One way to search would be to get fasta files and blast against Geoduck database

On Wed, Nov 11, 2015 at 11:42 AM lacroix54 notifications@github.com wrote:

Hi @sr320 https://github.com/sr320 ,

I could not find those genes in the file Geoduck-transcriptome-v2-GO-Slim.csv

— Reply to this email directly or view it on GitHub https://github.com/sr320/paper-pano-go/issues/8#issuecomment-155889759.

Steven Roberts http://faculty.washington.edu/sr320/

lacroix54 commented 9 years ago

@sr320 I'm on it...

mdelrio1 commented 9 years ago

@sr320 @lacroix54 Steven, would it be possible that we can't find those genes as GO, because they are not reviewed in the uniprot database for instance the Cg-Dsx (Doublesex protein) is in the database but it says it is Unreviewed. http://www.uniprot.org/uniprot/A0A0D3LXK1 I agree that getting the fasta files and blast against Geoduck database, but the Pmag genes are not alone in the GeneBank (as nucleotide alone), we need to check in the sequences from the papers. A different approach may be checking the contigs with "sex" GO, as you searched in class, and from those check the annotation and see which may be closer to the genes Brent proposed.

sr320 commented 9 years ago

@mdelrio1 absolutely

I suggest we do this from both "directions" you mention

1) Simply use GO terms to identify subset of interesting genes and 2) Go into papers and get fasta files.

mdelrio1 commented 9 years ago

@sr320 I have written how to use the excel in order to get the reproduction genes https://github.com/mdelrio1/mdelrio-panopea1/blob/master/notes/panopea_analysis_of_goslim_without_bacteria_using_excel.md Is this what you asked me to do? In case you need more info, let me know.

mdelrio1 commented 9 years ago

The excel file is in https://github.com/mdelrio1/mdelrio-panopea1/blob/master/data/Geoduck-transcriptome-v2-GO-SlimAnalisis.xlsx