Closed sr320 closed 7 years ago
Reminder: I have transcriptomes of male and female O. lurida gonads from an ocean acidification experiment.
My first three (baby) steps for my class project:
You might also be interested in another Porites transcriptome while the genome assembly is running. It is a holobiont transcriptome, so may not be optimal either. Shinzato et al 2014 Plos One Porites australiensis http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0085182 DDBJ Sequence Read Archive (DRA) under accession number DRA000906 (BioProject ID: PRJDB731)
Thanks @hputnam
There is also another newly available transcriptome for P. astreoides but it is also a holobiont transcriptome. I suppose I could try filtering symbiont sequences out of it with the various Symbiodinium resources available, and combine it with the other P. astreoides transcriptome I used (from the Matz lab), as well as perhaps the P. australiensis one for good measure.
Probably? first project steps:
Working with RADseq data to find population structure...
I'm a little familiar with the first few steps in stacks for single-read RADseq data, but since I have paired end data I was thinking it might be interesting to take the class project in a different direction than just population structure - while there isn't a Pacific cod genome, there is an Atlantic cod genome, so once I've done the de novo assembly and identified SNPs, it might be interesting to align this to the Atlantic cod genome and see how different / similar the species are.
Using RNAseq data from coho to conduct a differential expression analysis
Although I've ran through this once, I think I might get better results if I go back and spend some time cleaning up the sequencing data and try working with the PE data instead of merging it as I did the first time around.
My goal is to characterize a Pacific oyster (Crassostrea gigas) proteome for this class.
Potential first steps: 1) Gain a full understanding of what my dataset represents. 2) Use Blast to identify matched proteins between a database and our query. 3) Further research the functions of these identified proteins.
I eventually will be analyzing differences in proteomes between P. oysters reared under different conditions so I think this will be important in getting "baseline" data or just familiar with different types of proteins that are typically expressed in these oysters.
My project is to annotate a small portion of the geoduck genome sequenced by BGI. My first steps might be:
My goal is to compare microbial metaproteomic depth profiles from both a traditional database searching strategy and de novo peptide sequencing. Ultimately I'll be comparing the numbers of peptide and proteins matched or sequenced (some quality control will have to come in here), and the resulting taxonomic and functional characterizations that each output leads to. My first steps are:
0) Obtain MS/MS spectra (I ran some samples last week and and was running more today but just decided to shut down the instrument in case we lose power...) Should be all done by next week. Once I have raw data (.RAW Waters file directories), I need to peak pick in Progenesis (Waters software) and convert to .mgf and .mzxML with msconvert.
1) Download search database (from assembled metagenome) from Rocap group server
2) Search .mzxML files with X!Tandem maybe via the TPP and run .mgf files with Novor CLI.
3) Figure out how to quality control the output - be it with PeptideProphet or something else.
4) Run peptide results through Unipept and MEGAN6for taxonomic and functional characterizations.
Comparing population genetics of different cohorts of Pacific cod in Puget Sound using RAD data -- first few steps:
1) clean/filter raw reads 2) build/catalogue/align loci 3) call SNPs
What do you imagine are the first three steps in your class project? Feel free to ask clarifying question in class or using issues..