Closed sr320 closed 6 years ago
In my project, I want to identify differentially methylated loci and regions between Eastern oyster samples exposed to different pCO2 conditions. First steps involve aligning bisulfite sequencing data to the Eastern oyster genome, then identifying DMLs and DMRs.
bismark
on Mox: This will align my samples to the Eastern oyster genome.methylKit
parameters using subset data while full samples are running on Mox: I previously ran bismark
on a subset of my data on a different machine. While my full samples are running on Mox, I can use the subset data to test different parameters when identifying DMLs and DMRs. I can use this test to inform the settings used for my full samples on Mox.With a MiSeq run that is already accessible I must:
My main goal right now is to get to the point of distinguishing each individual sample and identifying the number of individual sequences within each sample. This would be presented in a (very long) table.
For my metabarcoding project, I will
Big picture:
I have not identified appropriate programs for the above, just getting started thinking about these steps. Katherine pointed me to some resources from the QuantSeq people, so I'll start there, but please let me know if you have any advice.
I'm well into my project currently with 16S amplicon data. The steps so far went like this:
FastQC
to check sequencing qualitymothur
for read joining, quality filtering (homopolymer removal, read length checks,
dereplication, chimera removal), database alignmentVSEARCH
for clusteringCurrently, I'm using the R package phyloseq
for statistical analyses, including alpha-diversity calculations and ordination, and for producing publication quality graphics that allow me to make meaningful extrapolations from my data.
For my exon capture project the end goal is to get as close as possible to a phylogeny as possible. My source data are in fastq format demultiplex. Here are the first steps for this project (based on methods from Kuang et al. 2018):
For my project, I’ll be analyzing a set of eDNA data from a recently published article by Dr. Ryan Kelly’s SMEA team (they are an awesome group!!). This will be practice for when I get my own set of data this December.
The end goal of my project is to organize the original metadata file into either a table or FASTA format file with the OTU identifier, corresponding sequence, and the selected taxonomic level.
The first three steps will be:
For my project, I will compare the relative protein expression level, and types of proteins expressed, between coral tissue and coral skeletal material in bleached and non-bleached corals. My collaborators and I have run mass spectrometry on the samples and converted them to .mzXML files and ran peptide- and protein-profit on them. However, My collaborators have moved the files to a directory that I can not access them. So...
My first step is to gain access to those files which I am working on setting up a meeting with my collaborators. For the next steps, I am unclear, but I imagine I will quality check and clean up the data, then I think the next step will be to start comparing differences in protein expression between the treatments and identifying each treatments' unique proteins and what those mean.
My project will be using a data set of snailfish RADSeq data. My goal is to create a phylogeny based on the data. The first three steps will be:
curl
doesn't seem to work). Create a checksum in case I ever have to download it again.Fastqc
to visualize the data before doing anything to it.My first three steps of my pangolin genomics project is:
For my project I will first download the data, then:
For my project, I'm going to:
What do you imagine are the first three steps in your class project? Feel free to ask clarifying question in class or using issues..