Closed sr320 closed 7 years ago
Project Goals for this Week: (1) Repeat Week 5 comparisons made with ustacks -m parameter for the ustacks -M and cstacks # individs in catalog
(2) Determine which quantitative measurements should matter the most when comparing parameter outputs.
Progress (1) repeated Week 5 comparisons with the ustacks -M parameter (# of loci retained AND graphing heterozygosity with R code) as well as the cstacks # individs for catalog (# of loci retianed ONLY)
(2) Decided that I will most likely use a stack depth (-m) of 5 and and -M of 3. Have not thoroughly looked over the cstacks' parameter, # individs for catalog.
(3) Decided that I might also explore the upper / lower bound error rate parameters in ustacks (--bound_low and --bound_high
)
(4) Tackling the question of whether the stacks program populations
undercalls heterozygotes by running independent scripts on the output. So far have run into a lot of trouble here, as these scripts were written for a different version of stacks.
This week, I was mainly troubleshooting all of the problems I ran into last week.
kallisto quant
to work with my dataProgress from this week:
1) Double checked to make sure my blast output was complete by checking to see if the tail end of output file contained contigs from my query. Here is my notebook
2) Turned my geoduck blast output that I merged with a Uniprot database into a large table
3) I also created a table with the proteins found in the Taylor oyster seed experiments after merging with the uniprot database.
I figured out how to run sstacks, the penultimate program in the Stacks pipeline, and wrote a python script so that I can rerun it from the command line simply. The script writes and runs a shell script to accomplish this. Here's the markdown file.
In addition, I've made some progress on figuring out how to call filepaths so that I can streamline use no matter the directory structure. I've also made progress on sorting out what additional analyses and filtering my lab group does with RAD data to do after running populations, the final program in the Stacks pipeline.
For week 6:
I spent this week trying to get to a point where I could run my Bismark analysis. I was able to:
This week's accomplishments and goals for next week have been updated on my README
Since my v01 "pre-release" I have done the following:
Progress Made
Next Steps
Macbook_local
for work done on a repository on my personal laptop.prcomp
function for principal components analysis. Progress post release:
Homebrew
and installed gdrive
so I can now interact with my Google Drive directories to access files there. blastp
to identify de novo sequenced peptides.
Last week you took to bold move to make your first
release
. For this week project-progress simply provide a bulleted explanation of all the improvements to your project since therelease
.