Closed ccrobertson closed 4 years ago
First. Yeah those are all tools you’d need to install yourself. Preseq can be tricky so if you hit errors let me know.
Refgenie has undergone some major renovations so it is a new system. Check out the refgenie docs and then next Monday I can also meet in person to go over stuff, but Ill be working from home over next couple days.
Are you running PE samples? If so I need to push some changes to the repo and youll need to update.
From: Cassie Robertson notifications@github.com Sent: Wednesday, July 3, 2019 4:55:01 PM To: databio/peppro Cc: Subscribed Subject: [databio/peppro] Installing PEPPRO on Rivanna (#3)
Hi Jason, I'm working on setting up PEPPRO to run on Rivanna. I have a few logistics questions
Assemblies: Is the environmental variable $REFGENIE in the PEPPRO documentation equivalent to $GENOMES in the PEPATAC documentation? Wondering because I don't see a "genome_config.yaml" file in the reference genome directories I was pointing to previously (which are maintained by you/Nathan I think)
Required software: I'm guessing most/all are already installed on Rivanna/CPHG systems. I dug around a bit and found "modulefiles" for some of the required tools on the t1 drive. Couldn't find modulefiles for seqkit, fastp, preseq, wigToBigWig, or bigWigCat. Do I need to install these myself or are there existing installations I can use?
Thanks for the help!
Cassie
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHubhttps://github.com/databio/peppro/issues/3?email_source=notifications&email_token=AG6MFXZZR3346ERFV2HO2DLP5UG2LA5CNFSM4H5KBUD2YY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4G5HJP6A, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AG6MFX2LKHBPO6A3PDUIYF3P5UG2LANCNFSM4H5KBUDQ.
Cassie, the refgenie docs are now here: refgenie.databio.org -- we're shortly going to move PEPATAC to this system as well. it's much easier. But we're in a transition stage... so to answer your question specifically, $REFGENIE is not the same thing as $GENOMES.
Jason are you planning on building a singularity container like you did with pepatac?
our new genomes folder is:
echo $REFGENIE /project/shefflab/genomes/genomes.yaml
Ok great. Thanks to both for the info!
Yes, they are paired end. Jason, I'll read about refgenie now and would be great to chat on Monday afternoon if you're free.
I see now that there's actually already a section on running with a container in the documentation. Sorry, I didn't notice that before.
Monday works for me. Just pushed changes necessary for PE data. And yes, container should be good to go.
do you have a singularity version built? Or are you going to wait for a "real" release?
The docs link points to: http://big.databio.org/simages/peppro but this is a 404 error.
Hi Jason, I'm working on setting up PEPPRO to run on Rivanna. I have a few logistics questions
Assemblies: Is the environmental variable $REFGENIE in the PEPPRO documentation equivalent to $GENOMES in the PEPATAC documentation? Wondering because I don't see a "genome_config.yaml" file in the reference genome directories I was pointing to previously (which are maintained by you/Nathan I think)
Required software: I'm guessing most/all are already installed on Rivanna/CPHG systems. I dug around a bit and found "modulefiles" for some of the required tools on the t1 drive. Couldn't find modulefiles for seqkit, fastp, preseq, wigToBigWig, or bigWigCat. Do I need to install these myself or are there existing installations I can use?
Thanks for the help!
Cassie