Daniel-Mietchen / events

Materials related to events I might attend, and to talks I am giving
Creative Commons Zero v1.0 Universal
24 stars 15 forks source link

12 June 2019, Charlottesville: Bioinformatics in a High-Performance Computing Environment #573

Closed Daniel-Mietchen closed 5 years ago

Daniel-Mietchen commented 5 years ago

as per https://education.cadre.virginia.edu/#/workshop/1596 and https://workshops.somrc.virginia.edu/lesson/hpc-bioinformatics/

Daniel-Mietchen commented 5 years ago

Good mix of people in the room, including some people from iGEM Virginia with whom I was trying to get in touch anyway, following the synthetic biology symposium last week (cf. https://github.com/Daniel-Mietchen/events/issues/568 ).

In the afternoon, there will be room to discuss what software is/ should be running on the UVA HPC cluster Rivanna (see my separate notes on that). I'd like to use a MyBinder/ Docker/ Wikibase combo, and I'd like to learn more about the different file formats used in sequence analysis as well as about dark taxa.

Daniel-Mietchen commented 5 years ago

Another thing that's on my mind in this context is how things like http://wikigenomes.org/ can be integrated into sequence analysis workflows, or https://wikidp.org/ into software curation workflows.

Daniel-Mietchen commented 5 years ago
Daniel-Mietchen commented 5 years ago

I asked about how the environmental footprint of Rivanna is managed, and the VP for IT was mentioned as overseeing that. Will contact them.

Daniel-Mietchen commented 5 years ago

as allocation, users can use "rivanna-training" up to certain limits.

Daniel-Mietchen commented 5 years ago

For large file transfers, use Globus.

Daniel-Mietchen commented 5 years ago

For the hands-on, I am using the Terminal under rivanna-portal, along with instructions from https://workshops.somrc.virginia.edu/lesson/ngs-seq-alignment/ .

Daniel-Mietchen commented 5 years ago

The hostname command gives udc-ba35-36 for me right now. Gonna avoid that for running my jobs.

Daniel-Mietchen commented 5 years ago

after launching the ijob, hostname gives udc-ba26-28a.

Daniel-Mietchen commented 5 years ago

to get a list of all of a user's Rivanna jobs on that host machine, type squeue -u UVACOMPUTEID, where UVACOMPUTEID is their compute id

Daniel-Mietchen commented 5 years ago

to get a list of most recently run jobs, type sacct, or sacct -S 2017-01-30 to get all the jobs run since a given start date. Output format can be specified using things like sacct --format=jobid,jobname,start,elapsed,end -S 2017-01-30. See sacct -h for details.

Daniel-Mietchen commented 5 years ago

echo $SLURM_CPUS_PER_TASK gives the number of cores requested by the ijob command

Daniel-Mietchen commented 5 years ago

If there is no scratch directory, then instead of cd /scratch/dm7gn/ngs-aln-workshop/ref/, one can do cd /sfs/lustre/scratch/dm7gn/ngs-aln-workshop/ref/.

Daniel-Mietchen commented 5 years ago

met Sarah French. will send email to follow up.

under /project/genomes, there is a list of genomes for which indexes for alignment have already been built.

Daniel-Mietchen commented 5 years ago

Both the iGEM team and Sara are working on styrofoam.