cambiotraining / bacterial-genomics

Course materials for "Working with Bacterial Genomes"
https://cambiotraining.github.io/bacterial-genomics/
Other
0 stars 2 forks source link

Commands for testing software setup #20

Closed tavareshugo closed 7 months ago

tavareshugo commented 7 months ago

@cambioinfo below the code to test the setup for day 1:

# directory for day 1
cd M_tuberculosis/

# samplesheet
python scripts/fastq_dir_to_samplesheet.py data/reads \
    samplesheet.csv \
    -r1 _1.fastq.gz \
    -r2 _2.fastq.gz

# activate environment
eval "$(conda shell.bash hook)"
source $(mamba info --base)/etc/profile.d/mamba.sh
mamba activate nextflow

# bacQC - takes ~6 min
nextflow run avantonder/bacQC \
  -r main \
  -profile singularity \
  --max_memory '16.GB' --max_cpus 8 \
  --input samplesheet.csv \
  --outdir results/bacqc \
  --kraken2db databases/minikraken2_v1_8GB \
  --brackendb databases/minikraken2_v1_8GB \
  --genome_size 4300000

# bactmap ~ takes ~10 min
nextflow run nf-core/bactmap \
  -profile singularity \
  --max_memory '16.GB' --max_cpus 8 \
  --input samplesheet.csv \
  --outdir results/bactmap \
  --reference resources/reference/MTBC0.fasta \
  --genome_size 4.3M

# activate new env
mamba activate seqtk

# seqtk script - <1 min
bash scripts/03-pseudogenome_check.sh

# activate new env
mamba activate remove_blocks

# masking script - <1 min
bash scripts/04-mask_pseudogenome.sh
cambioinfo commented 7 months ago

All working on current course environment for 11/12/23