Closed SarahBeecroft closed 1 year ago
Hi, Sorry for coming back so late. I must have missed this in my mails.
The pipeline does not accept a sample sheet, because its input is only a plain FASTA file.
Please try
nextflow run nf-core/pangenome -r a_brave_new_world -profile singularity --input FASTA_INPUT.fa.gz --n_haplotypes 12 --wfmash_map_pct_id 70 --wfmash_segment_length 2000 --smoothxg_poa_length '700,900,1100' --outdir outdir
The FASTA file has to be zipped with bgzip
beforehand.
No worries! That's working for me now, thanks. I think I got confused because there's mention of the sample sheet in the readme currently. Nice when it's a simple fix! Thanks :)
I have five genomes. Do I merge them and compress them bgzip
?
You put all sequences of all the five genomes into one FASTA file. Ideally, you rename the names of the sequences to respect the https://github.com/pangenome/PanSN-spec.
Check Documentation
I have checked the following places for your error:
Description of the bug
No matter what format I put inside my samplesheet.csv, I get the following error
I have copied in the exact format from the usage.md, as well as from the python script that parses the spreadsheet. It still fails if I create dummy files which match the names in the sample sheet.
Steps to reproduce
Steps to reproduce the behaviour:
Command line:
See error:
Expected behaviour
I would expect it to read my sample sheet, since it should follow the expected format.
Log files
System
Nextflow Installation
Container engine
Thanks!!