Closed martinghunt closed 4 years ago
Actually, looks like it's the clustering that is failing, resulting in a VCF with no records.
@martinghunt I think this is fixed through cluster_vcf_records
module?
Yes I fixed cluster_vcf_records
, so that example works now. But I'm wondering if it's still possible to end up with no records in the VCF. What will gramtools do in that case?
I see.
Here is the behaviour if you gramtools build
using a VCF with no records:
$ gramtools build --reference toy_datasets/noRecords/H37Rv.fasta --vcf toy_datasets/noRecords/mutant.vcf --gram-dir test
2020-01-07 10:02:48,084 gramtools INFO Start process: build
2020-01-07 10:02:48,085 gramtools INFO Running vcf_record_clustering on ['toy_datasets/noRecords/mutant.vcf'].
2020-01-07 10:02:48,085 gramtools INFO Running vcf_to_PRG_string_conversion on /home/brice/Desktop/work_PhD/git_repos/gramtools/tmp_work/test/build.vcf
2020-01-07 10:02:48,097 gramtools INFO stdout:
maximum thread count: 1
Executing build command
Generating integer encoded PRG
Number of characters in integer encoded linear PRG: 2660
Generate coverage graph
Number of variant sites: 0
No variant sites found.
Exiting 1
2020-01-07 10:02:48,099 gramtools ERROR Error code != 0
2020-01-07 10:02:48,100 gramtools ERROR Unsuccessful build. Process reported to /home/brice/Desktop/work_PhD/git_repos/gramtools/tmp_work/test/build_report.json
I figure that's good behaviour: clustering and prg string production happen, and then we exit stating no variants found.
What do you think?
Looks good to me. Thanks for testing :)
?? isnt the natural thing to do output a PRG with no variants?
That is produced, but the error message comes from that we do not proceed to build an FM-Index, PRG masks and kmer index
ok, understood.
Current behaviour is this output if clustering results in
build.vcf
with no variants:build.vcf
contains this:Would be good to check for variants in
build.vcf
before trying to continue