genomic-medicine-sweden / nallo

An analysis pipeline for long-reads from both PacBio and Oxford Nanopore Technologies (ONT), written in Nextflow.
https://genomic-medicine-sweden.github.io/nallo/
MIT License
17 stars 4 forks source link

[User story] Small test data #32

Open fellen31 opened 7 months ago

fellen31 commented 7 months ago

Need

As a developer, I need the pipeline to have a test profile with accompanying relevant test data, which should test if not all then at least the majority of the pipeline.

Suggested approach

Currently there are three presets available: pacbio, revio and ONT_R10. Using the revio preset should cover all pacbio cases.

Can be closed when

HIPHASE_SV takes input from both DeepVariant and an SV caller to phase reads. Sniffles is not supported and it will fail depending on how the calls in the VCF looks. Ideally the test data would maybe provide all possible types of SVs, because with the current test data HiPhase will not fail (or remove HiPhase, or add psbv as input which is the only supported SV-caller).

fellen31 commented 7 months ago

Added working revio test data in #33. Points above still valid.

If possible, the test data should also be able to:

fellen31 commented 7 months ago

Adding CNV-workflow as separate issue (#60)

fellen31 commented 6 months ago

Add SNV-annotation to test profile as separate issue (#74)