odelaneau / shapeit4

Segmented HAPlotype Estimation and Imputation Tool
MIT License
89 stars 17 forks source link

Assertion generated at shapeit4: src/phaser/phaser_algorithm.cpp:50 #1

Closed mdkeehan closed 3 years ago

mdkeehan commented 5 years ago

Hi,

Thanks for writing such a good tool and making it MIT licensed. We have been evaluating shapeit4 on low density cattle snp chip data and a colleague has been generating the following assertion.

shapeit4: src/phaser/phaser_algorithm.cpp:50: void phaser::phaseWindow(int, int): Assertion `threadData[id_worker].Kvec[w].size()>0' failed.
Aborted (core dumped)

The command line we have been using is

shapeit4 --really.simple.ped.phased.vcf.gz --map  1cmMb.map -O  out.vcf -R Chr5 --thread 4  -W 20000000 --effective-size 400  --pbwt-disable-init --mcmc-iterations  6b,1p,2b,1p,2b,20m

The input vcf has 1935 markers ranging over Chr5 which (in cattle) is 121 MB long. So with -W 20,000,000 we have 6 windows with 400 markers per window.

By experimenting with the parameters I have found setting -W to 200,000,000 i.e force 1 window over the chromosome the command completes.

Is there a recommended minimum number of markers per window?

odelaneau commented 5 years ago

Looking at the assert, it's definitely related to the low density of variants. But It's a tricky problem to solve with few info. I am happy to look at this in more details if you could send me a small example dataset repoducing the error.

Cheers,

mdkeehan commented 5 years ago

Hi Olivier,

Thanks for responding. It will be difficult for me to release a public dataset plus given that we have a work around i.e. to use longer windows I won’t be following this up furthur. Please mark the bug as closed.

Mike

From: Olivier Delaneau notifications@github.com Sent: Friday, 22 February 2019 9:15 PM To: odelaneau/shapeit4 shapeit4@noreply.github.com Cc: Michael Keehan Michael.Keehan@lic.co.nz; Author author@noreply.github.com Subject: Re: [odelaneau/shapeit4] Assertion generated at shapeit4: src/phaser/phaser_algorithm.cpp:50 (#1)

Looking at the assert, it's definitely related to the low density of variants. But It's a tricky problem to solve with few info. I am happy to look at this in more details if you could send me a small example dataset repoducing the error.

Cheers,

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/odelaneau/shapeit4/issues/1#issuecomment-466312105, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AUDjdTEoajqEnb8hsEFbZOLvWf0kTKgIks5vP6cZgaJpZM4a4Lkf.

Michael Keehan

Senior Scientist (Bioinformatics)

Private Bag 3016 (Cnr Ruakura & Morrinsville Roads, Newstead) Hamilton 3240 NEW ZEALAND

Phone: 07 856 0661 Web: www.lic.co.nz

This message is for the named person's use only. It may contain confidential, proprietary or legally privileged information. No confidentiality or privilege is waived or lost by any mis-transmission. If you receive this message in error, please immediately delete it and all copies of it from your system, destroy any hard copies of it and notify the sender. LIC reserves the right to monitor all e-mail communications through its networks. Although LIC regularly scans its computer systems for viruses using anti-virus software, LIC is not responsible for the consequences of any viruses, data corruption, interception or unauthorised amendment to this email. The recipient should check this email and any attachments for viruses and defects. This email does not designate an information system for the purposes of section 214(a) of the NZ Contract and Commercial Law Act 2017. Without limiting the above, if the recipient of this email is a current or previous customer of LIC, the LIC Service Rules, available at www.lic.co.nz, shall apply to the information set out in this email and any attachments.