kundajelab / chipseq_pipeline

AQUAS TF and histone ChIP-seq pipeline
BSD 3-Clause "New" or "Revised" License
105 stars 40 forks source link

Some reps get ERROR:root:--extsize must >= 1! #45

Closed jloupe closed 6 years ago

jloupe commented 6 years ago

We have run several ChIP-seq datasets through the Aquas pipeline, and occasionally we will get ERROR:root:--extsize must >= 1! It will cause any rep with this error to fail. What causes this error and what can be done to correct it?

I have found that if the pipeline shows this error, we can change the macs2 --extsize parameter to 200, and restart the job. The pipeline will pick up from macs2 and apparently complete the job without error. The resulting output seems to be correct when looking at the number of peaks called, passing IDR, and visualizing bigwigs with IGV. However, the cross-correlation plot is clearly effected and the estimated fragment length is negative. Is this the correct or best way to address this issue?

I have attached the output log from such a run, along with the summary html and an example image of bigwig visualization between reps. Rep1 ran normally, Rep2 had the error. Thanks.

akundaje commented 6 years ago

Yes occasionally the fragment length estimation fails for some datasets which have weak fragment length peaks in the cross-correlation. In such a case you can set the fragment length explicitly to something like 150-200 and rerun the pipeline. So what you did is absolutely fine.

-Anshul.

On Wed, Oct 3, 2018 at 4:18 PM jloupe notifications@github.com wrote:

We have run several ChIP-seq datasets through the Aquas pipeline, and occasionally we will get ERROR:root:--extsize must >= 1! It will cause any rep with this error to fail. What causes this error and what can be done to correct it?

I have found that if the pipeline shows this error, we can change the macs2 --extsize parameter to 200, and restart the job. The pipeline will pick up from macs2 and apparently complete the job without error. The resulting output seems to be correct when looking at the number of peaks called, passing IDR, and visualizing bigwigs with IGV. However, the cross-correlation plot is clearly effected and the estimated fragment length is negative. Is this the correct or best way to address this issue?

I have attached the output log from such a run, along with the summary html and an example image of bigwig visualization between reps. Rep1 ran normally, Rep2 had the error. Thanks.

DEAF1_CB_Output_report.pdf https://github.com/kundajelab/chipseq_pipeline/files/2444273/DEAF1_CB_Output_report.pdf [image: screen shot 2018-10-03 at 2 50 35 pm] https://user-images.githubusercontent.com/37379607/46443475-5e0d7580-c733-11e8-966d-a860f7949245.png DEAF1_CB_pipeline.json.log https://github.com/kundajelab/chipseq_pipeline/files/2444275/DEAF1_CB_pipeline.json.log

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/kundajelab/chipseq_pipeline/issues/45, or mute the thread https://github.com/notifications/unsubscribe-auth/AAI7ES_g8uLx_kOkaHu20JjIjzXxBqfgks5uhUWrgaJpZM4XHG5X .

jloupe commented 6 years ago

Thank you very much for the response.