Closed baohua100 closed 7 years ago
Greetings,
When given a bed file of targeted regions, by default NEAT will sample a majority of the reads from that region, and a fraction of reads from outside the region. This is controlled by the input parameter “-to”, which is 0.02 by default (meaning regions outside the specified regions will be sampled from with probability 2%). By setting -to 0.0 you should able to restrict read sampling to entirely within targeted regions.
Hope this helps! Zach
On Dec 30, 2016, at 3:24 PM, baohua100 notifications@github.com<mailto:notifications@github.com> wrote:
python genReads.py -r hg19.fa -R 150 -c 3000 -t target.bed -o sim1_norm --bam --vcf --pe 350 30
The program still trying to sampling all reads from the genome, but not limited to the target.bed regions.
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHubhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_zstephens_neat-2Dgenreads_issues_16&d=DQMCaQ&c=8hUWFZcy2Z-Za5rBPlktOQ&r=iSEBNMfCHfS5bLIS7AqI6wkxgRNv8-2dMwRF43ddddU&m=EjuJaHdlivcxGYGQXUCHypXZrfUA_0rwgA-qn-b-3xA&s=Tan3a3Q2Plynd8VFS1G6TJGMvOii_vug5Bb4XifyJ5Y&e=, or mute the threadhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_AIHT1dueN2oZxqs1L21HgLjRYXMY2Iitks5rNWiAgaJpZM4LYX8D&d=DQMCaQ&c=8hUWFZcy2Z-Za5rBPlktOQ&r=iSEBNMfCHfS5bLIS7AqI6wkxgRNv8-2dMwRF43ddddU&m=EjuJaHdlivcxGYGQXUCHypXZrfUA_0rwgA-qn-b-3xA&s=StKe9qdC7B5bpFWY9XaJbTxmzNYkt7cGUIyIW0VxNjQ&e=.
python genReads.py -r hg19.fa -R 150 -c 3000 -t target.bed -o sim1_norm --bam --vcf --pe 350 30
The program was still trying to sample all reads from the genome, but not limited to the target.bed regions.