Closed yarden closed 11 years ago
Hi Yarden,
I think I am being dense. To my eye, all of those intervals overlap and should thus be in a single cluster. The output reports them all to be in cluster number "1" (the last column). At which interval do you think a new cluster should be started?
Hi Aaron,
Ack, sorry, I was the one being dense: I misread the docs. It said: "In the example below, the 4th column is the cluster ID" and then showed a BED where the 4th column in the file is the cluster ID. Therefore, I was expecting the 4th field of my BED (i.e. the one containing 102919-4, 302711-1, ...
) to contain the cluster ID, but I see that for my BED it made the cluster ID the fifth column of the file. My bad. Thanks, --Yarden
Ah, I see. No worries...
I never leave the house without Bedtools, and recently discovered the immensely useful feature
clusterBed
and am finding many applications for it. I'm having trouble applying it to BAM files, though, and wanted to see if there's a way around it or if this utility is incompatible with BAMs.I have a BAM file generated by
tagBam
that I'd like to runclusterBed
on. I convert it to BED withbamToBed
and then sort it in preparation forclusterBed
:That works great, but the fourth field of IDs is already populated by the conversion. As far as I can tell from the docs,
clusterBed
uses the fourth field to group the intervals into clusters since it does not merge them into meta-intervals. It looks likeclusterBed
cannot work with this output:Now
clusterBed
does not appear to correctly group the intervals:Is there a way around this? I'm basically trying to find a way to run
clusterBed
on BAM files, and am happy to convert them to BEDs in the process. Thanks.