ComparativeGenomicsToolkit / cactus

Official home of genome aligner based upon notion of Cactus graphs
Other
505 stars 111 forks source link

cactus-hal2chains makeover #1334

Closed glennhickey closed 5 months ago

glennhickey commented 6 months ago

This PR adds some basic functionality to cactus-hal2chains

It also fixes up the terminology. As pointed out by #1273 the --targetGenomes option was backwards with respect to how it's defined in the chain format. The option names are fixed/changed so that --queryGenomes and --targetGenomes refer to what you get in the output.

This script will still be very inefficient for big hal files on clusters (or anwyhwere without file caching), because the hal file will need to be copied to a local job for each pairwise comparison. I think the only work-around is to implement a batching system like in cactus-hal2maf eventually...