Closed by DSuveges 1 year ago
This problem has been solved by removing the region-based filtering, which was applied in only one of the branches. The filtering was originally implemented to drop HLA regions so that the process would be performant enough, but this is no longer necessary, so it was dropped.
Distance-based clumping can be executed in two ways: with or without collecting the locus around the identified semi index. Both processes apply a shared clumping step, which is optionally followed by joining back to the source summary statistics. This logic assumes that the number of resulting semi indices is the same regardless of whether the locus is collected. Sadly, this is apparently not the case...
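To illustrate how the two branches can end up with a different number of semi indices, here is a minimal pure-Python sketch. It is not the project's actual implementation: the `Variant` type, the function names, the greedy clumping rule, the 250 kb window and the excluded region coordinates are all assumptions made up for this example. It only shows that filtering a region in one branch, before the shared clumping step, changes the number of leads that come out.

```python
from typing import NamedTuple, Optional


class Variant(NamedTuple):
    chrom: str
    pos: int
    pval: float


def distance_clump(variants, window):
    """Shared step (hypothetical): greedy distance-based clumping.

    Variants are visited from most to least significant; a variant becomes
    a lead only if no existing lead sits within `window` bp on the same
    chromosome.
    """
    leads = []
    for v in sorted(variants, key=lambda v: v.pval):
        if all(v.chrom != lead.chrom or abs(v.pos - lead.pos) > window for lead in leads):
            leads.append(v)
    return leads


def collect_locus(leads, variants, window):
    """Optional step (hypothetical): join each lead back to the summary statistics."""
    return {
        lead: [v for v in variants if v.chrom == lead.chrom and abs(v.pos - lead.pos) <= window]
        for lead in leads
    }


def clump(variants, window, with_locus=False, excluded_region: Optional[tuple] = None):
    """Run clumping, optionally dropping a region first and collecting loci afterwards."""
    if excluded_region is not None:
        chrom, start, end = excluded_region
        variants = [v for v in variants if not (v.chrom == chrom and start <= v.pos <= end)]
    leads = distance_clump(variants, window)
    return collect_locus(leads, variants, window) if with_locus else leads


# Toy summary statistics (made-up values).
sumstats = [
    Variant("6", 30_000_000, 1e-20),
    Variant("6", 30_100_000, 1e-9),
    Variant("1", 1_000_000, 1e-10),
    Variant("1", 1_200_000, 1e-8),
]
WINDOW = 250_000
REGION = ("6", 25_000_000, 35_000_000)  # placeholder coordinates, for illustration only

# Branch without locus collection, no region filter.
plain = clump(sumstats, WINDOW)
# Branch with locus collection, region filter applied only here.
filtered_with_locus = clump(sumstats, WINDOW, with_locus=True, excluded_region=REGION)
# Same branch once the region filter is removed.
unfiltered_with_locus = clump(sumstats, WINDOW, with_locus=True)

print(len(plain), len(filtered_with_locus), len(unfiltered_with_locus))
```

In this toy setup the filtered branch loses the lead that falls inside the excluded region, so the counts disagree; once the filter is removed from that branch, both branches go through the same clumping step on the same input and the counts match.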