Closed stschiff closed 2 years ago
This is exactly the sort of thing I had in mind! This would be a great quality of life feature.
Even better, including ascertainment:
groupDefs:
group1: a,b,-c,-<d>
group2: e,f,-<g>
group3: [group1, group2]
Right1: ["<Ind1>", "<Ind2>"]
Right2: ["Group1", "Group2"]
AllRights: [Right1, Right2]
stats:
- type: f4 // this would yield 15 statistics
ascertainment:
based_on: group3
min_af: 0
max_af: 0.02
outgroup: Chimp.REF
popA: [a, b, c, group1, <ind1>]
popB: [<i1>, <i2>, group2]
popC: [d]
popD: [Chimp.REF]
- type: f3
ascertainment:
based_on: All_Rights
min_af: 0
max_af: 0.02
outgroup: Chimp.REF
popA: [Left1, Left2, Left3,...]
popB: [Right1, Right2, Right3,...]
popC: [d]
- type: FST
popA: [a]
popB: [b]
Done.
Currently, stat files for
fstats
look like:I am planning to add a new YAML format config file, while keeping the option of giving these statFile inputs for backwards compatibility. The new input format seeks to improve on two fronts:
ras
Here is an example:
which would then create 12 f4 statistics, 2 f3 statistics and one FST statistics, which would be simply listed out in the output.
I think this gives quite some flexibility while power to those who want it. What do you think, @TCLamnidis?