The goal of that worker is to create files with indices that split the dataset into a distinctive subsets.
The required usage:
1) user provides the output dir where files with indices will be created (--o)
2) user provides the problem name (--p) OR length of the dataset (--l)
3) user provides split --s (value from 1 to l-2, which are border cases when one of the other split will contain a single index)
3) additional option: random_sampling (--r, DEFAULT: true)
-- when random_sampling is on, both files will contain list of indices
-- when off, both files will contain ranges, i.e. [0, s-1] and [s, l] respectivelly
The goal of that worker is to create files with indices that split the dataset into a distinctive subsets.
The required usage: 1) user provides the output dir where files with indices will be created (--o) 2) user provides the problem name (--p) OR length of the dataset (--l) 3) user provides split --s (value from 1 to l-2, which are border cases when one of the other split will contain a single index) 3) additional option: random_sampling (--r, DEFAULT: true) -- when random_sampling is on, both files will contain list of indices -- when off, both files will contain ranges, i.e. [0, s-1] and [s, l] respectivelly