Charlie converted the sample_groups variable (in yaml files) to a sample_set variable. This solves the following problems with the sample_groups variable:
sample_groups grabs both the sample_set and the probe_set as a list (instead of as separate variables)
it requires an exact match to the probe_set column, resulting in the potential exclusion of valid samples (e.g. if a user inputs ['JJJ', 'DR1,VAR4'] then any matches to JJJ with 'DR1,VAR5' would be ignored.
it makes the assumption that there might be multiple probe sets to analyze (inherent in being a list) when in reality our variant calling system can only handle one probeset at a time
it's completely redundant with the probe_set variable (which actually specifies the probe set a user wants to analyze), which also confuses new users.
Charlie converted the sample_groups variable (in yaml files) to a sample_set variable. This solves the following problems with the sample_groups variable: