The canonical way to identify variants should be consistent, either:
The Series name in a Series or DataFrame
The variant ID associated with the GenotypeArray
Choice 1 would render the variant ID useless when it exists in a dataframe.
Choice 2 would require more careful validation of dataframes to avoid duplicate IDs
Either choice requires carefully considering how to name encoded genotype results.
The canonical way to identify variants should be consistent, either:
Choice 1 would render the variant ID useless when it exists in a dataframe. Choice 2 would require more careful validation of dataframes to avoid duplicate IDs
Either choice requires carefully considering how to name encoded genotype results.