The sample should demonstrate something to make the schema less complicated such as changing the call.genotype repeated field to instead be two fields call.first_allele and call.second_allele. (But be sure to add a check and fail the job if the data has any triploid genotypes).
The sample should demonstrate something to make the schema less complicated such as changing the
call.genotype
repeated field to instead be two fieldscall.first_allele
andcall.second_allele
. (But be sure to add a check and fail the job if the data has any triploid genotypes).See Google Cloud Platform Hadoop release announcement for details.