opencb / hpg-bigdata

This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster
Apache License 2.0
17 stars 14 forks source link

Convert VCF to Parquet without going through Avro #94

Closed jtarraga closed 7 years ago

jtarraga commented 8 years ago

Currently, when converting a VCF file to Parquet, first, the VCF is converted to Avro and then, to Parquet (i.e.: VCF -> Avro -> Parquet). A more effective approach should be implemented to convert from VCF to Parquet without going through Avro.