opencb / hpg-bigdata

This repository implements converters and tools for working with NGS data on HPC or Hadoop clusters.
Apache License 2.0
17 stars 14 forks

Convert BAM to Parquet without going through Avro #97

Closed jtarraga closed 7 years ago

jtarraga commented 8 years ago

Currently, converting a BAM file to Parquet takes two steps: the BAM is first converted to Avro, and the result is then converted to Parquet (i.e., BAM -> Avro -> Parquet). A more efficient approach should be implemented that converts BAM directly to Parquet, without going through Avro.
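
To make the difference between the two routes concrete, here is a minimal sketch using stand-in types only. It does no real BAM, Avro, or Parquet I/O, and every name in it (`ConversionSketch`, `viaIntermediate`, `direct`, the `String` records) is hypothetical and not taken from hpg-bigdata. It only illustrates that the two-step route materializes every record in an intermediate stage before writing, while the direct route hands each record to the output as it is read.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical stand-ins only: no real BAM, Avro, or Parquet I/O is performed.
public class ConversionSketch {

    // Two-step route (BAM -> Avro -> Parquet): every record is first stored
    // in an intermediate stage, which is then read back and written out.
    static int viaIntermediate(Iterator<String> bamRecords, Consumer<String> parquetSink) {
        List<String> avroStage = new ArrayList<>(); // stands in for the Avro file
        while (bamRecords.hasNext()) {
            avroStage.add(bamRecords.next());
        }
        for (String record : avroStage) {
            parquetSink.accept(record);
        }
        return avroStage.size(); // number of records held in the intermediate stage
    }

    // Direct route (BAM -> Parquet): each record goes to the sink as it is read.
    static int direct(Iterator<String> bamRecords, Consumer<String> parquetSink) {
        while (bamRecords.hasNext()) {
            parquetSink.accept(bamRecords.next());
        }
        return 0; // nothing is held in an intermediate stage
    }

    public static void main(String[] args) {
        List<String> reads = Arrays.asList("read1", "read2", "read3");
        List<String> viaAvro = new ArrayList<>();
        List<String> straight = new ArrayList<>();

        int stagedA = viaIntermediate(reads.iterator(), viaAvro::add);
        int stagedB = direct(reads.iterator(), straight::add);

        System.out.println("same output: " + viaAvro.equals(straight));
        System.out.println("records staged: " + stagedA + " vs " + stagedB);
    }
}
```

Both routes produce the same output in this sketch. The direct one avoids writing and re-reading the intermediate data, which is the saving this issue asks for.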