opencb / hpg-bigdata

This repository implements converters and tools for working with NGS data on HPC or Hadoop clusters
Apache License 2.0

Improve the 'alignment convert' command line #45

Closed jtarraga closed 7 years ago

jtarraga commented 8 years ago

The command line should offer some basic filters (e.g., region filter, mapping quality filter) and take into account some important Parquet parameters (e.g., page size, row group size), as well as parameters to compress the SAM sequences and qualities:

./hpg-bigdata.sh alignment convert -i .... -o .... --to {avro | parquet} [--from {bam | sam | cram | avro}] --compression {deflate | snappy | ...} [--page-size ...] [--row-group-size ...] [--skip-sequence] [--bin-qualities] [--region ....] [--region-file ....] [--mapping-qual ....]
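
To make the proposed option set easier to discuss, here is a minimal sketch of the synopsis above modeled with Python's standard `argparse` module. It is not the project's implementation (hpg-bigdata is a Java tool); the flag names come from the synopsis, while the destination names, the integer types for sizes and mapping quality, and the decision to leave `--compression` as a free-form string (the proposal leaves its list of codecs open) are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Hypothetical model of the proposed 'alignment convert' options."""
    p = argparse.ArgumentParser(prog="hpg-bigdata.sh alignment convert")
    # Options shown without brackets in the synopsis are treated as required.
    p.add_argument("-i", dest="input", required=True, help="input file")
    p.add_argument("-o", dest="output", required=True, help="output file")
    p.add_argument("--to", required=True, choices=["avro", "parquet"])
    p.add_argument("--from", dest="source_format",
                   choices=["bam", "sam", "cram", "avro"])
    # The proposal lists "deflate | snappy | ...", so no fixed choices here.
    p.add_argument("--compression", required=True)
    # Parquet tuning parameters; integer values are an assumption.
    p.add_argument("--page-size", type=int)
    p.add_argument("--row-group-size", type=int)
    # Flags that reduce the size of stored sequences and qualities.
    p.add_argument("--skip-sequence", action="store_true")
    p.add_argument("--bin-qualities", action="store_true")
    # Basic filters; an integer mapping-quality value is an assumption.
    p.add_argument("--region")
    p.add_argument("--region-file")
    p.add_argument("--mapping-qual", type=int)
    return p
```

Calling `build_parser().parse_args([...])` with a list of tokens returns a namespace in which every optional value that was not supplied is `None` and both boolean flags default to `False`.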