twitter / scalding

A Scala API for Cascading
http://twitter.com/scalding
Apache License 2.0

Parquet Complex Data Support for Big Decimals and INT96 #1916

Open gna-phetsarath opened 5 years ago

gna-phetsarath commented 5 years ago

Motivation

The existing primitive type support was extended to cover the complex data types listed below.

Supported Complex Data Type Conversions

Parquet Data Type | Converted Scalding Type
fixed_len_byte_array(L) field_name (DECIMAL(p,s)) | BigDecimal(p,s) => String; if s == 0 and p <= 10 => Int; if s == 0 and p <= 18 => Long
INT96 | Date formatted as a String via Timestamp.toString, pattern "yyyy-mm-dd hh:mm:ss.SSSSSSSSS", e.g. "2011-01-01 00:00:00.123456789"
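The conversion rules above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the object `ParquetComplexTypeSketch`, the helpers `convertDecimal` and `int96ToString`, and the `Converted` wrapper types are all hypothetical names. It assumes the DECIMAL bytes hold the big-endian two's-complement unscaled value, and that INT96 uses the common layout of 8 little-endian bytes of nanoseconds-of-day followed by a 4-byte little-endian Julian day, read as wall-clock time with no time zone adjustment.

```scala
import java.math.{BigDecimal => JBigDecimal, BigInteger}
import java.nio.{ByteBuffer, ByteOrder}
import java.sql.Timestamp
import java.time.{LocalDate, LocalTime}

object ParquetComplexTypeSketch {
  // Hypothetical result wrappers, used only to keep the three outcomes distinct.
  sealed trait Converted
  final case class AsInt(value: Int) extends Converted
  final case class AsLong(value: Long) extends Converted
  final case class AsString(value: String) extends Converted

  // Julian day number of 1970-01-01.
  private val JulianDayOfEpoch = 2440588L

  /** Applies the DECIMAL(p, s) rules from the table above.
    * Assumption: `bytes` is the big-endian two's-complement unscaled value.
    */
  def convertDecimal(bytes: Array[Byte], precision: Int, scale: Int): Converted = {
    val unscaled = new BigInteger(bytes)
    // The *Exact methods throw ArithmeticException instead of silently truncating.
    if (scale == 0 && precision <= 10) AsInt(unscaled.intValueExact())
    else if (scale == 0 && precision <= 18) AsLong(unscaled.longValueExact())
    else AsString(new JBigDecimal(unscaled, scale).toString)
  }

  /** Formats an INT96 value the way Timestamp.toString does.
    * Assumption: 8 bytes nanos-of-day then 4 bytes Julian day, both little-endian.
    */
  def int96ToString(bytes: Array[Byte]): String = {
    require(bytes.length == 12, "INT96 values are exactly 12 bytes")
    val buf = ByteBuffer.wrap(bytes).order(ByteOrder.LITTLE_ENDIAN)
    val nanosOfDay = buf.getLong()
    val julianDay = buf.getInt()
    val date = LocalDate.ofEpochDay(julianDay - JulianDayOfEpoch)
    val time = LocalTime.ofNanoOfDay(nanosOfDay)
    Timestamp.valueOf(date.atTime(time)).toString
  }
}
```

With the thresholds as listed, a 10-digit unscaled value can exceed the Int range, which is why the sketch uses `intValueExact` and fails loudly in that case rather than wrapping around.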

Changes

Tests

Tested

sbt clean test

CLAassistant commented 5 years ago

CLA assistant check
All committers have signed the CLA.

gna-phetsarath commented 4 years ago

@johnynek and @non: Would it be possible for you to review? Thank you.