Open GregoryKimball opened 5 months ago
Just noting that pandas does not support datetime32/64[D]
and I don't think has plans to (because daily resolution logic gets murky with timezone data and DST), but should still be feasible for cudf Python
That suggests to me that we want this support in pylibcudf, but not in cuDF Python. That wouldn't be as convenient for the original use case that @GregoryKimball was interested in though.
Is your feature request related to a problem? Please describe. Libcudf support 5 timestamp types, but cuDF python only supports 4.
TIMESTAMP_DAYS
int32
Date32Type
TIMESTAMP_SECONDS
int64
datetime64[s]
TimeUnit::SECOND
TIMESTAMP_MILLISECONDS
int64
datetime64[ms]
TimeUnit::MILLI
TIMESTAMP_MICROSECONDS
int64
datetime64[us]
TimeUnit::MICRO
TIMESTAMP_NANOSECONDS
int64
datetime64[ns]
TimeUnit::NANO
Describe the solution you'd like Adjust cuDF python type support to leverage all of the available libcudf types.
Additional context I was generating TPC-H files for Velox benchmarking, and cuDF python cannot write files with int32 timestamp days type. This means that Velox fails to execute scalar functions without additional query changes to cast from Velox
TIMESTAMP
to VeloxDATE
.