NVIDIA / spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs
https://nvidia.github.io/spark-rapids
Apache License 2.0
798 stars 232 forks source link

[FEA] Support time zones that are not UTC #6839

Open revans2 opened 2 years ago

revans2 commented 2 years ago

Is your feature request related to a problem? Please describe. This is related to #6603 but a super set of it. I will list out the issues here to complete both sets of work, and then reference some of them in #6603 too.

The following are the tasks to do:

Phase I (P0) - Release 23.12

Phase II (P0) - Release 24.02

Phase III/Uncategorized (P1)

Uncategorized

Test improvements

Benchmarking

In general we are going to try to have timestamps follow what Spark does and store them in UTC. Then convert the inputs to the desired timezone whenever we need to operate on it in a different time zone.

We should also go through other operators and see if there are things that we can/should support once we can support timezones.

nvliyuan commented 1 year ago

Not sure if FromUTCTimestamp support PST should be a sub task. CC @viadea

res-life commented 11 months ago

Follow on issue: https://github.com/NVIDIA/spark-rapids/issues/9570

revans2 commented 11 months ago

Adding in some issues for testing too

https://github.com/NVIDIA/spark-rapids/issues/9627

https://github.com/NVIDIA/spark-rapids/issues/9633

res-life commented 10 months ago
Test cases statistics: code snapshot TZ passed skipped xfailed xpassed total
Before xfail all the impacted cases #9773 UTC 19361 2210 373 283 22227
After xfail all the impacted cases #9773 non-UTC 7589 2214 3719 8613 22135
Introduced fine grained checker from #9719 non-UTC 20173 1411 358 382 22324
res-life commented 7 months ago

Not planing work on this for release 24.04 @sameerz can we move it to next release?