Open alamb opened 5 hours ago
take
@alamb Is there a certain range within which floating-point numbers are compared as equal to each other? Any best practices?
For determining groups, the comparison is exact equality, even for floats. This is different for aggregates like SUM / AVG, where floating-point roundoff needs to be accounted for when comparing results.
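To make the distinction concrete, here is a minimal sketch, not DataFusion's actual code. The helper names `group_counts` and `approx_eq` are hypothetical, and using the IEEE 754 bit pattern as the group key is an assumption standing in for "exact equality" (it treats `0.0` and `-0.0` as different keys, which a real engine may handle differently).

```rust
use std::collections::HashMap;

/// Count rows per group using exact equality on the float value.
/// Assumption: the raw bit pattern is used as the key, so two values land
/// in the same group only if they are bit-for-bit identical.
fn group_counts(values: &[f64]) -> HashMap<u64, usize> {
    let mut counts = HashMap::new();
    for v in values {
        *counts.entry(v.to_bits()).or_insert(0) += 1;
    }
    counts
}

/// Compare two aggregate results (e.g. two SUMs computed in a different
/// order) with a relative tolerance instead of exact equality.
fn approx_eq(a: f64, b: f64, rel_tol: f64) -> bool {
    if a == b {
        return true;
    }
    let scale = a.abs().max(b.abs());
    (a - b).abs() <= rel_tol * scale
}

fn main() {
    // 0.1 + 0.2 is not bit-identical to 0.3, so grouping sees two keys.
    let values = [0.1 + 0.2, 0.3, 0.3];
    println!("distinct groups: {}", group_counts(&values).len());

    // Summation order can change the last bits of the result, so sums
    // are compared with a tolerance.
    let forward: f64 = values.iter().sum();
    let backward: f64 = values.iter().rev().sum();
    println!("sums agree: {}", approx_eq(forward, backward, 1e-9));
}
```

The tolerance `1e-9` is only an illustrative choice; a suitable value depends on the data and the aggregate.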
Is your feature request related to a problem or challenge?
Part of https://github.com/apache/datafusion/issues/7065
We have a great aggregation fuzz tester that is run like this:
This fuzz tester is important to cover all the various combinations of types and columns and aggregates, given DataFusion has many different specialized code paths based on types.
The fuzz tester currently covers many different types, but not all of them. This ticket covers adding support for a few more types.
Describe the solution you'd like
Add support for the following data types:
Float32/Float64
Timestamp (Timestamp(Seconds, ..), Timestamp(Milliseconds, ..), ...)
Binary/LargeBinary/BinaryView
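As a rough illustration of what random value generation for these type families could look like, here is a self-contained sketch. It is not the fuzz tester's actual generator: `XorShift64`, `random_f32`, `random_f64`, `random_timestamp_millis`, and `random_binary` are all hypothetical names, and the sketch uses plain Rust values rather than Arrow arrays to avoid external crates.

```rust
/// Tiny xorshift64 PRNG so the sketch needs no external crates.
/// The seed must be non-zero, otherwise every output is zero.
struct XorShift64(u64);

impl XorShift64 {
    fn next_u64(&mut self) -> u64 {
        let mut x = self.0;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.0 = x;
        x
    }
}

/// Random f64 built from raw bits, so NaN, infinities, and subnormals
/// can all appear.
fn random_f64(rng: &mut XorShift64) -> f64 {
    f64::from_bits(rng.next_u64())
}

/// Random f32 built from the low 32 bits of the generator output.
fn random_f32(rng: &mut XorShift64) -> f32 {
    f32::from_bits(rng.next_u64() as u32)
}

/// Random timestamp as a signed count of milliseconds since the epoch
/// (negative values are dates before the epoch).
fn random_timestamp_millis(rng: &mut XorShift64) -> i64 {
    rng.next_u64() as i64
}

/// Random byte string with a length in 0..=max_len.
fn random_binary(rng: &mut XorShift64, max_len: usize) -> Vec<u8> {
    let len = (rng.next_u64() as usize) % (max_len + 1);
    (0..len).map(|_| rng.next_u64() as u8).collect()
}

fn main() {
    // A fixed seed keeps a failing fuzz case reproducible.
    let mut rng = XorShift64(42);
    println!("f32: {:?}", random_f32(&mut rng));
    println!("f64: {:?}", random_f64(&mut rng));
    println!("timestamp (ms): {}", random_timestamp_millis(&mut rng));
    println!("binary: {:?}", random_binary(&mut rng, 8));
}
```

Drawing floats from raw bits rather than a numeric range is a deliberate choice in this sketch: it exercises special values that a range-based generator would never produce.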
Describe alternatives you've considered
I think we can follow the model in this PR from @LeslieKid, where they added time/interval/decimal/utf8view: https://github.com/apache/datafusion/pull/13226
Additional context
No response