NVIDIA / spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs
https://nvidia.github.io/spark-rapids
Apache License 2.0
822 stars 235 forks source link

[FEA] optimize the multi-contains generated by rlike #11729

Open res-life opened 3 days ago

res-life commented 3 days ago

Is your feature request related to a problem? Please describe.

#11413 added a GpuMultiContains which generates an array of bool column. #10977added a GpuMultipleContains for rlike optimization, with a little diff, it returns a bool column. We can leverage GpuMultiContains in GpuMultipleContains, and execute an or operator on bool column array, then get a bool column.

Describe the solution you'd like Do perf test first to check if this can get benifit.