openlookeng / hetu-core

570 stars 423 forks source link

Added Bloom Filter for Orc Files #345

Open shalumariamsheji opened 2 years ago

shalumariamsheji commented 2 years ago

What type of PR is this?

task

What does this PR do / why do we need it:

Added bloom index to ORC Files for performance improvement while reading.

The table created with bloom filter is skipping few splits and reading less rows:

Screenshot from 2022-04-19 19-17-43 (copy)

Which issue(s) this PR fixes:

None

Special notes for your reviewers:

it-is-a-robot commented 2 years ago

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: shalumariamsheji To complete the pull request process, please assign sraghunandan You can assign the PR to them by writing /assign @sraghunandan in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files: - **[OWNERS](https://github.com/openlookeng/hetu-core/blob/master/OWNERS)** Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment
Nitin-Kashyap commented 2 years ago

lgtm