flyteorg / flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
https://flyte.org
Apache License 2.0
5.48k stars 585 forks source link

[Plugin]BlazingSQL and Rapids dataframe support #1556

Open kumare3 opened 2 years ago

kumare3 commented 2 years ago

Why would this plugin be helpful to the Flyte community It would be great to have a simple plugin that would allow users to simply write BlazingSQL queries in their workflows to manipulate data-frames on a GPU as part of their Flyte workflows.

An example of writing a query from the User point of view could be

task = BlazingSQL(
     input_schema=FlyteSchema,
    outputs=...,
    query=""" SELECT count(*) FROM taxi GROUP BY year(key)""",
    use_gpus=False, # True by default
)

Type of Plugin

DasithEdirisinghe commented 2 years ago

@kumare3 can you assign me to this issue. and refer me to some more resources relevant to this issue

samhita-alla commented 2 years ago

Hi, @DasithEdirisinghe! Thanks for looking into this. :)

This plugin has to be a task template plugin and on the same lines as that of SQLAlchemy plugin.

During plugin development, follow the directory structure used for SQLAlchemy plugin. To test the plugin, refer to https://docs.flyte.org/projects/flytekit/en/latest/contributing.html#plugin-development.

kumare3 commented 2 years ago

@DasithEdirisinghe are you working on this?

DasithEdirisinghe commented 2 years ago

@kumare3 yes I started working

kumare3 commented 2 years ago

@kumare3 yes I started working

Cool I am assigning it to you! Thank you

samhita-alla commented 2 years ago

@DasithEdirisinghe, two more days till Hacktoberfest ends! Would you mind creating a PR in the next day or two?

github-actions[bot] commented 1 year ago

Hello 👋, This issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will close the issue if we detect no activity in the next 7 days. Thank you for your contribution and understanding! 🙏

github-actions[bot] commented 1 year ago

Hello 👋, This issue has been inactive for over 9 months and hasn't received any updates since it was marked as stale. We'll be closing this issue for now, but if you believe this issue is still relevant, please feel free to reopen it. Thank you for your contribution and understanding! 🙏

github-actions[bot] commented 2 months ago

Hello 👋, this issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will engage on it to decide if it is still applicable. Thank you for your contribution and understanding! 🙏