airbytehq / PyAirbyte-Hackathon

Tasks for PyAirbyte Hackathon June 2024
0 stars 2 forks source link

Create dbt package generator using GenAI and PyAirbyte #6

Open aaronsteers opened 1 month ago

aaronsteers commented 1 month ago

Summary

The goal with this application is to take a specific raw data schema for a source being run with PyAirbyte and to auto-generate a simple dbt project for that data.

This could be the foundation of a new type of integration opportunity for Airbyte users.

Definition of Done

These are not specifically related to GenAI, but are the foundation of the code-gen:

The GenAI "code gen" application portion of this project is:

In terms of documentation:

Suggestions (Per Author's Discretion)

These are some suggestions - but are not required:

Resources to Assist

Hashcode-Ankit commented 4 weeks ago

hi @aaronsteers can I have more information on this topic would love to contribute to it.

siddhant3030 commented 3 weeks ago

can you give us the summary on this?

marcosmarxm commented 3 weeks ago

@Hashcode-Ankit it is yours

Hashcode-Ankit commented 3 weeks ago

Hi @marcosmarxm thanks for assigning it to me @aaronsteers can I expect some more details on it? I have set up Airbyte and built some connectors using Airbyte. Need more information about how to approach this so that I can break down it into even more simple chunks.....

aaronsteers commented 3 weeks ago

@Hashcode-Ankit - I've updated the above with a description. Admittedly, this is a large and ambitious project. Let us know if it is still interesting, and/or if you have any questions or changes you would like to propose.

Hashcode-Ankit commented 3 weeks ago

@aaronsteers Yes as I mentioned on Slack as well, I am very much excited about it, and it matches my previous work on dbt and dbt-clickhouse.

Hashcode-Ankit commented 6 days ago

Hi @aaronsteers @marcosmarxm as discussed on slack i have pushed the code for it : https://github.com/Hashcode-Ankit/pyairbyte-dbt Need some more guidance on task to complete the exact use case @aaronsteers let me know if we can connect.