vmware / versatile-data-kit

One framework to develop, deploy and operate data workflows with Python and SQL.
Apache License 2.0
417 stars 54 forks source link

Collab Notebook: Processing data using SQL and local database #3061

Open antoniivanov opened 5 months ago

antoniivanov commented 5 months ago

For the Definition of done see the milestone description: https://github.com/vmware/versatile-data-kit/milestone/29

Create Google Collab notebook for

https://github.com/vmware/versatile-data-kit/wiki/Processing-data-using-SQL-and-local-database

Indranil0603 commented 4 months ago

Hi @antoniivanov i would like to work on the issue, would you suggest any starting points for it

antoniivanov commented 4 months ago

Hi @Indranil0603

Would you like to join our slack channel using. It may provide more efficient interaction :

  1. Join the CNCF Slack workspace.
  2. Join the #versatile-data-kit channel.

    To be able to efficiently work the ticket.

    1. ARe you familiar with notebooks and Google Collab ? If not , you may want to spend some time . This is a good starting point
    2. You may learn more about VDK and the integration between VDK and Notebooks :

The ultimate goal of the story is to write a Google Collab Notebook tutorial for our https://github.com/vmware/versatile-data-kit/wiki/Processing-data-using-SQL-and-local-database tutorial/example. Similar to https://bit.ly/vdk-ingest

Indranil0603 commented 4 months ago

Hi @antoniivanov ,

I have joined the slack channel. I have some experience on working with Collab notebooks. I will be going through the resources and start working on the notebook.

Indranil0603 commented 3 months ago

I have made the notebook - https://colab.research.google.com/drive/1ujx6AaKsMkjeTeXvMUZQHwdFuLR0Qhz4?usp=sharing

I will be looking forward to suggestions, also where do i need to commit the notebook. @antoniivanov