apache / gravitino-playground

A playground to experience Gravitino
Apache License 2.0
31 stars 31 forks source link

Add Flink-Hive use case in the Jupyter notebook #91

Open TungYuChiang opened 3 weeks ago

TungYuChiang commented 3 weeks ago

Feature Request:
We propose adding a new Jupyter notebook to the gravitino-playground repository that demonstrates how to use the Flink stream processing engine to query a Hive database. The notebook will provide an example of setting up the PyFlink environment, connecting to a Hive database, executing queries, and displaying the results. This feature will showcase how to leverage Flink and Hive in a Jupyter notebook environment.

Requirements:

  1. Set up PyFlink Environment: Integrate the PyFlink API and configure the Flink environment.
  2. Connect to Hive Database: Set up the connection between Flink and Hive, and demonstrate how to execute queries on the Hive database.
  3. Query and Result Presentation: Write example queries and display the results within the notebook.

Resources for Reference: