apache / gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
https://gravitino.apache.org
Apache License 2.0
971 stars 304 forks source link

[EPIC] Gravitino complete docker experience playground #791

Closed xunliu closed 10 months ago

xunliu commented 10 months ago

What would you like to be improved?

It would be great if we could easily launch a full Gravitino experience environment so that more people could learn about Gravitino.

We can use docker-compose to launch this environment, and use DBeaver/DataGrip to connect it and execute SQL.

We need fixed issue

The process of mounting trino-conf into the Trino-docker container and setting the correct configuration can be quite complex. Additionally, when creating a hive-catalog via RESTful, it's necessary to wait for the hive-catalog entity record to sync in Trino before executing the show catalogs SQL.

Furthermore, the Gravitino Server runs on the local host, while Hive and Trino run in a docker container due to macOS limitations. This means that the host cannot directly access the hive and Trino instance IP address, requiring the execution of ./dev/docker/tools/mac-docker-connector.sh to enable network connectivity between the host and the container.

In summary, the integration and setup of Hive, Trino, and Gravitino present significant challenges and complexities.

Subtasks

xunliu commented 10 months ago

hi @jerryshao @justinmclean If you have any suggestions for this issue, Please add comments, Thanks!

mchades commented 10 months ago

DataGrip is a paid commercial product. Can we use other open-source or free clients as alternatives to reduce the user's barrier to use?