h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

Data Connector for Druid.io #10619

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

Feasibility: I looked into the details.

From a technical point of view:

Result: it is feasible, but it will need non-trivial technical effort.

exalate-issue-sync[bot] commented 1 year ago

Michal Kurka commented: This project looks interesting: https://github.com/himanshug/druid-hadoop-utils

This might be the right approach for data ingestion, because people complain that the SELECT API is slow: https://groups.google.com/forum/#!topic/druid-development/uWGWCPlg0c8
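
For context, here is a rough sketch of what pulling rows through Druid's native Select query looks like from a client. It only builds the JSON body for one page and does not contact a broker. The data source name `events`, the interval, and the page size are hypothetical, and the payload shape follows the Select query as documented for older Druid releases (it was later superseded by the Scan query), so treat it as an assumption and not as the connector's design.

```python
import json


def build_select_query(data_source, interval, page_size, paging_identifiers=None):
    """Build the JSON body for one page of a Druid native Select query.

    All argument values used below are illustrative placeholders.
    """
    return {
        "queryType": "select",
        "dataSource": data_source,
        "intervals": [interval],
        "granularity": "all",
        "dimensions": [],  # empty list asks for all dimensions
        "metrics": [],     # empty list asks for all metrics
        "pagingSpec": {
            # Identifiers from the previous response; empty for the first page.
            "pagingIdentifiers": paging_identifiers or {},
            "threshold": page_size,  # maximum rows returned per page
        },
    }


# First page of a hypothetical export: no paging identifiers yet.
first_page = build_select_query("events", "2017-01-01/2017-01-02", 1000)
print(json.dumps(first_page, indent=2))
```

Each page is a separate request, with the paging identifiers from one response passed into the next, so exporting a large data source means many sequential round trips. That is consistent with the complaints in the thread linked above, and it is why reading segments directly (as druid-hadoop-utils does) looks attractive.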

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-3720
Assignee: New H2O Bugs
Reporter: Venkatesh Yadav
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A