h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

Support for Azure Blob-Store - WASB Hadoop Filesystem #15023

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

Hi,

From h2ostream:

We are using Azure Blob Storage (via the WASB interface, part of Hadoop release 2.7.1: https://hadoop.apache.org/docs/stable/hadoop-azure/index.html), which is the default storage location for Hortonworks-based HDInsight clusters.

Running H2O in Hadoop mode does not seem to allow us to import data from there, as H2O appears to implement its own storage access.

Are there any plans for when this could be supported?

DinukaH2O commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-2083
Assignee: New H2O Bugs
Reporter: Raymond Peck
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A