TritonDataCenter / hadoop-manta

Hadoop Filesystem Driver for Manta
Apache License 2.0

Manta Filesystem for Hadoop

Introduction

This project provides a Hadoop FileSystem implementation for Manta, an open source object store. Unlike other object stores, Manta is strongly consistent and organizes storage as a hierarchical file system (like a Unix filesystem), whereas S3 and Swift use a key/value model. Manta's model aligns closely with the Hadoop FileSystem model, which leaves very few divergences from the default behavior.

Quirks / Divergences

Run Requirements

Build Requirements

Configuration

You will need the public/private key pair used to access Manta on the machine where Hadoop runs. It is often best to verify that these keys are set up correctly using the Node.js Manta CLI.

Configuration is done through the Hadoop configuration files or through environment variables. Refer to the table below for the available configuration options.

Configuration Parameters

The configuration options are the same as those of the Java Manta Client SDK. Hadoop-specific property keys use the same names as the corresponding system properties in the Java Manta SDK.
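
As a rough illustration of how an environment variable and a property key can name the same setting, the following Java sketch resolves a handful of values from both sources. The key names (`manta.url`, `manta.user`, `manta.key_id`, `manta.key_path`) and their environment variable counterparts are assumptions based on the Java Manta SDK's conventions; the class `MantaConfigSketch` and its `resolve` helper are hypothetical and not part of this project. The precedence shown (property over environment variable) is also an assumption of the sketch, not confirmed driver behavior.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper: resolves Manta settings from properties and environment variables. */
public class MantaConfigSketch {
    // Assumed pairs of {environment variable, property key}.
    static final String[][] KEYS = {
        {"MANTA_URL", "manta.url"},
        {"MANTA_USER", "manta.user"},
        {"MANTA_KEY_ID", "manta.key_id"},
        {"MANTA_KEY_PATH", "manta.key_path"},
    };

    /** Returns the resolved settings keyed by property name; unset values are omitted. */
    static Map<String, String> resolve(Map<String, String> env, Map<String, String> props) {
        Map<String, String> resolved = new LinkedHashMap<>();
        for (String[] pair : KEYS) {
            // Assumption: an explicit property overrides the environment variable.
            String value = props.get(pair[1]);
            if (value == null) {
                value = env.get(pair[0]);
            }
            if (value != null) {
                resolved.put(pair[1], value);
            }
        }
        return resolved;
    }

    public static void main(String[] args) {
        // Collect any matching JVM system properties (for example -Dmanta.user=...).
        Map<String, String> props = new HashMap<>();
        for (String[] pair : KEYS) {
            String value = System.getProperty(pair[1]);
            if (value != null) {
                props.put(pair[1], value);
            }
        }
        resolve(System.getenv(), props)
            .forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

Running the class with the relevant environment variables exported prints the settings it would pick up, which can be a quick sanity check before starting Hadoop.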

Installation

Please refer to the following guides for installing the Manta Filesystem for Hadoop on different Hadoop ecosystem applications.

License

The Manta Filesystem for Hadoop is licensed under the Apache 2.0 license.