infotechsoft / docker-accumulo

Docker images for running Accumulo cluster
MIT License

Accumulo Docker

A collection of Dockerfiles extending uhopper/hadoop, for creating an Accumulo cluster. The following Accumulo components are provided:

All images inherit from a common base image that provides Hadoop, ZooKeeper, and Accumulo installations under /opt/ and allows configuration through environment variables.

Accumulo configuration

The Hadoop and Accumulo configuration files are controlled via the following environment variable groups:

  1. CORE_CONF: affects /etc/hadoop/core-site.xml
  2. HDFS_CONF: affects /etc/hadoop/hdfs-site.xml
  3. YARN_CONF: affects /etc/hadoop/yarn-site.xml
  4. HTTPFS_CONF: affects /etc/hadoop/httpfs-site.xml
  5. KMS_CONF: affects /etc/hadoop/kms-site.xml
  6. ACCUMULO_CONF: affects /opt/accumulo/conf/accumulo-site.xml

Properties are specified by setting an environment variable with the appropriate prefix, in the form <PREFIX>_<PROPERTY>.

Due to restrictions imposed by Docker and docker-compose on environment variable names, the following substitutions are applied to property names:

The following are some illustrative examples:
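As a rough illustration of how a prefixed variable name could map to a property name, the Python sketch below assumes the substitution convention documented by the uhopper/hadoop base image (`___` becomes `-`, `__` becomes `_`, `_` becomes `.`). That convention is an assumption here, and the helper name `env_to_property` is hypothetical; this is not the images' actual entrypoint logic.

```python
import re

# Assumed convention (borrowed from uhopper/hadoop, not confirmed for these images):
#   "___" -> "-", "__" -> "_", "_" -> "."
_SUBSTITUTIONS = {3: "-", 2: "_", 1: "."}


def env_to_property(env_name: str, prefix: str) -> str:
    """Translate a <PREFIX>_<PROPERTY> variable name into a property name."""
    marker = prefix + "_"
    if not env_name.startswith(marker):
        raise ValueError(f"{env_name!r} does not start with {marker!r}")
    encoded = env_name[len(marker):]
    # Greedily match runs of one to three underscores and decode each run.
    return re.sub(r"_{1,3}", lambda m: _SUBSTITUTIONS[len(m.group())], encoded)


if __name__ == "__main__":
    print(env_to_property("ACCUMULO_CONF_instance_zookeeper_host", "ACCUMULO_CONF"))
    print(env_to_property("YARN_CONF_yarn_log___aggregation___enable", "YARN_CONF"))
```

Under these assumptions, ACCUMULO_CONF_instance_zookeeper_host would set the instance.zookeeper.host property in /opt/accumulo/conf/accumulo-site.xml.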

Additionally, the ACCUMULO_MEM_USAGE environment variable is used to bootstrap the memory settings of the initial Accumulo configuration. It accepts the values 512MB, 1GB, 2GB, or 3GB.

Dependencies

Accumulo requires running HDFS and ZooKeeper instances. HDFS can be configured by following the documentation at uhopper/hadoop, and ZooKeeper can be run using the official zookeeper image.