openminted / alvis-docker

Dockerizing Alvis and its components
6 stars 7 forks source link
docker nlp workflow-engine

Alvis Docker

Dockerizing Alvis and its components

OpenMinTeD compatible components

Find the OpenMinteD compatible components here.

The components are wrapped following the OpenMinTeD guidelines.

Use the docker image

A docker image for the alvisnlp engine in command line is available into Docker Hub

/!\ Prerequisites are docker installed (>= 1.13.1) and 4Go of free disk space

  1. display the alvis help
docker run mandiayba/alvisengine:1.0.0 \
       alvisnlp -help
  1. list the modules supported by alvis
docker run mandiayba/alvisengine:1.0.0 \
       alvisnlp -supportedModules
  1. display the doc of the module named SimpleProjector
docker run mandiayba/alvisengine:1.0.0 \
       alvisnlp -moduleDoc GeniaTagger
  1. run an alvis workflow that trains a ML model for binary relation extraction from a text corpus
docker run -i --rm -v $PWD/workdir:/opt/alvisnlp/data \
       -a stderr mandiayba/alvisengine:1.0.0 \
       alvisnlp /opt/alvisnlp/data/plans/train.plan
  1. run an alvis workflow that classifies binary relations from a text corpus
docker run -i --rm -v $PWD/workdir:/opt/alvisnlp/data \
          -a stderr mandiayba/alvisengine:1.0.0 \
           alvisnlp /opt/alvisnlp/data/plans/predict.plan
  1. run the train workflow with the main parameters passed to the workflow
    docker run -i --rm -v $PWD/workdir:/opt/alvisnlp/data -a stderr mandiayba/alvisengine:1.0.0 \
           alvisnlp \
           -param train textDir /opt/alvisnlp/data/corpus/train \
           -param dev textDir /opt/alvisnlp/data/corpus/dev \
           -param test textDir /opt/alvisnlp/data/corpus/test \
           -param TEESTrain model /opt/alvisnlp/data/models \
           /opt/alvisnlp/data/plans/train.plan

The train and classify workflows (called plans into alvis) are based on GeniaTagger and TEES tools integrated to AlvisNLP. The corpus used is Bacteria Biotope 2016. The binary relation here is named "Lives_in" ant it expresses the fact that some bacteries live in some habitats.

The docker image creation

* Not the latest version, we might want to test with the latest version.