Bibliome / alvisnlp

ALvisNLP corpus processing engine
https://bibliome.github.io/alvisnlp/
Apache License 2.0
16 stars 6 forks source link
alvis corpus-processing java machine-learning natural-language-processing nlp pipeline workflow workflow-engine

Description

AlvisNLP is a configurable NLP batch processing pipeline. It annotates text documents for the semantic annotation of textual documents. It integrates Natural Language Processing (NLP) tools for sentence and word segmentation, named-entity recognition, term analysis, semantic typing and relation extraction. These tools rely on resources such as terminologies or ontologies for the adaptation to the application domain. Alvis NLP/ML contains several tools for (semi)-automatic acquisition of these resources, using Machine Learning (ML) techniques. New components can be easily integrated into the pipeline. Part of this work has been funded by the European project Alvis and the French project Quaero.

AlvisNLP is held by the Bibliome group at Inra Jouy-en-Josas, France

See

Please contact Robert Bossy if you have any questions.

Foreword

This file contains instructions to download, compile AlvisNLP, and install the command-line interface.

The instructions assume you are running on a Unix system with a shell. If you are running on Windows, then check the instructions in the WINDOWS.md file.

Prerequisites

Download

git clone https://github.com/Bibliome/alvisnlp

Compile and build

cd alvisnlp

mvn clean install

Command-line interface

Install

Host-specific parameter values

We recommend that you set default parameter values for your host. These parameter values avoid to set parameters in plans for external tools.

cp share/default-param-values.xml.template share/default-param-values.xml

This will create a standard default parameter file in share/default-param-values.xml. Edit this file and fill parameter values appropriate for your host.

Default command-line options

If you wish your installed AlvisNLP to run with default command-line options, then you put them on the file named default-options.txt in the share directory. Once installed, the options will be automatically prepended to each invocation of alvisnlp.

In the share directory, there is an example file named default-options.txt.template.

Copy files to installation directory

./install.sh DIR

DIR is the base directory of your AlvisNLP install. This directory must exist. Launch this script as root if necessary.

Running AlvisNLP

DIR/bin/alvisnlp -help

DIR is the base directory of your AlvisNLP install. You migh also add the DIR/bin sub-directory to your PATH environment variable.

export PATH=DIR/bin:$PATH