nytud / NoSketch-Engine-Docker

A NoSketch Engine Docker image which is easy to use
GNU Lesser General Public License v3.0
0 stars 0 forks source link

NoSketch Engine Docker

This is a dockerised version of NoSketch Engine, the open source version of Sketch Engine corpus manager and text analysis software developed by Lexical Computing Limited.

This docker image is based on Debian stable and the NoSketch Engine build and installation process contains some additional hacks for convenient install and use. See Dockerfile for details.

TL;DR

  1. git clone https://github.com/ELTE-DH/NoSketch-Engine-Docker
  2. make pull – to download the docker image
  3. make compile – to compile sample corpora
  4. make execute – to run a CLI query on susanne corpus
  5. make run – to launch the docker container
  6. Navigate to http://localhost:10070/ to try the WebUI

Features

Further info on how to analyse a plain text corpus by e-magyar and convert it to the right format suitable to fit in the system.

Usage

1. Get the Docker image

2. Compile your corpus

  1. Put vert file(s) in: corpora/CORPUS_NAME/vertical directory\ (see examples in corpora/susanne/vertical and corpora/emagyardemo/vertical directories)
  2. Put config in: corpora/registry/CORPUS_NAME file\ (see examples in corpora/registry/susanne and corpora/registry/emagyardemo)
  3. Compile all corpora listed in corpora/registry directory using the docker image: make compile
    • To compile one corpus at a time, use the following command: make execute CMD="compilecorp --no-ske CORPUS_REGISTRY_FILE"

3a. Run the container

  1. Run docker container: make run
  2. Navigate to http://SERVER_NAME:10070/ to use

3b. CLI Usage

4. Additional commands

make parameters, multiple images and multiple containers

By default,

If there is a need to change these, make commands can be supplemented by IMAGE_NAME=myimage and/or CONTAINTER_NAME=mycontainer and/or PORT=myport.

E.g. make build IMAGE_NAME=myimage build an image called myimage; and make run IMAGE_NAME=myimage CONTAINER_NAME=mycontainer PORT=12345 launches the image called myimage in a container called mycontainer which will use port 12345. In the latter case the system will be availabe at http://SERVER_NAME:12345/.

See the table below on which make command accepts which parameter:

command IMAGE_NAME CONTAINER_NAME PORT
make pull . . .
make build . .
make compile . .
make execute . .
make run
make connect . .
make stop . .
make clean .

In the rare case of multiple different docker images, be sure to name them differently (by using IMAGE_NAME).\ In the more common case of multiple different docker containers running simultaneously, be sure to name them differently (by using CONTAINER_NAME) and also be sure to use different port for each of them (by using PORT).

If you want to build your own docker image be sure to include the IMAGE_NAME parameter into the build command: make build IMAGE_NAME=myimage and also provide IMAGE_NAME=myimage for every make command which accepts this parameter.

License

The following files in this repository are from https://nlp.fi.muni.cz/trac/noske and have their own license:

The rest of the files are licensed under the Lesser GNU GPL version 3 or any later.