minhptx / iswc-2016-semantic-labeling

Apache License 2.0
11 stars 6 forks source link

Semantic Typing

Automatically assign semantics to large data sets from heterogeneous sources based on their features using several Statistical and Machine Learning techniques.

Prerequisites

  1. Elasticsearch
  2. Pyspark
  3. scikit-learn
  4. pandas

Run API

  1. Build docker image

cd container; docker build -t isi/semantic-labeling .

  1. Start elasticsearch:

docker-compose up

  1. Calling API

bin/semantic_labeling.sh <train_dataset> <test_dataset> <train_dataset2>