rasa NLU (Natural Language Understanding) is a tool for intent classification and entity extraction. You can think of rasa NLU as a set of high-level APIs for building your own language parser using existing NLP and ML libraries. Find out more on the homepage of the project, where you can also sign up for the mailing list.
Extended documentation:
If you are new to rasa NLU and want to create a bot, you should start with the tutorial.
From pypi:
pip install rasa_nlu
From github:
git clone git@github.com:golastmile/rasa_nlu.git
cd rasa_nlu
python setup.py install
To test the installation, run the commands below. This starts the server with a very basic default model; you need to train your own model to do anything useful.
python -m rasa_nlu.server &
curl 'http://localhost:5000/parse?q=hello'
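The same request can be sent from Python. The sketch below is a minimal example, assuming the server started above is listening on localhost:5000 and that it replies with a JSON body. The helper names build_parse_url and parse are made up for this illustration and are not part of rasa NLU.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_parse_url(text, host="localhost", port=5000):
    """Build the /parse URL with `text` passed as the q query parameter."""
    query = urlencode({"q": text})  # escapes spaces and special characters
    return "http://{}:{}/parse?{}".format(host, port, query)


def parse(text, host="localhost", port=5000, timeout=5):
    """Send `text` to a running server and decode the reply.

    Assumption: the reply body is UTF-8 encoded JSON.
    """
    with urlopen(build_parse_url(text, host, port), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (needs the server to be running):
#     print(parse("hello"))
```

Building the URL with urlencode rather than string concatenation keeps messages containing spaces or punctuation from producing a malformed query.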
Before you start, ensure you have the latest version of Docker Engine on your machine. You can check whether Docker is installed by typing docker -v in your terminal.
docker build -t rasa_nlu .
docker run -p 5000:5000 rasa_nlu start
Caveat for Docker for Windows users: please share your C: drive in the Docker settings, and add -v C:\path\to\rasa_nlu:/app to your docker run commands for download and training to work correctly.
curl 'http://localhost:5000/parse?q=hello'
The intended audience is mainly people developing bots, starting from scratch or looking for a drop-in replacement for wit, LUIS, or api.ai. The setup process is designed to be as simple as possible. rasa NLU is written in Python, but you can use it from any language through an HTTP API. If your project is written in Python you can simply import the relevant classes. If you're currently using wit/LUIS/api.ai, you just:
run rasa NLU on your machine and point your existing API calls at localhost:5000/parse.
Because rasa NLU runs on your own machine, you do not have to make an https call to parse every message. These points are laid out in more detail in a blog post.
rasa is a set of tools for building more advanced bots, developed by LASTMILE. rasa NLU is the natural language understanding module, and the first component to be open sourced.
Short answer: English, German, and Spanish currently. Longer answer: If you want to add a new language, the key things you need are a tokenizer and a set of word vectors. More information can be found in the language documentation.
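To make the two ingredients concrete, here is a toy sketch of what a tokenizer and a set of word vectors contribute. It is purely illustrative and is not rasa NLU's own API: the function names, the regex-based tokenizer, and the tiny hand-written vector table are all assumptions made for this example.

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (a naive, regex-based tokenizer)."""
    return re.findall(r"\w+", text.lower())


def average_vector(tokens, vectors, dim):
    """Average the vectors of all known tokens into one sentence vector.

    Tokens missing from `vectors` are skipped; if none are known,
    a zero vector of length `dim` is returned.
    """
    known = [vectors[token] for token in tokens if token in vectors]
    if not known:
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


# Hypothetical 2-dimensional vectors; real word vectors have far more dimensions.
toy_vectors = {"hola": [1.0, 0.0], "mundo": [0.0, 1.0]}
sentence_vector = average_vector(tokenize("Hola, mundo!"), toy_vectors, dim=2)
```

A real language needs a tokenizer that handles its punctuation and word boundaries properly, and vectors trained on a large corpus, which is why both are listed as prerequisites.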
We are very happy to receive and merge your contributions. There is some more information about the style of the code and docs in the documentation.
In general the process is rather simple:
Your pull request will be reviewed by a maintainer, who might get back to you about any necessary changes or questions.
Licensed under the Apache License, Version 2.0. Copyright 2016 LastMile Technologies Ltd. Copy of the license.