-
# Install English JoBimText distributional thesaurus to Serelex search engine
## Background
A distributional thesaurus (DT) is a word similarity graph, where each node is a lexical item, such as noun…
-
## Motivation
We need to develop a prototype of the system that builds structured topics (creates a model) and is able to label new texts according to these topics. This prototype is supposed to have…
-
1. lemmatize the ukwac+wacky corpus using Jobimify tool:
```
frink:/home/panchenko/jobimify
```
- use the concatenation of these corpora http://cental.fltr.ucl.ac.be/team/~panchenko/d…
-
Add a settings button to Serelex:
![settings](https://cloud.githubusercontent.com/assets/1456830/7746787/832b59c0-ffb5-11e4-897b-b78d0e09df9a.png)
1. Find a similar settings image (a wheel) on the i…
-
**Is your feature request related to a problem? Please describe.**
Word embeddings can be considered one way of finding synonyms, but give no way of distinguishing between antonyms and synonyms other…
-
# Background
So, I've suggested to integrate Distributional Semantics into the YodaQA pipeline by using JoBim Text, a framework developed by TU Darmstadt (in Germany) and IBM that is also used for do…
k0105 updated
8 years ago
-
Given a word, like 'python' generate the list of candidate, like in Google 'python vs ...' .
1. Get all sentences containing the target words (python)
2. Classify them (first word = python, seco…
-
A useful project: Would be great also to have some usage examples in the readme (or a python notebook). I think this project can be very helpful for students and other users of JoBimText...
-
## General motivation
Computational lexical semantics is a subfield of Natural Language Processing that studies computational models of lexical items, such as words, noun phrases and multiword expres…
-
## Motivation
Currently one important component of the JoBimText pipeline is conducted in a non-distributed fashion, namely the word sense induction. This means transfer of files from the HDFS and ba…