-
BigramCollocationFinder.from_words() in collocations.py does strange things with duplicate words:
```
>>> b = nltk.collocations.BigramCollocationFinder.from_words('this this is is a a test test'.spli…
-
```
What is the feature you want?
I would like to be able to highlite text.
How important is it to you?
I use anymemo to learn languages, I want to highlight some collocations in
sentences so then I…
-
The ECCO database has many in-word tags. This breaks tokenization in collocations.
We could fix this by removing the element that causes the issue. In the case of ECCO, it's
```
Richard, how do you…
-
A search like public opinion should filter out both words, not just the full expression.
-
```
What is the feature you want?
I would like to be able to highlite text.
How important is it to you?
I use anymemo to learn languages, I want to highlight some collocations in
sentences so then I…
-
```
What is the feature you want?
I would like to be able to highlite text.
How important is it to you?
I use anymemo to learn languages, I want to highlight some collocations in
sentences so then I…
-
```
Elapsed: 10949ms
#1
smu = 0.6019654
smu = org.apache.lucene.index.collocations.CollocationScorer Object {
term: nus ntu eee
coincidentalTerm: smu
coIncidenceDocCount: 93
termADocFreq: 500
ter…
-
```
StandardWrapperValve[km.web.filters.RSApplication]: Servlet.service() for servlet km.web.filters.RSApplication threw exception
java.lang.IllegalArgumentException: docID must be >= 0 and < maxDoc=3…
-
Accessor functions:
texts()
words()
data() - (return only the attribs or texts + attribs?)
tokenizedTexts() - I suggest that when we run tokenize() we should store the result in the corpus object and…
-