-
There are cases where it's useful to index separate fields using different sets of stop words or with field-specific word separators. How difficult is this to add?
-
`datrie` returns a segmentation fault and exits python in py2.7.11:
```
>>> import datrie
>>> t = datrie.Trie(' foo bar ')
>>> state = datrie.State(t)
>>> it = datrie.Iterator(state)
>>> while it.nex…
-
When I type "London", i feel like it should start appearing first in autocomplete results for [Lon,Lond,Londo] even though there are other results that offer exact name matches.
-
Hi,
I'm using quanteda to generate ngrams for word prediction. Try the following:
test
-
Originally reported on Google Code with ID 217
```
I need a new meta collector that collects character ngram frequencies in the boundaries
of an annotation.
The existing collector collects them over …
-
Originally reported on Google Code with ID 208
```
Request: Add a feature for character ngrams in unit classification.
```
Reported by `Tobias.Horsmann` on 2014-10-29 15:30:20
-
Originally reported on Google Code with ID 219
```
The method call of
NGramUtils.getAnnotationCharacterNgrams()
should create character ngrams and mark the beginning/end of the span that is split
(e…
-
Originally reported on Google Code with ID 133
```
What steps will reproduce the problem?
1. Run the lucene POS ngram DFE on twitter data
2. Get soundex exception of "character cannot be phonetized"
…
-
@ziqizhang We can support three types of candidates extraction. This is optional as user can configure solr.TermRecognitionRequestHandler to rank and filter only on pre-processed term field.
- Part-o…
-
Hi,
Great job on making Caffe as a service. I wanted to find out a few more details about the text classification capabilities in Deep Detect. Are you using a embed layer (word2vec or bag of words)? …