-
A lexical unit like ^xyz1.1abc2.2$ isn't getting the right colors when it comes out of the transfer. The first pieces are correct, but everything else is incorrectly green.
Probably the same issue …
-
Hello, When I use bpe as the modeling unit to train the English ASR model, the output of model are bpe subwords, and words can be obtained by spaces and ‘__’. But this method doesn't seem to be able …
-
open-editions/corpus-joyce-ulysses-tei#47 represents a lot of @sk3853's work with identifying Joycean words, neologisms, compound words, and so on. It follows [the taxonomy from our discussion in open…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Problem description
Good afternoon colleagues, I would like to know if there is a possibility of making Freeca…
-
In one case, it is annotated where all three words are a `fixed` expression:
```
# sent_id = w01119076
# text = With her appearance finalized, Jasmine became Disney's first non-white princess as …
-
```
What steps will reproduce the problem?
1. Use a token filter that contains some set of words
2. Use an accepted word list that contains a disjoint set of words
3. run any semantic space main and s…
-
### 2.2 Create a document-term matrix from the preprocessed press releases and to explore top words (5 points)
**A.** Use the `create_dtm` function I provide (alternately, feel free to write your o…
-
Misspelled words which are not detected: avalible, handeled, evalulated, deciced, pressent, senting.
-
Hunspell is not very good at suggesting correct word form if word is compounded using COMPOUNDRULE feature.
For example: if word 'četiristosedamdesetosmoga' is written in dic file, Hunspell can sugge…
ghost updated
7 years ago
-
```
What steps will reproduce the problem?
1. Use a token filter that contains some set of words
2. Use an accepted word list that contains a disjoint set of words
3. run any semantic space main and s…