-
**Describe the bug**
Manticoresearch service crashes irregular.
The cause of the crash not found.
**Describe the environment:**
- `Manticore 5.0.2 348514c86@220530 dev (columnar 1.15.4 2fef34e…
-
It would be nice if the API provided a way of loading a custom dictionary without resorting to patching the data in the module. In some languages, the lemmatizer coverage can be rather poor, and other…
-
In UD_Persian 2.0, about 6% of words have empty lemma (underscore in CoNLL-U). What is the reason?
If these are punctuation symbols or foreign-origin/proper/indeclinable/whatever words where lemmat…
-
Update the glossary and style guide with information from the ALTLab dictionary database. Add quotes / citations / references in support of terms where possible. Make these documents more visible by i…
-
This work is a part of a master course DS Project @ University of Vienna
- Add [CheXpert](https://arxiv.org/abs/2004.09167) and [CheXbert](https://arxiv.org/abs/2004.09167) labelers to Knodle
- A…
-
**Describe the bug**
It seems like some tokens with apostrophe causing some misalignment in word-level's text.
**To Reproduce**
```python
import stanza
nlp = stanza.Pipeline("en", processors=…
-
- Nuitka version, full Python version, flavor, OS, etc. as output by *this exact* command.
2.2.3
Commercial: 2.5.1
Python: 3.11.9 (main, Apr 19 2024, 16:39:34) [GCC 11.2.0]
Flavor: Anaconda Pyth…
-
So far language data are stored as dictionaries, i.e. key/value pairs of words and ground forms. Python dictionaries are quite efficient in terms of speed and rather efficient space-wise.
However t…
adbar updated
6 months ago
-
**Describe the bug**
Wn loses some merged synsets in translation, even though the original CILI mappings correctly link the merged source synsets to the same target synset.
**To Reproduce**
F…
-
### Bug Description:
We utilize bigrams or trigrams in our CALL SUGGEST feature, which could potentially be the root cause of the problem we're observing. When the lemmatizer is enabled, it appears t…