-
## How to reproduce the behaviour
I'm trying to train the parser for Polish. I'm using the PDB treebank in the conllu format (because it contains one sentence per paragraph, I've used the option --n…
-
Hi,
I am trying to figure out how to annotate clefting constructions in Norwegian UDv2. (I am working on a gold-standard UD corpus for Norwegian.) I haven't found a lot about clefting in the document…
-
https://github.com/researchart/rose6icse/tree/master/submissions/reusable/mathewSLACC
## Authors
* Corresponding author → George Mathew - **Email**: [george2@ncsu.edu](mailto:george2@ncsu.edu…
-
We are trying to replicate your English results using tacotron2 for Arabic. While we were able to replicate English results seamlessly, we haven't been successful with Arabic. Because our data is not …
-
Are there any plans to consolidate all the immensely valuable information Jim and all the other people have collected about the ROOT binary data format and put it into a format description document? I …
-
From https://github.com/nltk/nltk/blob/develop/nltk/corpus/reader/wordnet.py#L1396, it looks like the line parsing wasn't done correctly for the example vs the definition of the Synset gloss:
```py…
-
I ran the following code:
from spacy import displacy
import zh_core_web_sm
nlp = zh_core_web_sm.load()
error Traceback (most recent call last)
in
…
-
## Todos
* [x] Extract corpus vectorization into independent unit
* [x] Add unit tests for existing behavior
* [x] Extend with linear weighting strategy
* [x] Read factor from config
* [x] ex…
-
We are realizing that the UDv2 guideline for [`PROPN`](https://universaldependencies.org/u/pos/PROPN.html) is a pretty radical departure from the previous approach, which for English followed PTB guid…
-
Reported by Takoboto in e-mail:
:::
I found a few typos in the file jpn_indices.csv. I'm not sure how to report it so here is the concerned 3 lines:
117073\t286593\t彼(かれ)[01]{彼の} 車 は|1 青 で 彼女…