-
I'd like to "merge" Ontonotes (coref) and PropBank (SRL) annotations.
Could someone provide me with a detailed instruction to do this?
-
**Describe the bug**
When trying to read a CoNLL-U formatted file with more than 10 columns the following error occurs:
`pyconll.exception.ParseError: The number of columns per token line must be …
-
我用的模型是hanlp.pretrained.mtl.CLOSE_TOK_POS_NER_SRL_DEP_SDP_CON_ELECTRA_BASE_ZH
模型在ner-msra任务的输出有一种为CAPACTITY,而在https://hanlp.hankcs.com/docs/annotations/ner/msra.html
中为CAPACITY。应该是模型最后拼写有问题?
-
The label of "AM-ADV" (or "C-V") appears to be wrongly classified as the tag of "V" in order to use the condition of `"V" in label` (judgement for a verb), when I used the script of "CoNLL_to_JSON.py"…
-
Hello all,
at IBM Research, we have been working on a layer of unified semantic annotations for a range of languages. We use a data-driven approach in which we re-use existing English Proposition Ba…
-
II am trying to use the naacl18 but when running preprocess_data.py I encounter this issue:
```
2020-10-07 17:36:24,165 - INFO - Building dataset for [train-salsa-2.0] with imagined_embeddings [we…
-
It seem that cat can't handle the large number of *.gold_conll in the train portion. The script print this warning and continue:
`./scripts/make_conll2012_data.sh: line 19: /bin/ls: Argument list t…
-
https://catalog.ldc.upenn.edu/LDC2012T13 says that EWT has 16,624 sentences. They actually have:
```
% wc -l `find . -name '*.tree'` | tail
...
10 ./reviews/penntree/278775.xml.tre…
-
Hi MMT team,
I don't know how to phrase my question, so here you go :-)
I noticed that for uppercase acronyms such as "IP", "TS", "DPS", they are translated as follows:
Ip, Ts, Dps
Any ide…
-
copied from https://github.com/UniversalDependencies/UD_English-EWT/issues/204
I would like to clarify the relation between https://catalog.ldc.upenn.edu/LDC2013T19 (OntoNotes 5.0) with https://cat…