-
I am using CoreNLP POS tagger to identify POS tags of tokens, but it gives me wrong tags for currency signs. For example, for € in this text, "Items under 50€", it says POS = CD (cardinal number), or …
-
## TL;DR
Unfortunately the current implementation of DependencyMatcher can take very very long time to match large documents ^^
## How slow ?
This is best illustrated with an example. Consider …
werew updated
3 years ago
-
## How to reproduce the behaviour
``` python
doc = self.nlp(sentence)
for sentence in doc.sents:
for ent in sentence.ents:
[process some stuff]
```
The code above ends up having n^2 r…
-
Let's say we have 3 proteins that get clustered together in an ortholog analysis. They all have very similar but slightly different annotations. Can this tool pick a consensus annotation? If so, what …
-
The PR #437 introduced replacement of certain XML tags like `` etc. by placeholders. Discuss:
- Is this the desired behaviour?
- What is the common practise in the NLP community?
- Do placeholders …
-
As a data scientist working on the Corporate Data Ingestion stream, i am able to access and process ESG / sustainability reports from S&P Global.
This requires access to S3 bucket where AWS has loa…
-
```
Traceback (most recent call last):
File "/Users/nmadnani/anaconda/lib/python3.6/site-packages/conda/cli/main.py", line 110, in main
from ..activate import main as activator_ma…
-
Hey all,
I'm hoping to do achieve some gospel-spreading and community growth around binder through my Software Sustainability Institute Fellowship by attending Julia, R and Python community confere…
-
I am not sure if I haven't look thoroughly enough in the docs but I want to add abbreviation exceptions to the sentence tokenizer.
E.g. `Operating income incl. JV was SEK 2.1 b. with an operating m…
-
I would like to use the coreference feature in CoreNLP, but I have had problems with the output file.
I ran the following code:
./corenlp.sh -annotators tokenize,ssplit,pos,lemma,ner,parse,core…