-
Multilingual documents are common in today's computer age. A plethora of these
documents exists in the form of translations, books, operational manuals, etc. The
abundance of these multilingual do…
-
Likely a stream source/sink.
-
First off, thanks for open sourcing the code!
I am trying to run topic modeling on my own dataset. I set it up to look like the `yelp` dataset and modified the code accordingly. However, an error
```
…
-
I changed the corpus from TED-LIUM release 2 to [release 1](http://www.openslr.org/7/). Right after training started, I got an exception like this:
```
+ export ds_importer=ted
+ ds_importer=t…
-
I admit this is a weird question/request :)
Swagger2openapi converts a swagger 2 apispec by modifying the in-memory json structure in place, then writing it to the output file. One byproduct of thi…
-
Dell distributes microcode files for the SH7757 BMC, which include an H8S-2117A.
See the attached files:
[h8s.zip](https://github.com/airbus-seclab/cpu_rec/files/2778356/h8s.zip)
-
Symbols live in [`spacy/symbols.pyx`](https://github.com/explosion/spacy/tree/develop/spacy/symbols.pyx) and are used to reference attributes internally and externally without using strings. They incl…
-
All I want for Christmas is a core `nb` model for spacy. And we're getting close!
Last week the Language Technology Group at the University of Oslo released [NER annotations on top of the Norwegian…
-
Hey,
I think `pair_list = []` should be moved under the `while True` statement in the `make_pair_iter` function in `pair_generator`.
Why?
Because we want `pair_list` to be reset each time we need to yi…
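
To illustrate the point, here is a minimal sketch. It is not the actual `pair_generator` code: the two function names, the `samples` argument, and the loop bodies are simplified assumptions made up for this example. It compares a generator that creates the list once before the loop with one that recreates it inside `while True`.

```python
from itertools import islice


def make_pair_iter_shared(samples):
    """Hypothetical variant: the list is created once, outside the loop."""
    pair_list = []  # never reset, so it keeps growing across passes
    while True:
        for a, b in samples:
            pair_list.append((a, b))
        yield list(pair_list)  # copy, so earlier results are not mutated


def make_pair_iter_reset(samples):
    """Hypothetical variant: the list is recreated on every pass."""
    while True:
        pair_list = []  # reset before building the next batch
        for a, b in samples:
            pair_list.append((a, b))
        yield pair_list


# Made-up sample pairs, only for demonstration.
samples = [("d1", "d2"), ("d1", "d3")]

shared_sizes = [len(p) for p in islice(make_pair_iter_shared(samples), 3)]
reset_sizes = [len(p) for p in islice(make_pair_iter_reset(samples), 3)]

print(shared_sizes)  # [2, 4, 6]
print(reset_sizes)   # [2, 2, 2]
```

In this sketch the first variant yields ever larger lists that contain stale pairs from earlier passes, while the second yields a fresh list each time. Whether the real function behaves the same way depends on code not shown here.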
-
## How to reproduce the behaviour (requires Python 3.6+ because of f-strings)
```python
from spacy.lang.sv import Swedish
nlp = Swedish()
doc = nlp(u"Provar att tokenisera en mening med ord i.")…