-
With phyon3.7
`python app.py search` failed as below:
```
(py37-multires-lyrics-search) sz@pc:/mnt/d/jina-ai/multires-lyrics-search$ python app.py search
chunk_seg@29123[I]:starting jina.p…
-
**Describe the bug and error messages (if any)**
I've tried to run:
segmenter = DeepSegment('en')
text = Path('data/bandt.txt').read_text()
tokens = segmenter.segment(text)
In both python3.7 …
-
Sergey, i did my best to understand how to use the ArabicTokenizer, you can see my try in the following code. i hope to check it and see if this is the best way of use.
i am also trying to set the par…
-
This thread to discuss the design of output of apps as specified in their `appmetadata`, particularly in the`produces` field.
-
As of v0.3.3, pySBD shows destructive behavior in some edge-cases even when setting the option `clean` to `False`.
When dealing with OCR text, pySBD removes whitespace after multiple periods.
**…
-
The segmenter in wiki-bert-pipeline operates on [a line-by-line basis](https://github.com/jbrry/wiki-bert-pipeline/blob/bfe374cde2f0b4a7d517514b1ab5d4bd2c86e9c6/scripts/udtokenize.py#L62) which means …
jbrry updated
3 years ago
-
`import spacy
nlp = spacy.load('en_core_sci_sm')
print('the first entence segmentation:')
doc1 = nlp('Positive for translocation or inversion events involving the ROS1 gene')
for i,sent in enume…
-
**Question**
Hi there, I tried to apply haystack to a python api for a school project, which receives a file and then list of queries from users in a loop.
The contexts in the result returned fr…
-
This is for the some-hacks-branch so maybe it is already fixed in melvin's branch. I just don't dare switch right now, sorry!
after the importation of all new data in the Arawak dataset, I run the se…
-
The latest versions of Python are more strict wrt. escape in regex.
For instance with 3.6.8, there are 10+ warnings like this one:
```
...
lib/python3.6/site-packages/nltk/featstruct.py:2092: Depr…