-
It would be neat to add some sort of sentence segmentation to the query-time text analysis, so that a sentence boundary triggers a break in tagging. For example (a very silly one!), the input document text is:
" I want to buy s…
-
first as on https://ichi.moe
- https://readevalprint.tumblr.com/post/97467849358/who-needs-graph-theory-anyway
- https://github.com/tshatrov/ichiran
or https://jisho.org
-
We will need to be able to extract all sentences that use the word *Galaxy* from an input document. This implies that we are able to split an input document on sentence boundaries.
NLTK will be su…
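
A minimal sketch of the requirement above: split a document on sentence boundaries, then keep only the sentences that mention a given word. It deliberately swaps in a naive regular-expression splitter so it runs without downloading any models; it is not NLTK's tokenizer, and the function names and the sample text are assumptions made for illustration.

```python
import re

# Naive boundary rule (an assumption, not NLTK's Punkt model):
# a sentence ends at '.', '!' or '?' followed by whitespace.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def split_sentences(text):
    """Split text into sentences using the naive boundary rule."""
    return [s.strip() for s in _BOUNDARY.split(text) if s.strip()]


def sentences_with_word(text, word):
    """Return the sentences that contain `word` as a whole word (case-insensitive)."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    return [s for s in split_sentences(text) if pattern.search(s)]


# Hypothetical sample document.
doc = "The Galaxy launched today. Reviews are mixed! Is the Galaxy worth it?"
hits = sentences_with_word(doc, "Galaxy")
```

The naive rule breaks on abbreviations such as "Dr.", so a trained sentence tokenizer would likely be the better choice for real input; the filtering step stays the same either way.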
-
I've tried `segmentation-lemma-tagging/run_inf.py` with various modes on the following sentence:
> āsīdaśeṣanarapatiśiraḥsamarcitaśāsanaḥ pākaśāsana ivāparacaturudadhimālāmekhalāyā bhuvo bhartā pra…
-
Ex. 1: (filename: `What the Panama Papers Reveal About the Art Market - The New York Times.txt`)
`Q: All necessary financial disclosures were made at the time of sale." The International Consortium of…
-
Hi,
I've successfully built my project into a single executable file using Nuitka, but I'm encountering a segmentation fault when running it:
```
Nuitka: A segmentation fault has occurred. This…
-
Apostrophes `ʼ` are not parsed correctly - sometimes they appear in pairs to mark quotations. The second apostrophe usually gets assigned to the following sentence and if there is none (-> end of chap…
-
I am dealing with sentence segmentation of scientific papers with inline citations (and potential parsing errors).
It seems this tool cannot handle inline citations adaptively, such as (Auth…
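
One possible workaround, sketched under assumptions: ignore sentence terminators that fall inside parentheses or brackets, so that periods within a parenthetical citation do not end the sentence. This is not the behaviour of the tool discussed in the issue; the function name and the sample citation are hypothetical.

```python
def split_outside_parens(text):
    """Split on '.', '!' or '?' only when not inside (...) or [...]."""
    sentences = []
    start = 0
    depth = 0  # current nesting depth of parentheses/brackets
    for i, ch in enumerate(text):
        if ch in "([":
            depth += 1
        elif ch in ")]":
            depth = max(depth - 1, 0)  # tolerate unbalanced closers
        elif ch in ".!?" and depth == 0:
            nxt = text[i + 1 : i + 2]
            # Treat the terminator as a boundary only at end of text
            # or when whitespace follows it.
            if nxt == "" or nxt.isspace():
                sentences.append(text[start : i + 1].strip())
                start = i + 1
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


# Hypothetical sample with an inline citation.
sample = "Prior work exists (Smith et al. 2020). We extend it (see Fig. 2) here."
parts = split_outside_parens(sample)
```

An unclosed parenthesis caused by a parsing error would suppress every later boundary, so a real implementation would probably need to cap how long a parenthetical span may run.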
-
I have plain text without any punctuation or sentence stop in German.
How can I split it into sentences with full stops?
-
The sentence counter is not really usable in real-world scenarios and should at least support the inclusion of a common abbreviation list for the current language (or multiple languages?).
See http:/…
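
A rough sketch of what abbreviation-aware counting could look like: a token ending in a sentence terminator does not close a sentence if it appears in a per-language abbreviation list. The function name and the abbreviation entries are assumptions for illustration, not part of the existing sentence counter.

```python
import re

# Hypothetical abbreviation list for English; a real one would be loaded per language.
ABBREVIATIONS = frozenset({"dr.", "mr.", "e.g.", "p.m."})

# A terminator, optionally followed by closing quotes/brackets, at the end of a token.
_TERMINATOR = re.compile(r"[.!?][\"')\]]*$")


def count_sentences(text, abbreviations=ABBREVIATIONS):
    """Count sentences, ignoring terminators that belong to known abbreviations."""
    tokens = text.split()
    count = 0
    ended = True  # whether the previous token closed a sentence
    for tok in tokens:
        ended = bool(_TERMINATOR.search(tok)) and tok.lower() not in abbreviations
        if ended:
            count += 1
    # Trailing text without a terminator still counts as one sentence.
    if tokens and not ended:
        count += 1
    return count


sample = "Dr. Smith arrived at 5 p.m. sharp. He left early."
with_list = count_sentences(sample)
without_list = count_sentences(sample, abbreviations=frozenset())
```

With the list, the sample counts as two sentences; with an empty list, the same text is counted as four, which is the kind of overcounting described above.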