sentence-segmenter Search Results

553 results
for sentence-segmenter

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ggerganov/whisper.cpp #539

Get text divided into paragraphs?

This would be useful when transcribing to a text document because having the text divided into paragraphs makes it more readable. This may be outside the scope of this project. Just thought I would as…

sindresorhus updated 1 year ago
4
WorksApplications/Sudachi #199

Is there any part of speech table like https://www.unixuser.…

Is the part of speech rule of Sudachi compatible with any sentence segmenter POC rules? If no, is there any part of speech table like https://www.unixuser.org/~euske/doc/postag/ It would be helpfu…

i10416 updated 1 year ago
2
nipunsadvilkar/pySBD #118

Does not properly segment within quotations

When dealing with a long statement of facts quoted from legal text, the text is not split up within left double quotations and write double quotations. this is different than the " characterI cannot …

Hgherzog updated 1 year ago
1
nipunsadvilkar/pySBD #126

Specific string causes segment function to return empty arra…

**Describe the bug** A clear and concise description of what the bug is. **To Reproduce** input_str = """This is part 3 of MAMI-san's hair timelineThe previous hair timelines can be found hereOka…

NiftyliuS updated 3 months ago
1
howisonlab/softcite-dataset #667

Really long sentences in json files

Probably one for @kermitt2 :) I found some very long text elements in the sentence level json files (like > 3000 characters). e.g., file:line "quote to search" PMC4176174.json:1502 "3D re…

jameshowison updated 4 years ago
3
koreader/koreader #11728

Improve japanese word dictionary lookup with MeCab

I read Japanese and looking up words is a bit touch and go. Sometimes it works great, sometimes it comes up with nonsense. Japanese is tough because words are not separated with space. [MeCab](http…

leonard-slass updated 4 months ago
4
yanshao9798/segmenter #4

AssertionError assert len(raw) == len(sents) with 2018 share…

I've run `segmenter.py train` successfully with just `conllu` files in the workspace but when I include the raw text from the 2018 shared task as `raw_train.txt` and `raw_dev.txt`, I get ``` Traceba…

jowagner updated 5 years ago
5
poloniki/quint #7

Better pre-processing for sentences that contain abbreviatio…

Was having trouble with over 10% of my Whisper transcriptions getting through successfully. Problems with unicode encoding, or periods were included and considered end-of-sentence markers, when they…

turnkit updated 10 months ago
2
explodinggradients/ragas #1296

faithfulness.adapt(language="chinese") is no useful

[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. **Describe the bug** 【faithfulness.adapt(language="chinese") is no useful】 Ra…

beatG123 updated 1 day ago
1
nlpcl-lab/ace2005-preprocessing #9

Support for Arabic

Hi @bowbowbow, thanks a lot for putting this together. Was wondering if it will be easy to extend the content in main.py to support Arabic. In my initial trials, I tried the following: 1) Create…

spookyQubit updated 4 years ago
2

上一页 1...1 2 3 4 5 6 7...56 下一页

553 results for sentence-segmenter

553 results
for sentence-segmenter