-
splitting up https://github.com/HarbourMasters/Shipwright/issues/272
> Probably because of fast text, because I had it on and skimming.
https://user-images.githubusercontent.com/102232770/167232…
-
Continuation of #7275
### Steps to reproduce:
Download [this file](https://github.com/nvaccess/nvda/files/1110530/test.pdf) and try to read it with NVDA in Adobe Reader
### Actual behavi…
-
Wouldn't your text(test set) need to go through the same pre-processing as your training set?
I fail to understand why are you splitting the text using regex to split and returning the prediction of …
-
We should be able to support splitting words by methods other than the text casing. Libraries like [instant-segment](https://lib.rs/crates/instant-segment) exist to do that.
- `redneckbossryan` ->…
-
**Is your feature request related to a problem? Please describe.**
I believe this doesn't currently existing with QuestPDF, however happy to be proven wrong. We have a new report with user-generated …
-
## What problem does this solve or what need does it fill?
OpenType font files may contain multiple fonts (e.g. regular, bold, italic, thin). When using such a file for displaying text with Bevy, o…
-
```
What steps will reproduce the problem?
1. Create 2 identic HTML file that contain either of:
-- a list (ordered or unordered)
-- a table
2. delete a word from that element and change the style of …
-
currently, the last word of a question has a question mark attached to it which is processed as a token. All question marks in the question and answer text should be removed prior to splitting the tex…
-
Look for instances where splitting text with .split(" "), tokenize with nltk instead or figure out how to tokenize with more than whitespace
-
When using an embedding model for text vectorization, I sometimes encounter exceptions. The cause of the exception is that the input exceeds the model's maximum context token size. As a result, I bega…