sentence-tokenizer Search Results

1000+ results for sentence-tokenizer

vespa-engine/vespa #30967

Add embedding instruction prompt support to hf-embedder

It's becoming the norm to have prompt prefixes for text embedding models. I think we should add this to the [hf-embedder](https://docs.vespa.ai/en/reference/embedding-reference.html#huggingface-embedd…

jobergum updated 6 months ago
1
run-llama/llama_index #14793

[Question]: Error

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question I want to use the following: LLM: Llama 2 7B chat Embed Model: sentence-transformers/a…

medhsv updated 4 days ago
8
nltk/nltk #1995

word_tokenize keeps the opening single quotes and doesn't pa…

`word_tokenize` keeps the opening single quotes and doesn't pad them with spaces; this is to make sure that the clitics get tokenized as `'ll`, `'ve`, etc. The original treebank tokenizer has the sam…

alvations updated 1 week ago
8
nltk/nltk #1214

Incorporate more accurate sentence-splitter, tokenizer, and/…

Among open issues, we have (not an exhaustive list): - #135 complains about the sentence tokenizer - #1210, #948 complain about word tokenizer behavior - #78 asks for the tokenizer to provide offsets …

nschneid updated 3 years ago
27
daswer123/xtts-webui #66

Not working with Hindi language

I’m getting an error while trying the Hindi language: File "C:\Users\contact\Desktop\xtts-webui-main\venv\lib\site-packages\TTS\tts\models\xtts.py", line 526, in inference text = split_sentence(text, la…

iamjamilkhan updated 4 months ago
6
run-llama/llama_index #15908

[Bug]: OptimumEmbedding(BaseEmbedding) cannot be selected be…

### Bug Description https://github.com/run-llama/llama_index/blob/162f5a0523f5a4de33f8cc056ec2b24713d2ee9e/llama-index-integrations/embeddings/llama-index-embeddings-huggingface-optimum/llama_index/e…

rushai-dev updated 1 month ago
1
SciSharp/CherubNLP #1

System.NullReferenceException during tokenization

Hi All, I am trying to get some very basic tokenization to work. I think I am not using the API properly because the method `Tokenize` is throwing System.NullReferenceException. Any suggestions? …

sdg002 updated 5 years ago
1
NaturalNode/natural #134

How to turn whole sentence into singular?

Hello again I'd like to turn all words of a sentence into singular. For example `my dog has lots of flees` should become `[ 'my', 'dog', 'has', 'lots', 'of', 'flee' ]` Here the code: ``` js va…

binarykitchen updated 10 years ago
7
coqui-ai/TTS #3992

Finetune XTTS for new languages

Hello everyone, below is my code for fine-tuning XTTS for a new language. It works well in my case with over 100 hours of audio. https://github.com/nguyenhoanganh2002/XTTSv2-Finetuning-for-New-Lang…

anhnh2002 updated 4 days ago
18
UKPLab/sentence-transformers #1699

tokenizer variable is uninitialized when model loaded from c…

```
from sentence_transformers import SentenceTransformer, util
model = SentenceTransformer('clip-ViT-L-14')
my_tok = model.tokenizer
```
results in `AttributeError: 'SentenceTransformer' …

Thomas-MMJ updated 2 years ago
1
