encode-sentences Search Results

1000+ results
for encode-sentences

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

chroma-core/chroma #1722

[Feature Request]: Universal Sentence Encoder(USE) Embedding…

### Describe the problem Built in support for Google's Universal Sentence Encoder (USE) which can useful for greater-than-word length text, such as sentences, phrases or short paragraphs. [USE Pap…

csbasil updated 7 months ago
1
franckbrl/morpheval_v2 #2

`morpheval.limsi.v2.en.sents` use `macintosh` encoding

If someone plans to load sentences in Python or similar, be aware that the file is encoded using the `macintosh` charset. This worked for me to get the original sentences in UTF-8: `iconv -f macint…

gsarti updated 2 years ago
2
UKPLab/sentence-transformers #429

Error: object of type 'float' has no len()

Hello, I am getting this error when try to clustering in Spanish (see ERROR below). I assume my corpus should have a problem. Could you help me to find the nature of the error? (It works perfectly …

rdpulgar updated 3 years ago
8
common-voice/common-voice #4491

Support bulk-ban or bulk-remove sentences

I originally thought that this issue is only specific to the zh-hk locale, but later realize that this is quite widespread and seriously harming the data quality of many languages. So currently, some …

laubonghaudoi updated 2 months ago
4
openai/CLIP #212

RuntimeError: Input is too long for context length 77

This happens when trying to tokenize ( clip.tokenize(train_sentences).to(device) ) sentences that have less than 77 tokens (for example 44), but some of them are unknown. I have tried to operate th…

ancordovag updated 11 months ago
3
plysytsya/EbookTranslator #2

Error when using Google special chars

Hi again, if using a German text with special characters as source language and Google as engine it comes to this error: Reading text into memory. Tokenizing text into sentences. Starting transla…

credo99 updated 4 years ago
2
Tatoeba/tatoeba2 #2984

Incorrectly formatted CSV files

Hi. Some of the dump files on the Downloads page are incorrectly formatted. The details field on the user_languages.csv file, for example, allows tabs and newlines, which should not be allowed in a…

cangareijo updated 2 years ago
6
UKPLab/sentence-transformers #820

Not able to export sentence-transformers model to PyTorch.

Hi, I would like to export sentence-transformers model to PyTorch. However, I am not able to jit trace the **stsb-distilbert-base** model. Any help is much appreciated. Thanks, -s sentenc…

sivers2021 updated 3 months ago
8
leehour/Seq2SeqWithPGN #1

你的数据文件怎么产生的？比如vobca.txt sentences.txt??

(py38nlp) ➜ /Users/admin/Seq2SeqWithPGN python build_vocab.py Traceback (most recent call last): File "build_vocab.py", line 29, in vocab, reverse_vocab = generate_vocab(sentences_path)…

northeast250 updated 4 years ago
1
UKPLab/sentence-transformers #1139

SimCSE dropout

Hi, In the sample code example provided for SimCSE, should we set the **dropout** or has it been set emplicitly: ``` model_name = 'distilroberta-base' word_embedding_model = models.Transformer…

lukemao updated 3 years ago
1

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for encode-sentences

1000+ results
for encode-sentences