sentence-tokenizer Search Results

1000+ results
for sentence-tokenizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Yoctol/purewords #37

Filters should be invertible

``` python import re store_dict = {replacement: []} store_dict[replacement] = re.findall(pattern, sentence) filtered_sentence = re.sub(.....) tokenized_sentence = tokenizer.lcut(filtered_sentence…

SoluMilken updated 7 years ago
1
microsoft/autogen #2255

[Feature Request]: Scraping Github code for retriver ragprox…

### Is your feature request related to a problem? Please describe. Following #708 I developed some scraping code from github that downloads .py/.ipnyb for the ragproxy agent. I would like to find a…

Repomano updated 15 hours ago
2
openspeech-team/openspeech #205

학습 완료된 모델을 불러와 하나의 음성 파일을 예측하고 싶습니다

# ❓ Questions & Help 현재 학습 완료된 모델을 불러와 하나의 음성 파일을 예측하고 싶습니다. 패키지 내에 함수들은 다량의 데이터 셋으로 테스트하는 것 같아서요!... 이것저것 보면서 짜고 있는데 계속 에러가나 이렇게 질문드립니다.... 혹시 wav 파일 하나만 가지고 테스트해 해당 예측된 말 소리 텍스트를 볼 수 있을까요? 패키지 내에 어…

youngchannelforyou updated 12 months ago
1
monum/311-translation #12

French to English translation task notebook

https://colab.research.google.com/drive/14KegLD0ymq4vTRzCjUvP77w9l-IGCsnj?usp=sharing @mlevans @tejasvicsr1

Gaurav7888 updated 3 months ago
3
huggingface/candle #2418

python sentence transformer all-MiniLM-L6-v2 is almost 2x fa…

Here is my candle implementation: (Taken from the examples itself) `pub fn encode(&self, prompt: &str) -> Result { let tokens = self.tokenizer .encode(prompt, true) …

AbhishekBose updated 2 months ago
6
cadmiumcr/utilities #4

Remove cadmiumcr/utilities and merge its content in cadmiumc…

@watzon : No cadmium shard requires cadmiumcr/utilities without requiring cadmiumcr/tokenizer I can't see a use case of a program requiring cadmiumcr/utilities by itself. The content of cad…

rmarronnier updated 4 years ago
1
neuspell/neuspell #45

Huggingface example script throws all kinds of errors

I'm trying to run the huggingface example in scripts/huggingface. Running the script as-is produces the error `/tmp/ipykernel_40405/3991995924.py in _custom_bert_tokenize(batch_sentences, bert_…

bradygilg updated 4 months ago
1
zhoubenjia/GFSLT-VLP #8

Bad results on CSL-Daily dataset

Hi Zhou, I have read your paper and am very interested in the idea. Therefore, I would like to conduct some experiments on this model. However, when I switched the dataset to CSL-Daily, I did not a…

Zachary-Lau-s updated 1 week ago
16
FunAudioLLM/CosyVoice #344

Add a new language but the result is a meaningless audio.

Hello @aluminumbox , I continued training the `llm` model on a German dataset (300 hours), but after 25k steps the model could not pronounce German and the 5 available languages. My process: - I f…

drlor2k updated 2 weeks ago
17
QuantConnect/Documentation #1777

Document Huggingface Cloud

These models are available in live, backtesting & research in the cloud environment. Access installed models and their revisions ```python from huggingface_hub import scan_cache_dir …

Martin-Molinero updated 4 months ago
6

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for sentence-tokenizer

1000+ results
for sentence-tokenizer