-
``` python
import re
store_dict = {replacement: []}
store_dict[replacement] = re.findall(pattern, sentence)
filtered_sentence = re.sub(.....)
tokenized_sentence = tokenizer.lcut(filtered_sentence…
-
### Is your feature request related to a problem? Please describe.
Following #708 I developed some scraping code from github that downloads .py/.ipnyb for the ragproxy agent. I would like to find a…
-
# ❓ Questions & Help
현재 학습 완료된 모델을 불러와 하나의 음성 파일을 예측하고 싶습니다. 패키지 내에 함수들은 다량의 데이터 셋으로 테스트하는 것 같아서요!... 이것저것 보면서 짜고 있는데 계속 에러가나 이렇게 질문드립니다.... 혹시 wav 파일 하나만 가지고 테스트해 해당 예측된 말 소리 텍스트를 볼 수 있을까요? 패키지 내에 어…
-
https://colab.research.google.com/drive/14KegLD0ymq4vTRzCjUvP77w9l-IGCsnj?usp=sharing
@mlevans @tejasvicsr1
-
Here is my candle implementation: (Taken from the examples itself)
`pub fn encode(&self, prompt: &str) -> Result {
let tokens = self.tokenizer
.encode(prompt, true)
…
-
@watzon :
No cadmium shard requires cadmiumcr/utilities without requiring cadmiumcr/tokenizer
I can't see a use case of a program requiring cadmiumcr/utilities by itself.
The content of cad…
-
I'm trying to run the huggingface example in scripts/huggingface.
Running the script as-is produces the error
`/tmp/ipykernel_40405/3991995924.py in _custom_bert_tokenize(batch_sentences, bert_…
-
Hi Zhou,
I have read your paper and am very interested in the idea. Therefore, I would like to conduct some experiments on this model. However, when I switched the dataset to CSL-Daily, I did not a…
-
Hello @aluminumbox , I continued training the `llm` model on a German dataset (300 hours), but after 25k steps the model could not pronounce German and the 5 available languages.
My process:
- I f…
-
These models are available in live, backtesting & research in the cloud environment.
Access installed models and their revisions
```python
from huggingface_hub import scan_cache_dir
…