sentence-tokenizer Search Results

1000+ results
for sentence-tokenizer

nlpyang/PreSumm #66

Why tokenizing 2 times ?

Data is tokenized 2 times : 1. With Stanford CoreNLP : https://github.com/nlpyang/PreSumm/blob/ba17e95de8cde9d5ddaeeba01df7cace584511b2/src/prepro/data_builder.py#L110 2. With HuggingFace's Bert…

astariul updated 2 years ago
5
unslothai/unsloth #350

Add support for Llama 3

It looks like the tokenizer patching breaks. Here's the log: ``` ValueError Traceback (most recent call last) Cell In[1], line 20 7 # 4bit pre quantized models…

rwl4 updated 4 months ago
14
microsoft/SpeechT5 #49

SpeechT5-tts fine-tuned on Chinese

I used a [colab notebook](https://colab.research.google.com/drive/1i7I5pzBcU3WDFarDnzweIj4-sVVoIUFJ) to fine-tune this model. When I run trainer.train(), it raises an error. ``` in :2 …

qlmbeck updated 8 months ago
4
pranavilingamallu/clearnlp #10

ClearNLP Error: java.lang.NullPointerException at com.google…

``` I'm trying to find a good Semantic Role Labeling tool that I can use in my Java code using NetBeans. I tried ClearNLP and it worked when testing the version with the right output from this link: ht…

GoogleCodeExporter updated 9 years ago
1
huggingface/setfit #453

Best Approach for adding new vocabulary?

What would be the best approach for adding new vocab to the tokenizer before training the model? I tried accessing the tokenizer directly but realized there would be no way to resize the token_embeddi…

xsfa updated 10 months ago
1
huggingface/parler-tts #19

Benchmarks of parler-tts, the emergence of TTS!

Hey @sanchit-gandhi, like the repo. Excited to see this being worked on. Here's a benchmark of WhisperSpeech. I used your sample script on the same exact text snippet and it finished processing in …

BBC-Esq updated 2 months ago
7
antijob/neuro-parser #134

Add apps for training BERT models

## Input Takes a CSV file with two columns as input 1. Normalized texts 2. valid\unvalid flags ## Output: Configuration `config.json` Model in `pytorch_model.bin` format Map…

Vldln updated 5 months ago
1
huggingface/setfit #464

Error when uploading model checkpoints to Weights & Biases

When setting `os.environ["WANDB_LOG_MODEL"] = "end"` prior to the training loop and specifying `report_to='wandb'` in `TrainingArguments`, I receive the following error: ``` Loading best SentenceTra…

simonschoe updated 6 months ago
3
TakeLab/spacy-udpipe #45

'NoneType' object has no attribute 'newTokenizer'`

Hello, I installed spacy-udpipe from the Pypi repo using the following `pip install spacy-udpipe` When I follow the tutorial code from the Pypi package tutorial ``` import spacy_udpipe sp…

CMallart updated 1 year ago
1
huggingface/transformers #31513

Special token handling breaks idempotency of sentencepiece d…

SentencePiece tokenizers have the property that [`Decode(Encode(Normalize(input))) == Normalize(input).`](https://github.com/google/sentencepiece/blob/master/doc/api.md#detokenize-text-postprocessing). …

cat-state updated 2 weeks ago
29
