-
Hey,
I'm trying to add a large number of new tokens (20k) to a pretrained tokenizer, but after adding them, the time needed to tokenize the same data jumps from 1 minute to more than 10 hou…
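A minimal sketch of the setup that triggers this (the model name and token strings are placeholders, and `use_fast=False` is an assumption about which tokenizer backend is in use):

```python
from transformers import AutoTokenizer

# Placeholder checkpoint; any pretrained tokenizer shows the same pattern.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased", use_fast=False)

# Add ~20k new tokens in a single call.
new_tokens = [f"new_token_{i}" for i in range(20_000)]
tokenizer.add_tokens(new_tokens)

# Tokenizing the same data is now dramatically slower than before add_tokens().
encoded = tokenizer("some example text " * 100)
```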
-
## Environment info
- `transformers` version: 4.16.2 (issue exists on 4.9.2)
- Platform: Linux-4.4.0-210-generic-x86_64-with-glibc2.10
- Python version: 3.8.10
- PyTorch version (GPU?): 1.8.1+…
-
## Environment info
- `transformers` version: 4.11.0
- Platform: macOS Big Sur
- Python version: 3.8.5
### Who can help
@LysandreJik
## Information
When using the AlbertTokenizer or A…
-
Following the recommendation in issue #5231,
I start with a blank model and modify the existing tokenization rules, then save the model to disk and use it as my base model for downstream tasks, includin…
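A minimal sketch of that workflow, assuming spaCy v3 (the special-case rule here is purely illustrative):

```python
import spacy
from spacy.attrs import ORTH

# Start from a blank pipeline instead of a pretrained one.
nlp = spacy.blank("en")

# Modify the existing tokenization rules, e.g. add a special case.
nlp.tokenizer.add_special_case("don't", [{ORTH: "do"}, {ORTH: "n't"}])

# Save to disk and reload later as the base model for downstream tasks.
nlp.to_disk("./custom_base_model")
nlp = spacy.load("./custom_base_model")
```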
-
## ❓ Questions and Help
Thanks for releasing the mbart models! However, we are unable to reproduce the EN-RO fine-tuned BLEU scores reported in the paper. We get a BLEU score of 26.9, using sacreBLEU…
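For reference, a sketch of the scoring we use (file names are placeholders; inputs are detokenized text, one sentence per line):

```python
import sacrebleu

# Hypothetical file names for system output and references.
with open("hypotheses.ro") as f:
    hyps = [line.strip() for line in f]
with open("references.ro") as f:
    refs = [line.strip() for line in f]

# sacreBLEU applies its own tokenization; the second argument is a
# list of reference streams (here, a single reference per sentence).
bleu = sacrebleu.corpus_bleu(hyps, [refs])
print(bleu.score)
```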
-
I trained on my custom dataset.
Running `python interpret_prompt.py` gives the output below:
```
Return the top-3 matched words
Size of token embedding: torch.Size([49408, 512])
Size of context: torch.Size([1…
```
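If it helps to interpret that output: the "top-3 matched words" can be produced by a nearest-neighbor search between the learned context vectors and the token embedding matrix. A sketch of that general technique (the tensors below are random stand-ins for the script's real ones, and the shapes match the printed sizes only by assumption):

```python
import torch

# Stand-ins: CLIP-sized token embedding matrix and learned context vectors.
token_embedding = torch.randn(49408, 512)
ctx_vectors = torch.randn(16, 512)

# Euclidean distance from each context vector to every vocabulary embedding.
dists = torch.cdist(ctx_vectors, token_embedding)  # shape: [n_ctx, vocab_size]

# The 3 closest vocabulary ids per context vector ("top-3 matched words");
# map the ids back through the tokenizer to get the actual words.
top_dists, top_ids = dists.topk(3, dim=1, largest=False)
print(top_ids)
```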
-
Note, 13 February 2023: until further notice, the de facto discussion place is #779.
---
So today I learned that [GitHub threads max out at 2,500 comments](https://github.com/Da…
-
## How to reproduce the behaviour
The 'recommended way' of upgrading to spaCy 3 is to '**re-train your models**'. To justify the upgrade, one would expect that re-training (using the same data) will result…
-
**Describe the bug**
Unable to save a T5-11B checkpoint using DeepSpeed.
**To Reproduce**
Steps to reproduce the behavior:
```
export…
-
Hi,
Is it possible to reproduce the NER results on CPU instead of the default GPU configuration? I can't find any resource for this in the repo.
I am using the following command, bu…
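One generic workaround I know of (a sketch, assuming the scripts are PyTorch-based) is to hide the GPUs before CUDA is initialized:

```python
import os

# Must be set before torch/transformers initialize CUDA; with no visible
# devices, the run falls back to CPU.
os.environ["CUDA_VISIBLE_DEVICES"] = ""
```

Scripts built on the `Trainer` also accept `no_cuda=True` via `TrainingArguments` (exposed as `--no_cuda` on the command line).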