-
### What happened?
Consider this code snippet:
```cpp
// tokenize "\n"; add_special=false, parse_special=true
auto chat_ml_tokens = llama_tokenize(model, "\n", false, true);
std::cout
```
-
Currently, the `Model` class does several things in the `compute_embeddings()` step:
- The raw text is passed to an encoder
- The text is tokenized
- Embeddings are computed
- UMAP is used to return…
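One way to untangle these responsibilities is to give each step its own method and let `compute_embeddings()` only chain them. The sketch below is illustrative, not the actual `Model` API: the class name, the whitespace tokenizer, the one-hot embedder, and the "keep two dimensions" stand-in for UMAP are all assumptions.

```python
from dataclasses import dataclass, field

# Hypothetical split of compute_embeddings() into single-purpose steps.
# All names and the toy implementations are illustrative assumptions.

@dataclass
class Pipeline:
    vocab: dict = field(default_factory=dict)

    def tokenize(self, text: str) -> list[str]:
        # Step 1: tokenization as its own method (whitespace stand-in)
        return text.lower().split()

    def embed(self, tokens: list[str]) -> list[list[float]]:
        # Step 2: compute embeddings (stub: one-hot vectors over the vocab)
        for tok in tokens:
            self.vocab.setdefault(tok, len(self.vocab))
        dim = len(self.vocab)
        vecs = []
        for tok in tokens:
            v = [0.0] * dim
            v[self.vocab[tok]] = 1.0
            vecs.append(v)
        return vecs

    def reduce(self, vectors: list[list[float]]) -> list[list[float]]:
        # Step 3: dimensionality reduction; UMAP stubbed as "keep 2 dims"
        return [v[:2] for v in vectors]

    def compute_embeddings(self, text: str) -> list[list[float]]:
        # The orchestrator now only chains the steps
        return self.reduce(self.embed(self.tokenize(text)))

pipe = Pipeline()
out = pipe.compute_embeddings("hello world hello")
print(len(out))  # 3: one vector per token
```

With this shape, the tokenizer or the reducer can be swapped or tested in isolation without touching the rest of the pipeline.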
-
Add the tokenization functionality.
-
There is a package called **future.apply** which provides parallelized apply-type functions. It seems that we can parallelize tokenization with `future_lapply()`.
```r
require(quanteda)
require(fut…
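The same idea carries over to other languages: split the corpus across workers and tokenize the chunks in parallel. Below is a Python sketch using the standard library's `concurrent.futures` rather than future.apply, with a whitespace tokenizer standing in for quanteda's `tokens()`.

```python
from concurrent.futures import ThreadPoolExecutor

# Parallel tokenization sketch. The whitespace tokenizer is a stand-in;
# for a CPU-bound tokenizer you would use ProcessPoolExecutor instead
# (which requires an `if __name__ == "__main__"` guard).

def tokenize(text: str) -> list[str]:
    return text.split()

texts = ["a b c", "d e", "f g h i"]

with ThreadPoolExecutor(max_workers=4) as pool:
    # map preserves input order, like future_lapply()
    tokenized = list(pool.map(tokenize, texts))

print(tokenized[0])  # ['a', 'b', 'c']
```

As with `future_lapply()`, the win only shows up when per-document tokenization cost outweighs the overhead of distributing the work.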
-
Hi, I fine-tuned MGM-2B on COCO, but I got the following warning:
`{'loss': 6.9221, 'grad_norm': tensor(18.7422, device='cuda:0', dtype=torch.float64), 'learning_rate': 9.203084832904885e-06, 'epoch': 0.01}…
-
- Investigate whether Claude 3 models need a new tokenization method, or whether the old methods can still be used for abuse detection.
- Collect data from the experimentation and share the results to inform the decision.
-
As reported in https://github.com/ggerganov/llama.cpp/issues/6944#issuecomment-2101577066
The llama.cpp tokenizers give different results than HF for old GGUF files.
This is a subtle footgun and…
-
### System Info
- CPU architecture: x86_64
- GPU properties
- GPU name: NVIDIA A100
  - GPU memory size: 40 GB
- Libraries
- TensorRT-LLM branch or tag: v0.10.0
- Container used: yes, `ma…
-
There are generally 3 ways to specify an ellipsis in text:
1. as a sequence of 3 (or more) full-stop/period characters without spaces between them, e.g. `...`;
2. as a sequence of 3 (or more) full-s…
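The first form can be detected with a short regular expression. The sketch below handles only the no-space run of three or more periods described above; the Unicode ellipsis character U+2026 is included as an assumption, since the remaining list items are truncated here.

```python
import re

# Matches an ellipsis written as a run of three or more periods with no
# spaces between them (form 1 above). U+2026 is added as an assumption,
# not taken from the truncated list.
ELLIPSIS_RE = re.compile(r"\.{3,}|\u2026")

def find_ellipses(text: str) -> list[str]:
    return ELLIPSIS_RE.findall(text)

print(find_ellipses("Wait... what.... really\u2026"))  # ['...', '....', '…']
```

Note that `\.{3,}` is greedy, so a run of four periods is reported as one ellipsis rather than overlapping matches.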
-
## Description:
We are experiencing an issue with this module in Magento 2.4.1-p1 where the order status remains pending even after a successful payment. This problem persists regardless of whether…