tokenizers Search Results

1000+ results
for tokenizers


huggingface/transformers.js #1019

Pretrained Llama tokenizers don't yield the expected tokeniz…

### System Info TypeScript 5.5.4 transformers.js 3.0.2 Node.js v20.170 ### Environment/Platform - [X] Website/web-app - [ ] Browser extension - [X] Server-side (e.g., Node.js, Deno, Bun) - [ ] De…

JulienVig updated 3 weeks ago
2
vllm-project/vllm #8994

[Bug]: Unable to load the tokenizers of certain models

### Your current environment The output of `python collect_env.py` ```text PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A…

Wafaa014 updated 1 month ago
9
unslothai/unsloth #1108

Resize embeddings, tokenizers - adding new tokens don't work

From Twitter - adding new tokens to Qwen don't work? ```python # Add special tokens to the tokenizer num_added_tokens = tokenizer.add_special_tokens({"additional_special_tokens": special_tokens}) …

danielhanchen updated 1 month ago
3
deeepsig/tokviz #2

Add custom tokenizer support (see code)

## Purpose The script `tokviz/visualization.py` can and should have functionality to **visualize custom and local tokenizers**. Start with HF Transformers' class `PreTrainedTokenizerFast` for ease. T…

DrewGalbraith updated 1 week ago
1
FFengIll/embedding.cpp #2

Cargo needs to be installed before running the make command

When following the README instructions on Ubuntu 20.04 on Windows 11 (WSL2), the `make` command fails: ```bash [ 7%] Built target ggml [ 8%] Generating release/libtokenizers_c.a no such file o…

ivanorsolic updated 1 week ago
2
dotnet/machinelearning #6993

[Tokenizers] Port CLIP Tokenizer

Port CLIP tokenizer which leverages byte-level BPE. This tokenizer enables scenarios like StableDiffusion May be dependent on https://github.com/dotnet/machinelearning/issues/6992. Reference: h…

ericstj updated 3 months ago
1
Ucas-HaoranWei/GOT-OCR2.0 #122

pip install fails; deepseed cannot be installed

My environment is Windows 11 23H2, Anaconda 24.5.0. I ran the following commands, following the steps in the README: ``` git clone https://github.com/Ucas-HaoranWei/GOT-OCR2.0.git cd GOT-OCR2.0/GOT-OCR2.0-master/ conda create -n got python=3.10 -…

domeniczz updated 1 week ago
5
huggingface/tokenizers #1636

NormalizedString.clear() broken?

Hello. I think there are some problems with `NormalizedString` (tokenizers 0.15.2). In the following example, `append()` works as expected. ``` from tokenizers import NormalizedString s = Norm…

lkurlandski updated 3 days ago
2
milvus-io/milvus #37498

[Bug]: For unsupported tokenizers, the error message is not …

### Is there an existing issue for this? - [X] I have searched the existing issues ### Environment ```markdown - Milvus version:master - Deployment mode(standalone or cluster): - MQ type(rocksmq,…

zhuwenxing updated 3 weeks ago
1
pypi/support #3902

Project Limit Request: tokenizers - 100 GB

### Project URL https://pypi.org/project/tokenizers/ ### Does this project already exist? - [X] Yes ### New limit 100 GB (to start) ### Update issue title - [X] I have updated the title. ### W…

Narsil updated 3 months ago
1

