tokenization Search Results

1000+ results for tokenization

OoriData/Toolio #12

Weirdness with tokenization in Phi-3

Server: ```sh toolio_server --model=mlx-community/Phi-3-mini-128k-instruct-4bit ``` Client: ```sh toolio_request --apibase="http://localhost:8000" --prompt='What is the average airspeed of…

uogbuji updated 3 months ago
3
AkihikoWatanabe/paper_notes #1507

LBPE: Long-token-first Tokenization to Improve Large Languag…

# URL - https://arxiv.org/abs/2411.05504 # Authors - Haoran Lian - Yizhe Xiong - Zijia Lin - Jianwei Niu - Shasha Mo - Hui Chen - Peng Liu - Guiguang Ding # Abstract - The prevalent …

AkihikoWatanabe updated 2 weeks ago
1
hiyouga/LLaMA-Factory #5878

GPU memory is sufficient but cannot be used; only a small amount of GPU memory shows as in use

### Reminder - [X] I have read the README and searched the existing issues. ### System Info - `llamafactory` version: 0.9.1.dev0 - Platform: Linux-6.5.0-28-generic-x86_64-with-glibc2.35 - Python …

Lgugeng updated 4 weeks ago
1
Iodine98/dora-back #59

Parametrize tokenization method

Currently, the tokenization method for processing text defaults to `RecursiveTextSplitter`; it should instead be passed as a parameter, chosen according to the type of document uploaded.

Iodine98 updated 6 months ago
1
FlagOpen/FlagEmbedding #964

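PyPreTokenizerTypeWrapper error after merging models with LM_Cocktail

Hello, I had previously finished fine-tuning the model, and merging the models also went without problems, but when using it this week I suddenly found that calls through either FlagEmbedding or Huggingface produce: File "/opt/conda/lib/python3.8/site-packages/FlagEmbedding/flag_reranker.py", line 158, in __init__ self.tokenizer …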

Zhouziyi828 updated 1 month ago
3
StartHua/Comfyui_CXH_joy_caption #92

Joy_caption node error: expected str, bytes or os.PathLike object, n…

Running the Joy_caption node raises an error; the error details are as follows: # ComfyUI Error Report ## Error Details - **Node Type:** Joy_caption - **Exception Type:** TypeError - **Exception Message:** expected str, bytes or os.PathLike obj…

mr-bob-chang updated 3 weeks ago
1
MaartenGr/BERTopic #1936

Semantic Sentence Tokenization

I'm working with a corpus that primarily consists of longer documents. I'm seeking recommendations for the most effective approach to semantically tokenize them. Examples: ``` Original Text: "I…

TheAIMagics updated 7 months ago
1
chaudhary1337/Chatbot-Assistant #1

Repeated tokenization

The user input is tokenized again and again for each intent. This needs a complete revision of the structure ...

chaudhary1337 updated 4 years ago
1
1393Designs/Textspansion #8

Improved Tokenization

Because who wants to type in %HERPDERP

vincetran updated 11 years ago
1
Cpgeragh/Emerging-Technologies #21

Define Function to Analyze Word Validity

**Description**: Write a function to split the generated text into words and check each word against the English word list to determine its validity. **Checklist**: - [ ] Research methods for tok…

Cpgeragh updated 3 weeks ago
1
