-
On a fresh container with minimal prerequisites, the first call results in:
```
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/langchain/llms/openai.py", line 233, in get_nu…
```
-
I was playing with the tokenizer, and I noticed some missed merge opportunities.
```python
>>> tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B")
>>> tokenizer(['\t', '\t\t',…
```
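The kind of check behind this report can be sketched in plain Python (toy vocabulary, not the real Llama 3 one): a "missed merge" is a pair of adjacent output tokens whose concatenation is itself a vocabulary entry, meaning a single token could have been emitted instead.

```python
# Toy vocab standing in for the real tokenizer vocabulary (assumption for
# illustration only): runs of tabs up to length 3 exist as single tokens.
vocab = {"\t", "\t\t", "\t\t\t", "a"}

def find_missed_merges(tokens):
    # Report every adjacent pair that could have been one vocab token.
    return [tokens[i] + tokens[i + 1]
            for i in range(len(tokens) - 1)
            if tokens[i] + tokens[i + 1] in vocab]

print(find_missed_merges(["\t", "\t"]))  # ['\t\t'] -> a merge was missed
```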
-
Hello! I don't remember if I'd shown you https://github.com/openai/tiktoken/blob/main/tiktoken/_educational.py, but consider stealing the token visualisation code from there in some form: https://gith…
-
Upon executing this:
```python
charts = lida.visualize(summary=summary, goal=user_query, textgen_config=textgen_config)
```
I get the error:
```
name 'tiktoken' is not defined
```
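A `NameError` like this usually means the optional `tiktoken` dependency is not installed in the environment running lida (this is a hedged guess at the cause, not a confirmed diagnosis). A quick preflight check before calling `visualize`:

```python
import importlib.util

# find_spec returns None when a top-level module cannot be found,
# without actually importing it.
missing = importlib.util.find_spec("tiktoken") is None
if missing:
    print("tiktoken is missing: run `pip install tiktoken`")
else:
    print("tiktoken is available")
```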
-
### Describe the issue
When running the [pgvector example](https://github.com/microsoft/autogen/blob/main/notebook/agentchat_pgvector_RetrieveChat.ipynb) I get the following error:
m:\One…
-
Looking through the Llama 3 changes, I see that `ignore_merges` was added as a property to support conversion from tiktoken models. Can a native HF tokenizer be trained using this property? It's not clear…
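As I understand the flag (a pure-Python illustration with a made-up toy vocab, not the `tokenizers` implementation): with `ignore_merges` set, a pre-tokenized word that already exists in the vocabulary is emitted directly as one token, skipping the BPE merge loop entirely.

```python
# Toy vocab: both the characters and the whole word "hi" are entries.
vocab = {"h": 0, "i": 1, "hi": 2}

def bpe_encode(word, ignore_merges):
    if ignore_merges and word in vocab:
        # Whole word found in vocab: emit it directly, no merges applied.
        return [vocab[word]]
    # Fallback to character-level pieces (the real merge loop is elided).
    return [vocab[c] for c in word]

print(bpe_encode("hi", ignore_merges=True))   # [2]
print(bpe_encode("hi", ignore_merges=False))  # [0, 1]
```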
-
### Bug Description
I recently conducted a few experiments using RaptorPack and everything looks fine. The only drawback is that the token counter is not working for the RaptorPack, so I cannot ge…
-
### Describe the bug
`completion_tokens += 1` looks like a bad idea. Do we expect a chunk to always be 1 token? I don't think so.
### Steps to reproduce
_No response_
### Expected Behavior…
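The point of the report can be sketched as follows: a streamed chunk can carry any number of tokens, so the counter should tokenize each chunk rather than increment by one. The `count_tokens` helper below is a hypothetical stand-in (a naive whitespace split); in practice you would encode each chunk with the model's actual tokenizer.

```python
def count_tokens(text: str) -> int:
    # Placeholder tokenizer: whitespace split. A real implementation would
    # use the model's tokenizer (e.g. a BPE encoder) instead.
    return len(text.split())

# Simulated stream: chunks of varying token counts.
chunks = ["Hello there", ", how are", " you today?"]

completion_tokens = 0
for chunk in chunks:
    completion_tokens += count_tokens(chunk)  # not `+= 1` per chunk

print(completion_tokens)  # 7
```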
-
### 🚀 The feature, motivation and pitch
The current multilingual recipes are for LLAMA 2.
I would like to see LLAMA 3 multilingual recipes added.
Thank you.
### Alternatives
_No response_
##…
-
I'm attempting to train LLaMA-3 using Megatron-LM but have encountered an issue: LLaMA-3 utilizes Tiktoken for tokenization and doesn't provide a tokenizer.model file, which is required by Megatron-LM…
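For context on the format mismatch: a tiktoken-style BPE file is plain text with one `<base64-encoded token> <rank>` pair per line, unlike the SentencePiece protobuf `tokenizer.model` that Megatron-LM expects. A minimal sketch of parsing the tiktoken format (the sample lines and ranks below are made up for illustration):

```python
import base64

def load_tiktoken_bpe_lines(lines):
    # Each line is "<base64 token> <rank>"; decode into a bytes -> rank map.
    ranks = {}
    for line in lines:
        token_b64, rank = line.split()
        ranks[base64.b64decode(token_b64)] = int(rank)
    return ranks

# Toy sample: base64 of b"Hello" and b" world" with invented ranks.
sample = ["SGVsbG8= 0", "IHdvcmxk 1"]
ranks = load_tiktoken_bpe_lines(sample)
print(ranks[b"Hello"])  # 0
```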