-
That'd be even more resource efficient. Thanks!
-
Hello, I am trying to convert the TinyLlama 1.1B model (checkpoint name PY007/TinyLlama-1.1B-step-50K-105b), but I am getting some sort of shape mismatch error. Could you kindly look into this, @guillaumekln?
Er…
-
Thank you for your nice work!
I calculated the batch size using the equation from [the OpenAI scaling-laws paper](https://arxiv.org/abs/2001.08361), which comes out to 12M tokens if I want to achieve a loss of ~1.8. But I found all paper…
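For reference, the 12M-token figure can be reproduced from the critical-batch-size formula in that paper, B_crit(L) = B* / L^(1/α_B), using the paper's approximate fitted constants (B* ≈ 2×10⁸ tokens, α_B ≈ 0.21); a minimal sketch:

```python
# Critical batch size from Kaplan et al. (2020):
#   B_crit(L) = B* / L**(1 / alpha_B)
# Constants are the paper's approximate fitted values.
B_STAR = 2e8    # tokens
ALPHA_B = 0.21

def critical_batch_size(loss: float) -> float:
    """Tokens per batch at the compute/time trade-off frontier."""
    return B_STAR / loss ** (1 / ALPHA_B)

print(f"{critical_batch_size(1.8) / 1e6:.1f}M tokens")  # roughly 12M at L = 1.8
```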
-
### System Info
```Shell
- `Accelerate` version: 0.26.1
- Platform: Linux-5.15.0-91-generic-x86_64-with-glibc2.35
- Python version: 3.11.5
- Numpy version: 1.26.3
- PyTorch version (GPU?): 2.1.2 …
```
-
I have enabled llama2.c to run the TinyLlama 1.1B chat model in my [repo](https://github.com/magician-blue/llama2.c).
Read [Tiny Llama 1.1B model](https://github.com/magician-blue/llama2.c#tiny-llama-11b-…
-
Hi all,
Thanks for your great work.
I am wondering about the training subset of this chinchilla-optimal model. -> "This speed lets you train a chinchilla-optimal model (1.1B parameters, 22B tokens) on 8 A100s in 32 hours."
Is this part from sl…
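The 22B-token figure quoted above matches the Chinchilla rule of thumb of roughly 20 training tokens per parameter (Hoffmann et al., approximate); a quick check:

```python
params = 1.1e9          # TinyLlama parameter count
tokens_per_param = 20   # approximate Chinchilla-optimal ratio
optimal_tokens = params * tokens_per_param
print(f"{optimal_tokens / 1e9:.0f}B tokens")  # 22B tokens
```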
-
Is this line correct?
```
sample['prompt'] = [tokenizer.apply_chat_template([{'role': 'user', 'content': item[0]}], tokenize=False, add_generation_prompt=True) for item in sample['chosen']]
```
…
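The comprehension looks right for a DPO-style dataset where each `chosen` entry is a (prompt, response) pair. A minimal offline sketch, using a hypothetical stub in place of the real `tokenizer.apply_chat_template` (which renders the tokenizer's Jinja chat template), shows what it produces:

```python
# Stub standing in for tokenizer.apply_chat_template (hypothetical template
# format; the real output depends on the model's chat template).
def apply_chat_template(messages, tokenize=False, add_generation_prompt=True):
    out = "".join(f"<|{m['role']}|>\n{m['content']}\n" for m in messages)
    if add_generation_prompt:
        out += "<|assistant|>\n"  # open the assistant turn for generation
    return out

# DPO-style sample: each 'chosen' entry is a (prompt, response) pair.
sample = {"chosen": [("What is 2+2?", "4"), ("Name a color.", "Blue")]}

# The line from the question: item[0] is the user prompt of each pair.
sample["prompt"] = [
    apply_chat_template([{"role": "user", "content": item[0]}],
                        tokenize=False, add_generation_prompt=True)
    for item in sample["chosen"]
]
print(sample["prompt"][0])
```

Each prompt is wrapped as a single user turn with the generation prompt appended, which is the usual shape for DPO prompt columns.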
-
Windows 11, 24-core/32-thread CPU (Nov 2023, 6 GHz), 64 GB RAM, NVIDIA GeForce RTX 4060 Ti (16 GB), llama.cpp build of Mar 31 2024.
I have noticed some anomalies after testing close…
-
### Describe the issue as clearly as possible:
All of the generation examples given on the front page of the repo raise the same error:
`RuntimeError: Index put requires the source and des…
-
In TinyLlama, the dataset is a combination of SlimPajama and StarCoderData; the total is around 950B tokens.
My question is: what is the meaning of `Sampled all code from Starcoderdata`? I…