-
Hello, @b4rtaz!
I'm trying to run the model [nkpz/llama2-22b-chat-wizard-uncensored](https://huggingface.co/nkpz/llama2-22b-chat-wizard-uncensored) on a cluster composed of one Raspberry Pi 4B 8 GB and 7…
-
curious how it performs on smaller models
-
### What happened?
I am trying to run inference using the RPC example. When running llama-cli with the RPC feature against a single rpc-server on localhost, the inference throughput is only 1.9 tok/sec for lla…
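For context, a minimal reproduction of that setup, assuming llama.cpp is built with RPC support; the model path and prompt are placeholders:

```sh
# Build llama.cpp with the RPC backend enabled
cmake -B build -DGGML_RPC=ON
cmake --build build --config Release

# Start a single rpc-server on localhost (port is arbitrary)
./build/bin/rpc-server -p 50052

# In another shell, point llama-cli at that server
./build/bin/llama-cli -m tinyllama-1.1b-f16.gguf -p "Hello, my name is" \
    --rpc localhost:50052 -ngl 99
```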
-
See the TinyLlama pretraining script in lit-gpt and the pytorch-labs repo from the PyTorch talk.
-
Are there any differences in the `_make_masks` function across different LLM models? Don't they all compute the loss only for the response part? What causes the variations among them?
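For reference, the common pattern such masking helpers implement is to clone the input ids and set every prompt token to -100, so that cross-entropy loss covers only the response. A minimal illustrative sketch (not any particular repo's `_make_masks`):

```python
import torch

IGNORE_INDEX = -100  # torch.nn.CrossEntropyLoss skips this index by default

def make_labels(input_ids: torch.Tensor, prompt_len: int) -> torch.Tensor:
    """Mask the prompt so loss is computed only over the response tokens."""
    labels = input_ids.clone()
    labels[:prompt_len] = IGNORE_INDEX  # prompt tokens contribute no loss
    return labels
```

The masking idea is the same everywhere; the variation between models usually comes from their chat templates, which wrap the prompt in different special tokens, so the code that finds where the prompt ends differs per model.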
-
[TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0). This is my custom dataset: [BibleGPT-LORA](https://huggingface.co/datasets/oliverbob/biblegpt). It's a s…
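A minimal sketch of loading that model and dataset; the `train` split name is an assumption, since the report does not show the dataset layout:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Custom dataset referenced above; the split name is assumed
dataset = load_dataset("oliverbob/biblegpt", split="train")
print(dataset[0])
```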
-
I try to load the model with transformers:
```python
small_model = AutoModelForCausalLM.from_pretrained(approx_model_name,
                                                   torch_dtype=torch.float16…
```
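A runnable version of that call, with the truncated arguments filled in as assumptions (`device_map="auto"` does not appear in the original snippet):

```python
import torch
from transformers import AutoModelForCausalLM

approx_model_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # placeholder id
small_model = AutoModelForCausalLM.from_pretrained(
    approx_model_name,
    torch_dtype=torch.float16,  # half precision, as in the snippet
    device_map="auto",          # assumption: auto-place the weights
)
```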
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports.
…
-
When I tried to call:
```python
llm = NanoLLM.from_pretrained(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
    api='hf',
    api_token='mytoken',
    …
```
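For comparison, a hedged sketch of what the complete call might look like, following the NanoLLM examples from jetson-containers; the generation arguments are assumptions, not taken from the report:

```python
from nano_llm import NanoLLM

llm = NanoLLM.from_pretrained(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
    api='hf',             # HuggingFace Transformers backend
    api_token='mytoken',  # placeholder token from the snippet
)

# Assumed usage; generate() streams tokens in the NanoLLM examples
response = llm.generate("Hello, who are you?", max_new_tokens=64)
for token in response:
    print(token, end='', flush=True)
```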
-
I tried to simplify TinyLlama with the code, but the simplified ONNX file is almost the same size as the non-simplified one. It would be appreciated if you could provide the ONNX sizes of the original Llama on…
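For reference, the usual onnx-simplifier flow is sketched below with placeholder file names. Since simplification folds constants and removes redundant ops but leaves the weights untouched, and weights dominate an LLM's ONNX file, a near-identical size is expected:

```python
import onnx
from onnxsim import simplify

model = onnx.load("tinyllama.onnx")  # placeholder path
model_simp, ok = simplify(model)
assert ok, "simplified model failed validation"
# Models over 2 GB need onnx.save_model(..., save_as_external_data=True)
onnx.save(model_simp, "tinyllama-simplified.onnx")
```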