-
```
Traceback (most recent call last):
  File "e:\llm\TinyLlama\pretrain\tinyllama.py", line 17, in <module>
    from lit_gpt.model import GPT, Block, Config, CausalSelfAttention
  File "E:\llm\TinyLlama\lit_g…
```
-
May I ask why, given the relatively small size of the TinyLlama model, the strategy was chosen to use FSDP (Fully Sharded Data Parallel) rather than DDP (Distributed Data Parallel)…
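A rough back-of-the-envelope sketch of why FSDP can still pay off at this scale: with Adam, optimizer state dwarfs the bf16 weights, and FSDP shards both across ranks while DDP keeps a full replica on every GPU. The byte counts below (2 bytes/param for bf16 weights, ~12 bytes/param for an fp32 master copy plus two Adam moments) are common rules of thumb, not measurements from TinyLlama's actual training run.

```python
def per_gpu_gib(n_params, num_gpus, sharded,
                weight_bytes=2, optim_bytes=12):
    """Rough per-GPU memory for weights + Adam state (activations ignored).

    weight_bytes=2 assumes bf16 weights; optim_bytes=12 assumes an fp32
    master copy plus two fp32 Adam moments. Both are rules of thumb.
    """
    total = n_params * (weight_bytes + optim_bytes)
    if sharded:              # FSDP shards weights and optimizer state
        total /= num_gpus    # DDP keeps a full replica on every rank
    return total / 2**30

# TinyLlama has ~1.1e9 parameters
ddp  = per_gpu_gib(1.1e9, num_gpus=8, sharded=False)  # ≈ 14.3 GiB per GPU
fsdp = per_gpu_gib(1.1e9, num_gpus=8, sharded=True)   # ≈ 1.8 GiB per GPU
```

Even for a 1.1B model, the replicated Adam state alone pushes DDP's per-GPU footprint past what consumer cards hold once activations are added, which is one plausible reason to default to FSDP.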
-
In my opinion, the generation should be identical when the draft model and target model are the same and the temperature is 0.
But in this case, the output logits of the draft model and the target model have a bit d…
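One plausible explanation (an assumption, not a confirmed diagnosis of this case): the draft and target forward passes may run with different batch shapes and therefore different GPU kernels, and floating-point reductions performed in different orders do not produce bit-identical results. The toy example below shows the effect with plain Python floats:

```python
# Floating-point addition is not associative, so two kernels that reduce
# in a different order can disagree in the last bits of every logit.
a = (0.1 + 0.2) + 0.3   # one accumulation order
b = 0.1 + (0.2 + 0.3)   # same terms, different order

print(a == b)           # False: 0.6000000000000001 vs 0.6
```

At temperature 0 such tiny differences only change the sampled token when two logits are nearly tied, which is why the generations usually match even though the raw logits do not.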
-
Something interesting occurred while upgrading to version 1.8.0. Previously, it had been throwing an "Out of Memory" error, but that issue has now been resolved. However, a new problem has surfaced, w…
-
I want to run the TinyLlama model, and I wonder if there is a way to run GGUF models with this crate. It seems much more common for models to use the GGUF format rather than the GGML format, and con…
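For telling the two container formats apart before picking a loader, the GGUF header is easy to probe: the file starts with the four ASCII bytes `GGUF` followed by a little-endian uint32 format version. A minimal sketch (the helper name is mine, not part of any crate):

```python
import struct

def gguf_version(path):
    """Return the GGUF format version if `path` is a GGUF file, else None.

    A GGUF file begins with the four ASCII bytes b"GGUF" followed by a
    little-endian uint32 format version.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != b"GGUF":
        return None
    return struct.unpack("<I", header[4:8])[0]
```

Legacy GGML files use a different leading magic, so this check cleanly separates the two.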
-
Here is my training script:
```
deepspeed tinyllava/train/train.py \
    --deepspeed ./scripts/zero2.json \
    --model_name_or_path checkpoints/TinyLlama-1.1B-Chat-v1.0/ \
    --version plain…
```
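For context, a minimal ZeRO stage-2 config of the kind a `zero2.json` typically contains might look like the sketch below. This is an illustration, not the actual file from the repo, and the `"auto"` entries rely on the HuggingFace Trainer integration to fill in concrete values:

```json
{
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto",
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": true,
    "contiguous_gradients": true
  },
  "bf16": { "enabled": "auto" }
}
```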
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
https://huggingface.co/apple/OpenELM
Has models ranging from 270M to 3B parameters. Would love to see more support for small models, since I'm stuck with 4gb VRAM currently. Tinyllama can't fill ev…
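A quick sanity check on why these sizes matter for a 4 GB card: weight memory scales linearly with parameter count and bit-width. The estimate below ignores KV cache, activations, and quantization block overhead, so real usage is somewhat higher:

```python
def weight_gib(n_params, bits):
    """Approximate GiB needed just to hold the weights at a given bit-width
    (ignores KV cache, activations, and quantization block overhead)."""
    return n_params * bits / 8 / 2**30

for name, n in [("OpenELM-270M", 270e6), ("TinyLlama-1.1B", 1.1e9), ("OpenELM-3B", 3e9)]:
    print(f"{name}: {weight_gib(n, 16):.2f} GiB at fp16, {weight_gib(n, 4):.2f} GiB at 4-bit")
```

By this estimate, even the 3B model fits comfortably in 4 GB of VRAM once quantized to 4-bit, which supports the case for more small-model support.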
-
Hello,
First I'll say, I'm really impressed by this library and looking forward to TTS!
I ran the example project on my Android Pixel 7 (same one you used), and I am not seeing the same performance t…