-
## 🐛 Bug
```
python3 -m mlc_llm.build --hf-path TinyLlama/TinyLlama-1.1B-Chat-v0.6 --target iphone --quantization q4f16_1 --use-cache 0 --use-safetensors …
```
-
I think it would be good to track how the performance of TinyLlama and TinyLlama-Chat evolves across checkpoints.
We could do this through the HF leaderboard, but it takes quite long.
What would you sugge…
-
Were there any trade-offs or considerations you made when deciding on the model's size? What criteria did you use to select the specific number of layers, attention heads, embedding size, etc., in…
-
### Describe the issue as clearly as possible:
When I try the examples on the GitHub front page, some do not work from a fresh conda environment.
### Steps/code to reproduce the bug:
```python
…
```
-
I was trying to use this, but running script.sh didn't work after installing the requirements in a venv. I also tried running chat_gradio, but the gradio package was not even in requirements. It worked after I pip ins…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid, so there are no tagged versions as of…
-
Amazing work! I really like the project!
If I understand correctly, TinyLlama/TinyLlama-1.1B-Chat-v0.6 is fine-tuned following the Zephyr recipes from HF4.
I assume you did a full training and not…
-
As of at least the two most recent versions, I have been experiencing a lot of issues with Ollama. Primarily, it reports that it can't connect to the server when using the Ollama CLI commands…
-
# Expected Behavior
I expected finetune to produce a usable LoRA adapter for all supported models.
# Current Behavior
For Mistral models (I tried both Mistral and Zephyr, Q8_0, Q5_K_M, Q5_0) …
-
After fine-tuning the model, I obtained a 2.2 GB PyTorch model.bin file. Is it possible to reduce this model size to 550 MB, and if so, how and when can we achieve this?
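As a rough sketch of the size arithmetic (my own back-of-the-envelope reasoning, not an official answer): a ~1.1B-parameter model stored in 16-bit floats is about 2.2 GB, so quantizing the weights to 4 bits per parameter would land near 550 MB. The helper below is hypothetical and ignores quantization metadata (scales, zero-points), which adds a small overhead in practice.

```python
def approx_size_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate checkpoint size in GB: parameters x bits, divided by 8 bits per byte.

    Ignores quantization metadata (per-group scales and zero-points),
    which adds a few percent on top in real formats like q4 GGUF.
    """
    return n_params_billion * bits_per_weight / 8


# ~1.1B parameters at fp16 gives roughly the observed 2.2 GB file;
# 4-bit weights bring that down to roughly 550 MB.
fp16_gb = approx_size_gb(1.1, 16)  # ~2.2
q4_gb = approx_size_gb(1.1, 4)     # ~0.55
```

In other words, the 550 MB target corresponds to roughly 4-bit quantization of a 2.2 GB fp16 checkpoint, a 4x reduction.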