-
Great work! Can you do a Mistral 1B TinyLlama?
Mistral AI is good.
-
I am currently running the container on Unraid. I have used the Docker Compose file as well as manually creating the container and changing the storage mounts. I am able to download the models from HF and …
-
### Discussed in https://github.com/ggerganov/whisper.cpp/discussions/1656
Originally posted by **Sing303** December 19, 2023
Now when I try to use quantized models with full-GPU cuBLAS, in…
-
I'm currently trying out the Ollama app on my iMac (i7/Vega 64) and I can't seem to get it to use my GPU.
I have tried running it with `num_gpu 1`, but that generated the warnings below.
```
2023/11/…
```
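For reference, `num_gpu` can also be pinned in a Modelfile instead of being set at runtime. A minimal sketch, assuming a locally available `llama2` model tag (the tag and value here are illustrative, not from the report above):

```
# Hypothetical Modelfile: base model tag and num_gpu value are examples only
FROM llama2
PARAMETER num_gpu 1
```

You would then build and run it with `ollama create mymodel -f Modelfile` followed by `ollama run mymodel`.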
-
# ❓ Questions and Help
Thanks for the great project. Recently torch 2.1.0 was released as stable. Are there any build plans or development releases for this?
-
First, thanks for all the effort you have put into the TinyLlama project - it's awesome!
Recently I ran into a sporadic problem: when there is only one card, the training gradient vanishes. …
-
The TinyLlama project aims to pretrain a 1.1B-parameter Llama model on 3T tokens, so it should be an ideal draft model for speculative inference.
https://github.com/jzhang38/TinyLlama
https://huggingfac…
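To illustrate why a small model helps, here is a toy sketch of speculative decoding: a cheap "draft" model proposes several tokens, and the large "target" model verifies them, keeping the longest agreeing prefix plus one corrected token. Both models below are hypothetical deterministic stand-ins over integer token ids, not real TinyLlama/Llama weights.

```python
def draft_model(context, k):
    """Cheaply propose k next tokens (stand-in for a small draft model)."""
    out = list(context)
    proposed = []
    for _ in range(k):
        nxt = (out[-1] + 1) % 50  # dummy next-token rule
        proposed.append(nxt)
        out.append(nxt)
    return proposed

def target_model(context):
    """One greedy next token from the large model (stand-in).

    A dummy rule that agrees with the draft except when the
    context length is a multiple of 5, to force a rejection."""
    nxt = (context[-1] + 1) % 50
    if len(context) % 5 == 0:
        nxt = (nxt + 1) % 50
    return nxt

def speculative_step(context, k=4):
    """Propose k draft tokens, verify each with the target model,
    and keep the accepted prefix plus one corrected token."""
    proposal = draft_model(context, k)
    accepted = []
    for tok in proposal:
        expected = target_model(context + accepted)
        if tok == expected:
            accepted.append(tok)       # draft token verified, keep going
        else:
            accepted.append(expected)  # take the target's token and stop
            break
    return accepted

tokens = [1, 2, 3]
tokens += speculative_step(tokens)
print(tokens)
```

Each call to `speculative_step` can emit several tokens per target-model verification pass, which is where the speedup comes from when the draft model agrees with the target most of the time.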
-
If the integrity check fails, there is no feedback for this code in tinyllama.py:
```
def train(fabric, state, train_dataloader, val_dataloader, monitor, resume):
    model = state["model"]
    opt…
```
-
First, I want to express my gratitude for this project. I think TinyLlama has a lot of potential and we're just starting to see it. Kudos!
I'm pretty new to this exciting field and this is the fi…
-
The `ollama serve` command runs normally and detects my GPU:
```
2024/01/09 14:37:45 gpu.go:34: Detecting GPU type
ama 2024/01/09 14:37:45 gpu.go:53: Nvidia GPU detected
ggml_in…
```