-
In my opinion, the generation should be the same when the draft model and the target model are the same and the temperature is 0.
But in this case, the output logits of the draft model and the target model have a bit d…
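For reference, at temperature 0 the speculative-decoding acceptance rule degenerates to an exact argmax match, so any tiny logit difference between the two models can flip a token and cause an early rejection. A minimal sketch (the `greedy_accept` helper is hypothetical, for illustration only):

```python
import numpy as np

def greedy_accept(draft_tokens, target_logits):
    """Temperature-0 acceptance: keep each draft token only if it equals the
    target model's argmax at that position; stop at the first mismatch."""
    accepted = []
    for t, d_tok in enumerate(draft_tokens):
        if int(np.argmax(target_logits[t])) == d_tok:
            accepted.append(d_tok)
        else:
            break  # first mismatch: the target resamples here and we stop
    return accepted

# With bit-identical logits every draft token is accepted...
logits = np.array([[0.1, 2.0, 0.3],
                   [1.5, 0.2, 0.1]])
draft = [int(np.argmax(row)) for row in logits]   # [1, 0]
print(greedy_accept(draft, logits))               # -> [1, 0]

# ...but a small numeric perturbation (e.g. kernel nondeterminism) that
# flips one argmax causes an early rejection of the remaining tokens:
perturbed = logits.copy()
perturbed[1, 1] += 1.4                            # argmax of row 1 is now 1
print(greedy_accept(draft, perturbed))            # -> [1]
```

This is why even "identical" draft and target models can diverge in practice when their logits differ slightly.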
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
### What is the issue?
Background:
Kubernetes 1.31 introduced a new feature: [Read-Only Volumes Based on OCI Artifacts](https://kubernetes.io/blog/2024/08/16/kubernetes-1-31-image-volume-source/).…
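For context, the feature exposes an `image` volume source that mounts an OCI artifact read-only into the pod. A minimal sketch based on the linked blog post (alpha API, requires the `ImageVolume` feature gate; the artifact reference below is the example from the post, not a real requirement):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: image-volume-demo
spec:
  containers:
  - name: shell
    image: debian
    command: ["sleep", "infinity"]
    volumeMounts:
    - name: artifact
      mountPath: /artifact      # contents of the OCI artifact, read-only
  volumes:
  - name: artifact
    image:                      # alpha `image` volume source (Kubernetes 1.31)
      reference: quay.io/crio/artifact:v1
      pullPolicy: IfNotPresent
```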
-
Hello,
First I'll say, I'm really impressed by this library and looking forward to TTS!
I ran the example project on my Android Pixel 7 (same one you used) and I am not seeing the same performance t…
-
### System Info
```shell
optimum-neuron 0.0.20
neuronx-cc 2.*
python 3.10
```
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [X] My own modified scripts
…
-
-
https://huggingface.co/apple/OpenELM
Has models ranging from 270M to 3B parameters. Would love to see more support for small models, since I'm stuck with 4gb VRAM currently. Tinyllama can't fill ev…
-
**Description**
When a user performs a long-running inference request via HTTPServer, they may lose the connection or intentionally abort it (Ctrl-C from curl).
Ideally, the HTTP server will…
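The desired behavior can be sketched as racing the inference task against a disconnect signal and cancelling the work as soon as the client goes away. A minimal `asyncio` sketch (the `serve_request`/`run_inference` names are hypothetical, not part of the server's API):

```python
import asyncio

async def run_inference(state):
    try:
        await asyncio.sleep(10)            # stands in for a long generation loop
        return "result"
    except asyncio.CancelledError:
        state["cancelled"] = True          # release model resources here
        raise

async def serve_request(disconnect: asyncio.Event, state):
    """Race inference against a client-disconnect event; cancel on disconnect."""
    task = asyncio.create_task(run_inference(state))
    waiter = asyncio.create_task(disconnect.wait())
    done, _ = await asyncio.wait({task, waiter},
                                 return_when=asyncio.FIRST_COMPLETED)
    if waiter in done and not task.done():
        task.cancel()                      # stop wasting compute on a dead client
        try:
            await task
        except asyncio.CancelledError:
            pass
        return None
    waiter.cancel()
    return task.result()

async def main():
    disconnect = asyncio.Event()
    state = {"cancelled": False}
    # simulate the client aborting (Ctrl-C from curl) shortly after starting
    asyncio.get_running_loop().call_later(0.05, disconnect.set)
    result = await serve_request(disconnect, state)
    print(result, state["cancelled"])      # -> None True

asyncio.run(main())
```

In a real server the disconnect event would come from the HTTP layer noticing the closed socket (e.g. a zero-length read), but the cancellation pattern is the same.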
-
With limited memory on most phones, there are community requests to support a smaller model like Phi-3 mini. It may be supported out of the box, but needs verification, evaluation and pr…
-
I would like to request 1 or 2 examples of how to adapt this for popular open models, such as:
https://huggingface.co/mistralai/Mistral-7B-v0.1
https://huggingface.co/meta-llama/Llama-2-7b-hf
h…