-
Hello,
First I'll say, really impressed by this library and looking forward to TTS!
I ran the example project on my Android Pixel 7 (same one you used) and I am not seeing the same performance t…
-
With limited memory on most phones, there are community requests to support a smaller model such as Phi-3 mini. It may be supported out of the box, but it needs verification, evaluation, and pr…
-
Hi,
I am trying to use this framework with causal models such as Llama-based models and other LLMs. In my case, I use TinyLlama and Pythia to replace the T5 model in the original pipeline (TinyLlam…
-
Hi unslothai, I got different inference results when using Unsloth. I've tested Qwen1.5-chat and TinyLlama-chat and hit the same issue: generation with Unsloth always gives a worse result compared with transformers …
-
Hi @ilur98, thanks for your great work on this repository. I am attempting to modify your work to support W8A8, as I found that static W4A8 gives too large a quantization error.
I am r…
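As a quick illustration of why the bit width matters here, the sketch below (hypothetical, not code from this repository) compares the round-trip error of symmetric per-tensor weight quantization at 4 and 8 bits on a toy weight tensor:

```python
# Hypothetical sketch: symmetric per-tensor quantize/dequantize round trip,
# comparing the mean squared error at 4 bits (W4) vs 8 bits (W8).

def quantize_dequantize(weights, bits):
    """Symmetric per-tensor quantization followed by dequantization."""
    qmax = 2 ** (bits - 1) - 1                    # 7 for int4, 127 for int8
    scale = max(abs(w) for w in weights) / qmax   # per-tensor scale
    return [round(w / scale) * scale for w in weights]

def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

weights = [0.013 * i - 0.5 for i in range(77)]    # toy weight tensor
err4 = mse(weights, quantize_dequantize(weights, 4))
err8 = mse(weights, quantize_dequantize(weights, 8))
print(err4 > err8)  # the 4-bit grid is much coarser
```

The coarser 4-bit grid dominates the error, which matches the motivation for moving the weights to 8 bits while keeping 8-bit activations.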
-
I would like to request one or two examples of how to adapt this for popular open models, such as:
https://huggingface.co/mistralai/Mistral-7B-v0.1
https://huggingface.co/meta-llama/Llama-2-7b-hf
h…
-
**Description**
When a user performs a long-running inference request via HTTPServer, they may lose the connection or intentionally abort it (ctrl-c from curl).
Ideally, the HTTP server will…
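One way this could work, sketched below with plain asyncio (names and structure are illustrative assumptions, not the project's actual server code): when the client disconnects, the handler task is cancelled, and the inference coroutine catches the cancellation to release its resources (e.g. a KV cache or batch slot) before exiting.

```python
import asyncio

# Hypothetical sketch: a long-running "inference" coroutine that cleans up
# when its task is cancelled, e.g. because the HTTP client disconnected.

async def run_inference(steps, released):
    try:
        for _ in range(steps):
            await asyncio.sleep(0.01)   # stand-in for one decode step
    except asyncio.CancelledError:
        released.append("freed")        # release KV cache / batch slot here
        raise                           # re-raise so the task ends cancelled

async def main():
    released = []
    task = asyncio.create_task(run_inference(1000, released))
    await asyncio.sleep(0.05)           # client aborts (ctrl-c from curl)
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass
    return released

print(asyncio.run(main()))  # ['freed']
```

The key point is re-raising `CancelledError` after cleanup, so the task is properly marked as cancelled rather than swallowing the abort.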
-
### Summary
This only works when the available RAM is several times the model size. I think we could demo a PoC using TinyLlama.
1. Start several API server instances, each on a different port.
2. S…
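Assuming the instances are up (one per port), the PoC client could spread requests across them with a simple round-robin, sketched below. The ports and the dispatch shape are assumptions for illustration; a real client would POST each request to the chosen instance.

```python
import itertools

# Hypothetical sketch: round-robin dispatch of requests across several
# API server instances, each assumed to listen on its own local port.

ports = [8000, 8001, 8002]
next_port = itertools.cycle(ports)

def dispatch(request_id):
    """Pick the next instance for a request; a real client would POST
    to http://localhost:{port} here instead of returning a tuple."""
    port = next(next_port)
    return (request_id, port)

assignments = [dispatch(i) for i in range(6)]
print(assignments)
# [(0, 8000), (1, 8001), (2, 8002), (3, 8000), (4, 8001), (5, 8002)]
```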
-
TinyLlama fine-tuned for function calling