tinyllama Search Results

1000+ results
for tinyllama

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/trl #696

SFT Llama 2 chat model ignoring EOS and BOS tokens

Hello. Using SFTTrainer, and Qlora, I have been finetuning a variety of LLama 2 Chat models. I have my dataset structured like the following based on what I have read to be the correct format: ``…

mallorbc updated 10 months ago
6
OpenInterpreter/open-interpreter #856

Ollama is a better LLM server for local

### Is your feature request related to a problem? Please describe. I'm using ollama for many things, running lm-studio for this seems wrong as it only runs as an app image. ### Describe the soluti…

iplayfast updated 6 months ago
14
jzhang38/TinyLlama #79

How to disable flash attention?

Hi~My GPU does not support flash attention (V100), so I want to disable it. I noticed that if flash attention is not installed in my environment, the variable [`FlashAttention2Available`](https://gith…

ZhouqyCH updated 9 months ago
3
tairov/llama2.mojo #27

Make the tokenizer better

I'm trying to make llama2.mojo work on tinyllama-1.1B. Which is a GQA and not tie_embedding model. Now I have finish converting the model and modify part of llama2.mojo(llama.cpp,llama.c). I have n…

magician-blue updated 11 months ago
33
mlc-ai/mlc-llm #1379

[Bug] Loading Model and cold start prompting freezes applica…

## 🐛 Bug I'm noticing this both with using the default LLama-2-7b and [TinyLlama-1.1b](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6). When loading the model for the first time or when …

tobrun updated 9 months ago
4
cg123/bitnet #2

InternalTorchDynamoError

Thank you for the implementation! Have you come across this error? `InternalTorchDynamoError: 'NoneType' object is not subscriptable` Code is a hello world basically: ```python from bitnet.con…

geronimi73 updated 6 months ago
1
Mobile-Artificial-Intelligence/maid #197

App crash when running certain 1B models

Running Maid on Moto G9 Android 11. Tried to run two 1B models obtained from Hugging face [this one](https://huggingface.co/TheBloke/Tinyllama-2-1b-miniguanaco-GGUF) and another one. The model is load…

RookieIndieDev updated 9 months ago
7
ggerganov/llama.cpp #4185

update_slots : failed to decode the batch

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…

rvandernoort updated 5 months ago
8
google/sentencepiece #931

[Question] How does encoding work?

I am trying to understand how SentencePiece encoding works. My current understanding is: * A model is loaded. The model can map "pieces" to "scores". * A given text is prepended with the `"▁"` cha…

99991 updated 10 months ago
2
janhq/jan #1859

bug: halt and message: "Message queued. It can be sent once …

**Describe the bug** until today Jan always worked, but now I get directed to this message: Message queued. It can be sent once the model has started and nothing happens...no activity on cpu...nothi…

lineality updated 5 months ago
40

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for tinyllama

1000+ results
for tinyllama