-
See https://huggingface.co/PY007/TinyLlama-1.1B-step-50K-105b/blob/main/tokenizer_config.json#L22
-
When I began to try and determine working models for this application (https://github.com/imartinez/privateGPT/issues/1205), I was not understanding the importance of prompt template:
Therefore I h…
-
Hi, I see the mention of running this model on llama.cpp in README. Did you get a manage to get it to run and quantize with good output? I'm trying to evaluate if this model can be used for speculativ…
-
I adapted TimDettmers filtered Openassistant dataset in order for it to take the Llama 2 prompt format (e.g. with INST), see [here](https://huggingface.co/datasets/Trelis/openassistant-llama-style/).
…
-
I noticed that your tokenizer doesn't add the bos and eos token to the final tensor during encoding. Does this have any impact on pretraining? If it's intentional not to add them, what is the reason …
-
### Contact Details
rpchastain@proton.me
### What happened?
I'm attempting to use `mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf` weights on an AWS ec2 instance with an AMD EPYC 7R13 and 4 NVidia L4 gpus.…
-
When running the whisper.swiftui example, compiled in XCode, transcription fails with the following log:
> About to run whisper_full
whisper_full_with_state: failed to encode
Failed to run the mo…
-
### Bug description
right now im building an ML library and i built my own tensors and there's a little problem with loading the data from buffer and i get this error belove :
```text
Hidden Size…
-
Opening a new issue (see https://github.com/ollama/ollama/pull/2195) to track support for integrated GPUs. I have a AMD 5800U CPU with integrated graphics. As far as i did research ROCR lately does su…
-
Trying to use a quantized version of the ultra small (260k) tinyllamas model here:
https://huggingface.co/klosax/tinyllamas-stories-gguf/blob/main/tinyllamas-stories-260k-f32.gguf
F32 and F16 work…