-
### OS Platform and Distribution
tried on Windows 11 and mac M3
### Mobile device
Android emulator and real device api 34
### Programming Language and version
flutter 3.22 and dart 3.4
…
-
the instruction code for mpt-7b works fine when using older version 20240123, but when updating to the latest branch, using the new code, always have OOM error with multiple gpus, even when using 8*A1…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports.
…
-
As I understand, its quite straightforward to load a 4-bit quantized model with `litgpt serve` through CLI using:
`litgpt serve google/gemma-2-2b-it --quantize bnb.nf4-dq`
However, is there a way …
-
**Problem**
HF repositories do not all have the same structure, e.g., random file names, varying folder structures. I would like to download a model using an absolute model URL.
**Success Criteria…
-
**Describe the bug**
Hi
I am trying sampler example here https://keras.io/examples/generative/text_generation_gpt/ in Gemma
the preprocessor in Gemma return dictionary of token_ids and padding…
-
### Discussed in https://github.com/xtekky/gpt4free/discussions/2217
Originally posted by **AlirezaAbavi** September 11, 2024
Hello.
I just found this repository. So I have a few questions.
…
-
### Motivation
Hi friends,
I'm opening this issue as a place to discuss small vision-language models, please share your thoughts below!
There's recently been great success in research with sm…
-
I would like to request 1 or 2 examples of how to adapt this for a popular open models, such as:
https://huggingface.co/mistralai/Mistral-7B-v0.1
https://huggingface.co/meta-llama/Llama-2-7b-hf
h…
-
I'm trying to deploy Llama3 8b on GKE using optimum but running into some troubles.
Following instructions here: https://github.com/huggingface/optimum-tpu/tree/main/text-generation-inference. I bu…