-
The cookbook aims to provide a comprehensive guide for researchers and practitioners interested in fine-tuning the Gemma model from Google on a mental health assistant dataset.
Key components of th…
-
Hello.
I'm trying to reproduce the results in the leaderboard.
For each model, I run the following script, as described in the README.md.
The script is run in a Python 3.10.15 environment created by…
-
- [x] MiniCPM-Llama3-V-2_5
- [x] Florence 2
- [x] Phi-3-vision
- [x] Bunny
- [x] Dolphin-vision-72b
- [x] Llava Next
- [x] Qwen2-VL
- [x] Pixtral
- [x] Llama-3.2
- [x] Llava Interleave
- [x] …
-
It's effectively being used more broadly than the VNG/GEMMA & Zaakgericht Werken context; it really is aimed at JSON-based, OpenAPI 3-driven services.
Proposal: `oas3-client` (boring but carries the wei…
-
Hi.
I've been fine-tuning Gemma-2-2B-it on Google Colab and saved the fine-tuned model to the Hugging Face Hub.
When I load the model from the Hub, I keep getting inference errors.
`from unsloth impo…
-
**Qwen2**
warning: not compiled with GPU offload support, --n-gpu-layers option will be ignored
warning: see main README.md for information on enabling GPU BLAS support
Log start
main: build = 2…
-
Hi @danielhanchen
I am trying to fine-tune gemma2-2b for my task, following the continued fine-tuning guidelines in unsloth. However, I am running into OOM errors while doing so. My intent is to train gemm…
-
**Describe the bug**
`git lfs pull --include gemma-2-9b-it-Q8_0_L.gguf`
vs
`git lfs pull gemma-2-9b-it-Q8_0_L.gguf` (typed accidentally)
does not make it very clear how many files, or how much data …
-
- [ ] [llm-adaptive-attacks/README.md at main · tml-epfl/llm-adaptive-attacks](https://github.com/tml-epfl/llm-adaptive-attacks/blob/main/README.md?plain=1)
# Jailbreaking Leading Safety-Aligned LLMs…
-
Nice project. Open-source models are usually good at one task, such as coding or writing, so it would be interesting if we could set a parameter in the .env to specify which model that agent shoul…
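
To make the request concrete, here is a rough sketch of what per-agent model selection through environment variables might look like. It is not taken from the project: the `AGENT_MODEL_<NAME>` naming scheme, the `AGENT_MODEL_DEFAULT` variable, the fallback model name, and the `model_for_agent` helper are all hypothetical. It also assumes the .env file has already been loaded into the process environment by whatever mechanism the project uses.

```python
import os

# Hypothetical fallback used when nothing is configured at all.
DEFAULT_MODEL = "default-model"


def model_for_agent(agent_name, env=None):
    """Return the model configured for an agent, falling back to a default.

    Looks up AGENT_MODEL_<AGENT_NAME> first, then AGENT_MODEL_DEFAULT.
    Both variable names are assumptions made for this illustration.
    """
    env = os.environ if env is None else env
    key = f"AGENT_MODEL_{agent_name.upper()}"
    return env.get(key) or env.get("AGENT_MODEL_DEFAULT", DEFAULT_MODEL)


if __name__ == "__main__":
    # Example values standing in for entries in a .env file.
    example_env = {
        "AGENT_MODEL_CODER": "some-coding-model",
        "AGENT_MODEL_DEFAULT": "some-general-model",
    }
    print(model_for_agent("coder", example_env))
    print(model_for_agent("writer", example_env))
```

With a lookup like this, an agent without its own entry still gets a working model, so existing .env files would keep working unchanged.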