-
```
IllegalArgumentException: Cannot pass vectors from more than one quantitation type: QuantitationType Id=532865 Name=VALUE - Processed version General Type=QUANTITATIVE Type=AMOUNT Scale=LOG2 Repr…
-
Just opening this to add support for all models following #34184
Lets bring support to all model! 🤗
- [x] Llama
It would be great to add the support for more architectures such as
- [ ] Qwe…
-
## タイトル: ShieldGemma:Gemmaに基づく生成AIコンテンツモデレーション
## リンク: https://arxiv.org/abs/2407.21772
## 概要:
Gemma2を基盤としたLLMベースの安全コンテンツモデレーションモデル群「ShieldGemma」を紹介します。これらのモデルは、ユーザー入力とLLM生成出力の両方において、主要な有害タイプ(性的描写…
-
I didn't modify the script. When use 4bit lora training, when starting training, it reports error of " masked_scatter_: expected self and source to have same dtypes but got BFloat16 and Float". How t…
-
LM Studio 0.2.27
GPU acceleration: On, with CUDA.
From: [bartowski/Gemma-2-9B-It-SPPO-Iter3-GGUF](https://huggingface.co/bartowski/Gemma-2-9B-It-SPPO-Iter3-GGUF/resolve/main/Gemma-2-9B-It-SPPO-Iter3…
-
see #27
https://ai.google.dev/gemma/docs?hl=en
https://www.kaggle.com/models/google/gemma
Gemma on Vertex AI Model garden
https://console.cloud.google.com/vertex-ai/publishers/google/model-gard…
-
I'm having an issue with Gemma model. I used swift playground. Here is sample code to reproduce below error:
```
import UIKit
import Tokenizers
func testTokenizer() async throws -> Tokenizer…
-
Evaluating gemma-2b with xcopa looks good, but the xnli result looks weird.
xcopa result:
```
"results": {
"xcopa_zh": {
"acc,none": 0.616,
"acc_stderr,none": 0.021772369465…
-
Exceptions that are intercepted by DWR and serialized for the client-side should be reported on the server-side.
They should receive the [same treatment that we reserve to other Gemma Web exception…
-
hi, can you share some performance data on MTK or Qualcomm chips?
such as QWen or Gemma model's prefill and decode speed?
thanks very much.
yuimo updated
3 months ago