-
I'm running on WSL2/Ubuntu on Win11. Deliberately using CPU mode as my GPU is too weak. Using Python 3.10.12.
Here is the output when trying to run sampling.py:
```
~/gemma$ python3 examples/sa…
-
### System Info
```shell
Optimum version: d87efb2
Transformers version: d479665
ONNXRuntime version: 1.17.1
ONNX version: 1.15.0
```
### Who can help?
@michaelbenayoun @echarlaix
### Informat…
-
## Bug report
**Describe the bug**
LLM Engine failed in ValidatedGraphConfig Initialization step.
### Steps to reproduce
Steps to reproduce the behavior:
1. Download gemma-2b-it-gpu-int8.…
-
Hello, please upgrade the Google Gemma models to version 1.1 and include them in the prebuilt Android app.
Links to the models:
https://huggingface.co/google/gemma-1.1-2b-it
https://huggingface.co/google/gemma-1.1…
-
# Experiments
Idea: Repeat most of the unlearning experiments (continuous, batch, sequential) with harmfulness and evaluate. Based on the results decide the best hyperparameters for unlearning fren…
-
I am using the "google/gemma-2b-it" model from HuggingFace. I noticed there are 99 unused tokens among the first 106 token IDs. Does anyone know their purpose? Just wondering.
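For anyone who wants to check this themselves: with a HuggingFace tokenizer you can pull the vocabulary via `get_vocab()` and count how many of the low IDs are `<unused...>` placeholders. A minimal sketch below uses a toy vocabulary as a stand-in, since the real Gemma tokenizer is gated and not assumed to be available here:

```python
def count_unused(vocab, first_n=106):
    """Count how many of the first `first_n` token IDs map to
    reserved '<unused...>' placeholder tokens in a vocab dict
    (token string -> token id)."""
    by_id = {i: t for t, i in vocab.items()}
    return sum(
        1 for i in range(first_n)
        if by_id.get(i, "").startswith("<unused")
    )

# Toy vocabulary for illustration only -- NOT the real Gemma vocab.
# It mimics the pattern of a few special tokens followed by a run
# of '<unused0>' ... '<unused98>' placeholders.
toy_vocab = {"<pad>": 0, "<eos>": 1, "<bos>": 2}
toy_vocab.update({f"<unused{k}>": 3 + k for k in range(99)})

print(count_unused(toy_vocab, first_n=106))  # → 99
```

With the real tokenizer you would pass `AutoTokenizer.from_pretrained("google/gemma-2b-it").get_vocab()` instead of `toy_vocab`. Such placeholder tokens are commonly reserved so users can add custom special tokens later without resizing the embedding matrix.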
-
# Model Parameters Support Hub
Hi everyone, the PaddleNLP team has compiled detailed information on the parameters of each model here for easy reference.
## Model Parameters
### Base Models
| Model | 0.5B | 1~2B | 3~4B | 6~7B | 13~14B | 30~32B | 50~60B | 65~72B | 110B | >110B |
|:---------:|:--…
-
**Describe the bug**
When attempting to shard a `gemma_2b_en` model across two (consumer-grade) GPUs, I get:
```
ValueError: One of device_put args was given the sharding of NamedSharding(mesh=…
-
### Feature request
I see that [llama](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py#L829-L835) will remove tuple-format past key values in 4.43.
### Moti…
-
Hi 👋 ,
It would be really great if you could add support for the Gemma model series (i.e. the 2B and 7B variants; the 7B is the one I'd like most), since I see that it is currently not su…