-
Context window size is still largely a manual setting right now: it can be specified per request via `{"options": {"num_ctx": 32768}}` in the API, or per model via `PARAMETER num_ctx 32768` in the Modelfile. Otherwise the default value is…
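A minimal sketch of setting it per request against a local Ollama server (the model tag and prompt are placeholders; with `"stream": false` the API returns a single JSON object):
```python
import requests  # assumes the requests package is installed

# Override the context window for a single /api/generate call.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "gemma2",                # placeholder: any locally pulled model tag
        "prompt": "Summarize this long document...",
        "stream": False,
        "options": {"num_ctx": 32768},    # context window size in tokens
    },
)
print(resp.json()["response"])
```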
-
I used the [llm_inference](https://github.com/googlesamples/mediapipe/tree/main/examples/llm_inference) sample with `gemma-2b-it-cpu-int4.bin` on a Pixel 8 Pro emulator.
The prefill speed seems to be in…
-
### System Info
python version: 3.11.9
transformers version: 4.44.2
accelerate version: 0.33.0
torch version: 2.4.0+cu121
### Who can help?
@gante
### Information
- [X] The official example sc…
-
**Issue: Model Error when Setting max_seq_length > 8192**
**Description:**
The `unsloth/codegemma-2b-bnb-4bit` model throws an error when attempting to set `max_seq_length` greater than 8192.
…
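A minimal repro sketch of the kind of call that hits this limit, assuming the usual Unsloth loading pattern (the 16384 value is just an illustrative number above 8192; the exact traceback is not shown above):
```python
from unsloth import FastLanguageModel

# CodeGemma's native context length is 8192; asking for more is what
# reportedly triggers the error.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/codegemma-2b-bnb-4bit",
    max_seq_length=16384,   # > 8192
    load_in_4bit=True,
)
```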
-
I found that the current version of LongLM can not load Gemma 1 or Gemma 2 model successfully. I wrote a minimum test to help reproduce the issue:
```python
# transformers version 4.38.2
# this exa…
-
### What happened?
I'm encountering an issue with the autogen library (version 0.3.1) when using OpenAI as the LLM provider (version 1.52.2). The error occurs during the generation of responses with …
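For context, a minimal sketch of the classic two-agent setup this kind of report usually involves (the model name, API key handling, and message are placeholders, not taken from the report):
```python
import os
import autogen

# OpenAI config for the 0.2/0.3-style agent API.
config_list = [{"model": "gpt-4o", "api_key": os.environ["OPENAI_API_KEY"]}]

assistant = autogen.AssistantAgent("assistant", llm_config={"config_list": config_list})
user_proxy = autogen.UserProxyAgent(
    "user_proxy",
    human_input_mode="NEVER",
    code_execution_config=False,
)

# The reported error would surface while the assistant generates its reply.
user_proxy.initiate_chat(assistant, message="Help me debug this script.")
```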
-
- [ ] [vidore/colpali · Hugging Face](https://huggingface.co/vidore/colpali)
# ColPali: Visual Retriever based on PaliGemma-3B with ColBERT strategy
## Model Description
This model is built iterati…
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
None
### OS Platform and Distribution
Windows 11, Chrome V130
### Mobile device if the issue happe…
-
As I understand it, it's quite straightforward to load a 4-bit quantized model with `litgpt serve` through the CLI using:
`litgpt serve google/gemma-2-2b-it --quantize bnb.nf4-dq`
However, is there a way …
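For reference, once the server above is running, the served model can be exercised over HTTP; this sketch assumes the default LitServe-style `/predict` route and payload that `litgpt serve` documents, which may differ between versions:
```python
import requests  # assumes the requests package is installed

# Query a model started with:
#   litgpt serve google/gemma-2-2b-it --quantize bnb.nf4-dq
resp = requests.post(
    "http://127.0.0.1:8000/predict",
    json={"prompt": "What do llamas eat?"},
)
print(resp.json()["output"])
```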
-
Hello everyone,
I'm excited to be using ONNX Runtime GenAI. It's an amazing library for anyone looking to run models on their device. I've been learning how to use ONNX GenAI by following various t…
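For context, a minimal generation sketch in the style of the ONNX Runtime GenAI tutorials (the model folder path is a placeholder, and the exact Python API surface has shifted a bit between releases):
```python
import onnxruntime_genai as og

# Folder containing the exported ONNX model plus genai_config.json (placeholder path).
model = og.Model("models/gemma-2b-it-onnx")
tokenizer = og.Tokenizer(model)
tokenizer_stream = tokenizer.create_stream()

params = og.GeneratorParams(model)
params.set_search_options(max_length=256)
params.input_ids = tokenizer.encode("Why is the sky blue?")

# Token-by-token generation loop, as shown in the library's tutorials.
generator = og.Generator(model, params)
while not generator.is_done():
    generator.compute_logits()
    generator.generate_next_token()
    print(tokenizer_stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
```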