-
[Daniel Han](https://twitter.com/danielhanchen), in his [blog post](https://unsloth.ai/blog/gemma-bugs), shared his discoveries of what is also wrong with the Gemma implementation.
Some of them are on…
-
### Open Task RFP for privacy-preserving machine learning inference using MPC
#### Executive Summary
- Project Overview: In this project, we want to see the current state of the privacy-preserving m…
-
Are there any speed benchmarks?
-
First, thanks for this work. Providing open source SAEs for a model like Llama is a huge boon to the community.
I'm working on a simple script to use your `generate_description` function to assign …
-
During LoRA training, iterate_batches [calls](https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/tuner/trainer.py#L104) `tokenizer.encode()` (with default arguments) on the dataset item,…
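For context, a minimal sketch of what `tokenizer.encode()` does with default arguments, assuming a Hugging Face-style tokenizer (which is what `mlx_lm` wraps); the model id is only illustrative:

```python
from transformers import AutoTokenizer

# Illustrative model id; any Hugging Face tokenizer behaves similarly.
tok = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

text = "Hello world"
# With default arguments, encode() adds the model's special tokens
# (typically a leading BOS, sometimes an EOS) to the returned ids.
with_specials = tok.encode(text)
# With add_special_tokens=False, only the raw token ids are returned.
without_specials = tok.encode(text, add_special_tokens=False)
print(with_specials, without_specials)
```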
-
I encountered this error while adding text. I hope to find a solution for it. Thank you very much.
Traceback (most recent call last):
File "/home/jyc23/raptor-master/demo/newdemo…
-
**Describe the bug**
The model response doesn't stop; it keeps writing. I tried both `swift deploy` and `vllm`.
Training arguments:
```bash
HF_HUB_ENABLE_HF_TRANSFER=1 \
USE_HF=1 \
CUDA_VISIBLE…
-
Evaluating gemma-2b with xcopa looks good, but the xnli result looks weird.
xcopa result:
```
"results": {
"xcopa_zh": {
"acc,none": 0.616,
"acc_stderr,none": 0.021772369465…
-
Hi team, many thanks for GaLore. I'm currently using HuggingFace for fine-tuning, and I'm curious about integrating GaLore with HuggingFace.
It's not an issue; I'm just interested in using GaLore with Huggin…
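For what it's worth, recent `transformers` releases expose GaLore through the `Trainer` optimizer settings; a minimal sketch under that assumption (the target-module regexes are illustrative):

```python
from transformers import TrainingArguments

# Assumes a recent transformers release with built-in GaLore support and the
# galore-torch package installed; the rest of the Trainer setup
# (model, tokenizer, dataset, trainer.train()) stays unchanged.
args = TrainingArguments(
    output_dir="galore-finetune",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    learning_rate=2e-5,
    optim="galore_adamw",                            # GaLore-projected AdamW
    optim_target_modules=[r".*attn.*", r".*mlp.*"],  # modules whose gradients get low-rank projection
)
```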
-
### What is the issue?
I just set the chat format to JSON, and Ollama's speed at generating chat content slowed down roughly tenfold.
For example, when I use the gemma7b model and the chat forma…
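For anyone trying to reproduce the slowdown, a minimal sketch of a request against Ollama's chat API; the prompt is illustrative, and the `format` field is what gets dropped for the baseline timing:

```python
import requests

# Illustrative request against a local Ollama server; remove the "format"
# field to time the unconstrained baseline.
resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "gemma:7b",
        "messages": [{"role": "user", "content": "Name three colors."}],
        "format": "json",   # constrained JSON output, the setting reported to slow generation
        "stream": False,
    },
)
print(resp.json()["message"]["content"])
```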