-
Hello,
Beam's latest version is V2, and they made drastic changes to their SDK and client that render most of the training (fine-tuning) and inference code unusable. There is no "beam run" command anymore, and so on...
…
-
## 🐛 Bug
Not sure if this is a feature request or a bug. I took the [SPMD Gemma fine-tuning code from Hugging Face](https://huggingface.co/google/gemma-7b/blob/main/examples/example_fsdp.py) and tried to run …
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
None
### OS Platform and Distribution
iOS
### MediaPipe Tasks SDK version
_No response_
### Task…
-
I tried to use CTranslate2 as the inference framework for model inference, but it failed with the error below:
"axis 2 has dimension 8192 but expected 7680"
What I've done:
1. First I must con…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as…
-
### Describe the bug
1) Your UI's initial state looks like this (perfect):
2) Whenever you open the additional inputs and then close them, the height ends up like this:
In terms of UX, this is not functional.
…
-
I tried to reproduce your Gemma-2B reward model training again and found that the reward model architecture fine-tuned with internlm2 has an output head of size 1. I downloaded your GRM-Gemma-2B-Sftrug re…
-
Hello Authors,
Thank you for your incredible work and the comprehensive experiments presented in the paper.
I have a question regarding the implementation of attacks. Specifically, some attacks,…
-
Since the latest models, such as Llama 3 and Gemma, adopt extremely large vocabularies (128-256K tokens), the logits tensor can become very large and consume a substantial proportion of VRAM. For example, the foll…
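The original example above is truncated, so here is a back-of-envelope sketch of the point being made. The batch size, sequence length, and fp32 logits dtype below are illustrative assumptions; the 128,256-token vocabulary is Llama 3's:

```python
# Rough estimate of the memory held by a [batch, seq_len, vocab] logits tensor.
# The batch size, sequence length, and dtype width are assumed for illustration.

def logits_bytes(batch: int, seq_len: int, vocab: int, bytes_per_elem: int = 4) -> int:
    """Bytes occupied by a dense logits tensor of shape [batch, seq_len, vocab]."""
    return batch * seq_len * vocab * bytes_per_elem

# Llama 3's 128,256-token vocabulary, an 8K context, and fp32 logits:
gib = logits_bytes(batch=1, seq_len=8192, vocab=128256) / 2**30
print(f"{gib:.1f} GiB")  # → 3.9 GiB for the logits alone
```

This is why frameworks often compute logits only for the final position during generation, or chunk the loss computation during training, rather than materializing the full sequence-length logits tensor.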
-
Compile the database of learning materials so that it contains longer and more substantial materials.