dropout Search Results - Githubissues

1000+ results
for dropout

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

THUDM/ChatGLM-6B #1445

[BUG/Help] <title>ptuning 出现这种异常该如何处理

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior ChatGLMTokenizer(name_or_path='THUDM/chatglm-6b', vocab_size=64794, model_max_length=1000…

brianzhangrong updated 2 days ago
3
wagiminator/ATtiny85-USB-C-Tester #1

Dropout voltage of 78l05

The dropout voltage of 78l05 is 1.7v, so I'm curious will the circuit work when input voltage is also 5v from usb? The input is 5v and 78l05's output is also 5v. From my understanding, the LDO won'…

jxltom updated 2 years ago
1
fastmachinelearning/hls4ml #1048

FPGA Output is Zero in CNN model with 8,512 parameters.

I have a CNN model. I used the hls4ml and all file and bitfile generated completely. Now I used the deployment code to implement on FPGA(ZCU104), the prediction output of FPGA is always Zero. **Tot…

zsrabbani updated 2 months ago
7
kohya-ss/sd-scripts #1485

Lora extract >512x512 fails RuntimeError: quantile() input t…

Select input model, base model (analog madness v7) in this case. It works at 512x512 on cpu/gpu, larger throws: INFO UNet2DConditionModel: 64, 8, 768, False, False …

SarahPeterson2854 updated 3 months ago
1
josephjaspers/blackcat_tensors #70

could support Dropout layer?

It's easy to overfit, so add some dropout layer could solved this problem?

xinsuinizhuan updated 2 years ago
4
pytorch/pytorch #95290

Continuous dropout layer

### 🚀 The feature, motivation and pitch Hello! I am working on information theory application for neural networks [(here)](https://openreview.net/forum?id=bQB6qozaBw). With my research I show tha…

link-er updated 1 year ago
3
togethercomputer/stripedhyena #20

flash attention not compatible?

When I try to train a stripedhyena model I keep getting issues with the stripedhyena modules seemingly trying to import modules from Flash Attention in an outdated way. example: AttributeError: mod…

oxPJ updated 3 months ago
1
QwenLM/Qwen2-VL #280

Qwen2-vl 8张 48G 的显卡，启动每有报现存不够，但是推理图片报cuda out of memory？显存远远…

modeling_qwen2_vl.py", line 350, in forward attn_output = F.scaled_dot_product_attention(q, k, v, attention_mask, dropout_p=0.0) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 122…

cqray1990 updated 2 weeks ago
5
unslothai/unsloth #1037

Fine tune and infer llama3 with cpu

import logging import os import json import torch from datasets import load_from_disk from transformers import TrainingArguments from trl import SFTTrainer from unsloth import FastLanguageModel…

SidneyLann updated 1 week ago
18
GANGREEK/TVA-GAN #1

Test results not as expected

Good afternoon everyone, I trained the TVA GAN model for 200 epochs using the same parameters and using 1072 images for train (trainA -> thermal and trainB ->visual) and 460 images for validation fr…

CeliaSgUVa updated 1 week ago
11

上一页 1...22 23 24 25 26 27 28...100 下一页

1000+ results for dropout

1000+ results
for dropout