-
### Anything you want to discuss about vllm.
Within vllm/attention/ops/triton_flash_attention.py, we don't need the dropout, philox_, etc. machinery.
We should consider cleaning it up for code simplicity.
#…
-
Hi,
Great code and thanks for sharing :). How do you run the model with dropout? It doesn't seem to actually be implemented in the training process.
-Peter
-
Hi,
In the "dropout from scratch" chapter, there is no significant difference between adding and not adding dropout. See the metrics below:
with drop_prob=0.5:
Epoch 0. Loss: 0.7281168998689048, T…
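For context, here is a minimal NumPy sketch of the inverted-dropout scheme that a "dropout from scratch" chapter typically implements (my sketch, not the book's code):

```python
import numpy as np

def dropout(x, drop_prob, training=True, rng=None):
    """Inverted dropout: zero each element with probability drop_prob and
    scale the survivors by 1/(1 - drop_prob) so the expected activation is
    unchanged; at inference time the input passes through untouched."""
    if not training or drop_prob == 0.0:
        return x
    rng = rng if rng is not None else np.random.default_rng()
    mask = rng.random(x.shape) >= drop_prob  # keep with probability 1 - drop_prob
    return x * mask / (1.0 - drop_prob)
```

Whether drop_prob=0.5 helps at all depends heavily on model capacity and dataset size, which could explain near-identical metrics with and without it.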
-
I tried training from scratch as explained in the README.
```
Training / Fine-tuning
pip install deepspeed==0.7.0
pip install pytorch-lightning==1.9.5
# torch 1.13.1+cu117
NOTE: add weight de…
-
I'm not sure Issues is the best place to post this, but I just wanted to see if anyone else had been trying this idea:
There was [a paper that came out recently](https://arxiv.org/abs/2410.05258…
-
I've revised settings.py as below, but when running marker or marker_single, it still runs in CPU mode.
---- revision ----
line 10:
class Settings(BaseSettings):
# General
TORCH_DEV…
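As an aside, pydantic-style BaseSettings classes usually resolve environment variables ahead of in-file defaults, so the device can often be switched without editing the file at all. A pure-Python stand-in for that resolution order (not marker's actual code):

```python
import os

# Hypothetical stand-in for a BaseSettings-style field: the environment
# variable wins over the in-file default, so exporting TORCH_DEVICE=cuda
# before launching may be enough -- no edit to settings.py needed.
class Settings:
    def __init__(self):
        self.TORCH_DEVICE = os.environ.get("TORCH_DEVICE", "cpu")

os.environ["TORCH_DEVICE"] = "cuda"
print(Settings().TORCH_DEVICE)  # the env var overrides the "cpu" default
```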
-
You define a dropout_rate hyperparameter but don't seem to use it anywhere. What's that about?
-
The paper mentions
> "Dropout [9] was applied after each IndRNN layer with a dropping probability of 0.25 and 0.1 for CS and CV settings, respectively."
and
> "Dropout [9] with a droppin…
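In code, "dropout after each IndRNN layer" just interleaves a dropout step with the stacked layers. A NumPy stand-in (the layer internals here are placeholders, not a real IndRNN):

```python
import numpy as np

def dropout(x, p, rng):
    # Inverted dropout: drop with probability p, rescale survivors.
    mask = rng.random(x.shape) >= p
    return x * mask / (1.0 - p)

def forward(x, rng, num_layers=2, p=0.25):
    """Apply a placeholder layer, then dropout, for each stacked layer --
    mirroring "dropout after each IndRNN layer" from the quote (p=0.25
    matching the paper's CS setting)."""
    for _ in range(num_layers):
        x = np.tanh(x)          # stand-in for an IndRNN layer
        x = dropout(x, p, rng)  # dropout applied after the layer
    return x
```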
-
It's accepted that sometimes our solutions will lose their data connection for a few seconds.
In most Grafana dashboards, we add either
` |> aggregateWindow(every: limited_window, fn: mean, createEmpty:…
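Conceptually, `aggregateWindow(every: w, fn: mean, createEmpty: true)` buckets points into fixed windows, averages each bucket, and emits an empty row for windows with no data (the connection-loss gaps). A Python stand-in for that behavior (not Flux itself; `limited_window` above is the dashboard's own variable):

```python
def aggregate_window(points, every):
    """points: sorted list of (timestamp, value).
    Returns one (window_start, mean-or-None) pair per window, including
    empty windows (None), like createEmpty: true."""
    if not points:
        return []
    buckets = {}
    for t, v in points:
        buckets.setdefault(t - t % every, []).append(v)
    out = []
    w = points[0][0] - points[0][0] % every
    while w <= points[-1][0]:
        vals = buckets.get(w)
        out.append((w, sum(vals) / len(vals) if vals else None))
        w += every
    return out
```

With `createEmpty: true` the gap shows up as an explicit null row, which dashboards can then render as a break in the line rather than silently interpolating across it.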
-
[Paper](https://www.cs.toronto.edu/~hinton/absps/JMLRdropout.pdf)
[PyTorch implementation](https://github.com/pytorch/pytorch/blob/master/torch/nn/_functions/dropout.py)
Assigned to @americast a…