gumbel-softmax Search Results

403 results for gumbel-softmax

AIDC-AI/Ovis #5

About the equivalence and a slightly more complex MLP connec…

Thank you for sharing the dataset and open-source model. Ovis employed VE + Head + Tokenize (essentially a softmax) and subsequently obtained the same hidden dimension features for the LLM. I remain …

lucasjinreal updated 1 month ago
11
microsoft/nni #3814

[Retiarii] migrate FBNet from NAS 1.0 to Retiarii framework

**What would you like to be added**: migrate FBNet from NAS 1.0 to Retiarii framework **Why is this needed**: NAS 1.0 will be deprecated **Without this feature, how does current nni work**…

QuanluZhang updated 3 years ago
21
arcee-ai/mergekit #294

Idea: Scaling the Down-Projection Matrix in 'Mixture of Expe…

## Problem In a Mixture of Experts (MoE) LLM, the gating network outputs a categorical distribution of $n$ values (chosen from $n_{max}$), which is then used to create a convex combination of the $n$…

jukofyork updated 4 months ago
7
Theano/Theano #5685

Problem with grad_overrides in OpFromGraph

There seems to be a problem with OpFromGraph when the user's gradient function uses a variable which doesn't belong to the main graph. Here's a minimal sample to recreate the bug: ``` import thean…

mtomassoli updated 6 years ago
12
Future-Power-Networks/MAPDN #34

Multiple actions per agent in the distributed mode

In the `distributed` mode, each agent is responsible for controlling one generator. In your case, one agent has only one action. If I want to have multiple actions per agent, what changes should I make?

kosmylo updated 1 day ago
3
howardyclo/papernotes #21

On Accurate Evaluation of GANs for Language Generation

### Metadata Authors: Stanislau Semeniuta, Aliaksei Severyn, Sylvain Gelly Organization: Google AI Conference: NIPS 2018 Paper: https://arxiv.org/pdf/1806.04936.pdf

howardyclo updated 5 years ago
3
fra31/auto-attack #58

Add [Stochastic LWTA]

**Paper**: Local Competition and Stochasticity for Adversarial Robustness in Deep Learning (http://proceedings.mlr.press/v130/panousis21a) **Venue**: International Conference on Artificial Intellig…

konpanousis updated 3 years ago
11
pytorch/pytorch #127749

Gumbel Vector Quantizer produces NaN when using with torch.c…

### 🐛 Describe the bug ## Minimum reproduction ```python import torch.nn.functional as F import torch from torch import nn class GumbelVectorQuantizer(nn.Module): def __init__(self): …

gau-nernst updated 2 months ago
15
EricGuo5513/momask-codes #41

Gumbel Softmax in Quantizer

Thank you for your amazing work. I just want to make sure I understand the code correctly. Gumbel sampling is not necessary here; the argmin version (line 76) will give exactly the same result, co…

exitudio updated 4 months ago
2
lucidrains/DALLE-pytorch #10

Results

I have trained DiscreteVAE on 128x128 [FFHQ dataset](https://www.kaggle.com/greatgamedota/ffhq-face-data-set) using this configuration: ``` vae = DiscreteVAE( num_layers = 2, num_tokens = …

NaxAlpha updated 2 years ago
57
