layer-normalization Search Results

1000+ results
for layer-normalization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/TransformerEngine #953

NaN loss issues when I switch to the Transformer Engine Tran…

**Summary** I'm hitting a NaN loss issue when I use the TransformerLayer in place of a pytorch transformer layer I wrote. **Details** I'm using the nvcr.io/nvidia/pytorch:24.04-py3 docker cont…

jasonkrone updated 2 months ago
1
microsoft/vscode-jupyter #15934

Breakpoint in Notebook Cell doesn't stop when VSCode is runn…

### Applies To - [X] Notebooks (.ipynb files) - [ ] Interactive Window and\/or Cell Scripts (.py files with \#%% markers) ### What happened? I'm facing is the same issue reported in https://github.…

jersonchua updated 1 month ago
3
HongtaoYang/DAC-tensorflow #2

About the parameters sensitivity

@HongtaoYang , I am very grateful for your source code! However, I have found that your implementation is very sensitive to the parameters of the network, such as : - In the batch_normalization lay…

flexibility2 updated 5 years ago
1
Lasagne/Lasagne #577

Batch Normalization for RNN

The paper: [Deep Speech 2](http://arxiv.org/abs/1512.02595) used _sequence-wise_ normalization for recurrent computation which was proved to substantially improves final generalization error while gre…

trungnt13 updated 7 years ago
27
jeongjae96/AI #1

CS231n Self-study

### What? - [CS231n: Deep Learning for Computer Vision](http://cs231n.stanford.edu/) assignments self-study

jeongjae96 updated 1 year ago
2
wanggrun/Adaptively-Connected-Neural-Networks #1

Question about paper and implementation

Hi! Thanks for sharing your code. After I read your paper, I found this idea is very interesting and it is a little like SN(switchable normalization). I have a little question about your paper and im…

lxtGH updated 4 years ago
6
onnx/keras-onnx #279

AttributeError: The layer has never been called and thus has…

``` from tensorflow.keras import backend as K from tensorflow.keras.models import model_from_json import tensorflow as tf import keras2onnx sess = tf.Session() K.set_session(sess) K.set_learn…

sonfire186 updated 4 years ago
5
NVIDIA/TensorRT-LLM #1982

gptSessionBenchmark failed due to invalid OptProfilerSelecto…

### System Info GPU: H20 server CUDA Version: 12.5 Driver: 555.42.02 TRTLLM Commit: 2d234357c6e69fa514f6e9b4d4a5ad3bc431c4a6 built from source on linux ### Who can help? _No response_ …

ZJLi2013 updated 1 week ago
4
WongKinYiu/yolov7 #975

What's the recommended way of local custom model inference -…

I plan to use a custom trained model in a local environment without network access. What's the best way to inference saved model -via `model = torch.hub.load(...)` or `model = attempt_load('…

OleksiiYeromenko updated 1 year ago
1
feihuzhang/DSMNet #3

aobut Figure3 in the paper

Hello, I read your paper, and get a little a little confused about Figure 3 in it. What does the y axis (1,2,3,4,5) of Figure 3 refer to? After Domain transform,the norm of the C-channels feature is n…

xubin1994 updated 4 years ago
1

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for layer-normalization

1000+ results
for layer-normalization