-
# speech recognition
- Soltau, Hagen, Hank Liao, and Hasim Sak. "Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition." arXiv preprint arXiv:1610.09975 (201…
-
Good understanding of deep learning architectures like Multi-Layer Perceptrons (MLPs), Recurrent Neural Networks (RNNs), Long Short-Term Memory models (LSTMs), Gated Recurrent Units (GRUs), and Convolutional …
-
I'm really interested in your great work. Just curious: is it possible to combine BART with loss truncation? The vanilla LSTM with attention is somewhat out of date.
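For what it's worth, here is a minimal sketch of how loss truncation could be bolted onto a BART fine-tuning loss, assuming the Hugging Face `transformers` API; the `truncated_loss` helper and `drop_frac` parameter are illustrative, not taken from the loss-truncation repository.
```python
import torch
import torch.nn.functional as F
from transformers import BartForConditionalGeneration

# Hypothetical sketch: per-example loss with BART, then drop the highest-loss
# fraction of the batch (the core idea of loss truncation).
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

def truncated_loss(input_ids, attention_mask, labels, drop_frac=0.1):
    outputs = model(input_ids=input_ids, attention_mask=attention_mask, labels=labels)
    logits = outputs.logits  # (batch, seq_len, vocab)
    # Recompute the loss per example instead of using the batch-averaged loss.
    per_token = F.cross_entropy(
        logits.transpose(1, 2), labels, reduction="none", ignore_index=-100
    )  # (batch, seq_len)
    mask = (labels != -100).float()
    per_example = (per_token * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1)
    # Keep only the (1 - drop_frac) lowest-loss examples in the batch.
    keep = max(1, int(per_example.size(0) * (1 - drop_frac)))
    kept_losses, _ = torch.topk(per_example, keep, largest=False)
    return kept_losses.mean()
```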
-
Hi,
I recently trained an LSTM-with-attention model using the following hparams:
```
python3.6 -m nmt.nmt \
--attention=luong \
--src=r --tgt=p \
--vocab_prefix=/home/hisham/nmt…
```
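For readers unfamiliar with the `--attention=luong` flag, below is a minimal sketch of Luong-style multiplicative ("general") attention; the shapes and class name are illustrative, and this is not the TensorFlow NMT implementation itself.
```python
import torch
import torch.nn as nn

class LuongAttention(nn.Module):
    """Multiplicative attention: score(h_t, h_s) = h_t^T W h_s."""
    def __init__(self, hidden_size):
        super().__init__()
        self.W = nn.Linear(hidden_size, hidden_size, bias=False)

    def forward(self, decoder_state, encoder_outputs):
        # decoder_state: (batch, hidden), encoder_outputs: (batch, src_len, hidden)
        scores = torch.bmm(self.W(encoder_outputs), decoder_state.unsqueeze(2))  # (batch, src_len, 1)
        weights = torch.softmax(scores.squeeze(2), dim=1)                        # (batch, src_len)
        context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)    # (batch, hidden)
        return context, weights
```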
-
@slundberg I have a question. To test the attention mechanism, I fix the tenth column of the input X to be equal to y. This is my code:
```
def get_lstm_data(n, time_steps, input_dim, attention_col=10):
    x = np…
```
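For context, a plausible completion of such a generator might look like the sketch below (random inputs with the `attention_col` time step overwritten by the label, so a working attention mechanism should focus on that step); this is an assumption, not necessarily the original code.
```python
import numpy as np

def get_lstm_data(n, time_steps, input_dim, attention_col=10):
    # Random inputs; the column at attention_col is overwritten with the label.
    x = np.random.standard_normal(size=(n, time_steps, input_dim))
    y = np.random.randint(low=0, high=2, size=(n, 1))
    x[:, attention_col, :] = np.tile(y, (1, input_dim))
    return x, y
```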
-
Hi, I am new to the attention mechanism and I found your code and tutorials very helpful for beginners like me!
Currently, I am trying to use your attention decoder to do sentiment analysis of the…
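In case it helps frame the question, a minimal sketch of attention pooling over LSTM outputs for binary sentiment classification might look like this; it is a generic setup, not the repository's attention decoder.
```python
import torch
import torch.nn as nn

class AttentionSentimentClassifier(nn.Module):
    def __init__(self, vocab_size, embed_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.attn = nn.Linear(hidden_dim, 1)
        self.out = nn.Linear(hidden_dim, 2)

    def forward(self, tokens):
        outputs, _ = self.lstm(self.embed(tokens))          # (batch, seq, hidden)
        weights = torch.softmax(self.attn(outputs), dim=1)  # (batch, seq, 1)
        context = (weights * outputs).sum(dim=1)            # (batch, hidden)
        return self.out(context)                            # (batch, 2) logits
```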
-
Hi,
It seems from the source code that XLM-RoBERTa is fine-tuned with gradient updates based on the LSTM attention model. However, when I follow the README instructions and train the model on hi…
-
This is a feature request for saving off the embeddings of the metamodels (a sketch of one possible approach follows the list). Here is a list of all the deep learning metamodels:
- dna_regression
- lstm
- daglstm_regression
- hidden_daglstm_reg…
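As a possible starting point, the sketch below shows one way to capture and save embeddings with a PyTorch forward hook; the `TinyLSTMModel` stand-in and layer names are hypothetical, since the metamodels' internals are not shown here.
```python
import torch
import torch.nn as nn

class TinyLSTMModel(nn.Module):
    """Stand-in for an LSTM metamodel with an embedding layer."""
    def __init__(self, vocab_size=1000, embed_dim=64, hidden_dim=128):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, 1)

    def forward(self, tokens):
        out, _ = self.lstm(self.embedding(tokens))
        return self.head(out[:, -1])

model = TinyLSTMModel()
captured = []
# Forward hook captures the embedding layer's output during the forward pass.
handle = model.embedding.register_forward_hook(
    lambda module, inputs, output: captured.append(output.detach().cpu())
)
model(torch.randint(0, 1000, (4, 20)))
handle.remove()
torch.save(torch.cat(captured), "embeddings.pt")
```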
-
Hi,
The optimization didn't work, so I just added the line `metrics_callback.on_validation_end(trainer)` after line 209.
I also modified the class:
class MetricsCallback(Callback…
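For context, a minimal sketch of such a callback, assuming the common PyTorch Lightning pattern of storing `trainer.callback_metrics` on validation end (not necessarily the original class):
```python
from pytorch_lightning import Callback

class MetricsCallback(Callback):
    """Collects the logged metrics after each validation run."""
    def __init__(self):
        super().__init__()
        self.metrics = []

    def on_validation_end(self, trainer, pl_module):
        # Store a copy of whatever has been logged so far.
        self.metrics.append(dict(trainer.callback_metrics))
```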
-
As mentioned in the paper, the attention is supposed to be applied to the hidden states of the LSTM, but in the code it is applied to the outputs instead of the hidden states. Why is that?
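One possible explanation, assuming a PyTorch-style LSTM API: the per-step `output` tensor already contains the top-layer hidden state at every time step, while `h_n` holds only the final step, so attending over the outputs is attending over the per-step hidden states. A small check:
```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
x = torch.randn(2, 5, 8)                 # (batch, time, features)
output, (h_n, c_n) = lstm(x)

# `output` holds the top-layer hidden state at every time step;
# `h_n` holds only the final time step, so the two agree at t = -1.
print(torch.allclose(output[:, -1], h_n[-1]))  # True
```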