-
Hi, it seems that the same code **works fine with the Megatron-LM that I git-cloned in April. With the latest Megatron-LM, the following error is raised by the pretrain_gpt.py code. …
-
### Metadata
Authors: Marek Rei and Anders Søgaard
Organization: University of Cambridge & University of Copenhagen
Conference: NAACL 2018
Paper: https://arxiv.org/pdf/1805.02214.pdf
Code: https:…
-
> … [programmers might finally have the decency to pay attention to the document formats that the other 99% of the human race prefers](https://github.com/swcarpentry/modern-scientific-authoring/blob/4…
wking updated 8 years ago
-
I am running the pretraining code the way you suggested, but it has been stuck at this point for 2 hours now. Is it supposed to take this long?
```console
neilpaul77@NeilRig77:~/Downloads/ntua-slp-…
```
-
@alasdairtran Hi, I have read your newly published paper. I'm curious how LSTM+GloVe+IA encodes articles: does it encode each article at the sentence level or at the word level?
-
Hi,
I am a researcher studying EEG-to-Text. I recently read your NeuSpeech paper. I was impressed by it, and it has been a great help to my research direction. Thanks. But I have some quest…
-
I am using the `intfloat/e5-mistral-7b-instruct` model to get the last hidden state for my input and compute cosine similarity.
I am using the toy example provided at: https://huggingface.co/intfloat/e5-mist…
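For reference, the similarity step itself is independent of the model. Here is a minimal sketch of cosine similarity between two embedding vectors, with NumPy arrays standing in for pooled hidden states (the vectors `query_emb` and `doc_emb` below are hypothetical placeholders, not real model output):

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two 1-D embedding vectors."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    # Dot product of the vectors divided by the product of their norms.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical stand-ins for pooled last-hidden-state embeddings.
query_emb = np.array([0.1, 0.3, -0.2, 0.7])
doc_emb = np.array([0.2, 0.1, -0.1, 0.9])
print(cosine_similarity(query_emb, doc_emb))
```

Note that if the embeddings are L2-normalized first (as many retrieval recipes do), cosine similarity reduces to a plain dot product.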
-
This is maybe a trivial question, but I'm completely new to Torch; I tried searching on Google but had no luck. I'm working on an Ubuntu 14.04 machine with CUDA 7.0 and cuDNN R4. I prepared all traini…
-
Thanks for the great code. I encountered an issue when using GroundingDINO (or maybe it is just expected behavior?).
If I use a long word, like 'pottedplant', it is tokenized into several sub-words.
wh…
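To illustrate the splitting behavior without loading the real tokenizer, here is a sketch of greedy longest-match-first (WordPiece-style) subword splitting over a toy vocabulary. The vocabulary below is invented for illustration; the actual BERT vocabulary used by GroundingDINO may split 'pottedplant' differently:

```python
def wordpiece_split(word, vocab):
    """Greedy longest-match-first subword splitting (WordPiece-style)."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        # Try the longest remaining substring first, shrinking until a match.
        while end > start:
            sub = word[start:end]
            if start > 0:
                sub = "##" + sub  # continuation pieces carry a '##' prefix
            if sub in vocab:
                piece = sub
                break
            end -= 1
        if piece is None:
            return ["[UNK]"]  # no piece matches: the whole word is unknown
        pieces.append(piece)
        start = end
    return pieces

# Toy vocabulary, invented for this example.
toy_vocab = {"potted", "plant", "##plant", "pot", "##ted"}
print(wordpiece_split("pottedplant", toy_vocab))  # ['potted', '##plant']
```

This is why a single text prompt token like 'pottedplant' can map to multiple token positions, which matters when aligning per-token logits back to phrases.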
-
Let's write down some of our takeaways from "Attention Is All You Need", and then one of us can collate them into a single document to put into this repo, so that we can remind ourselves when we forget. …
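One takeaway worth recording up front is the paper's core operation. A minimal sketch of scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ/√d_k)V, with random matrices standing in for projected queries, keys, and values:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (n_q, n_k) similarity logits
    scores -= scores.max(axis=-1, keepdims=True)  # subtract max for stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V                             # weighted sum of values

# Stand-in projections; shapes chosen for illustration only.
rng = np.random.default_rng(0)
Q = rng.standard_normal((2, 4))
K = rng.standard_normal((3, 4))
V = rng.standard_normal((3, 4))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (2, 4)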