-
Thanks for your awesome work. I have some questions about the concept of initial tokens and the implementation of learnable initial tokens.
1. In Fig.2, you reach the conclusion that existing LLMs …
-
Please bear with me here.
This might be hard to follow for some, so I'm adding pseudocode to illustrate what's unclear to me. I've been following a tutorial, and it was mentioned that …
-
Hello, we are running experiments on GigaSpeech with the pretrained models. One of the experiments is to inspect and compare the oracle WER of CTC vs. transducers. Here is what we get:
CTC (decoding …
-
Can somebody tell me what type of attention is used in this library? I checked it against the Bahdanau and Luong attentions and it doesn't look like either, or maybe I'm missing something!
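For context on the comparison: the two named variants differ mainly in how the alignment score between the decoder query and each encoder key is computed. A minimal NumPy sketch of both scoring functions (all weight shapes here are illustrative assumptions, not this library's actual implementation):

```python
import numpy as np

def bahdanau_score(query, keys, W_q, W_k, v):
    # Additive (Bahdanau) attention: score_i = v^T tanh(W_q q + W_k k_i)
    return np.tanh(query @ W_q + keys @ W_k) @ v

def luong_score(query, keys):
    # Multiplicative (Luong, "dot" variant): score_i = q^T k_i
    return keys @ query

rng = np.random.default_rng(0)
d = 4                                  # hidden size (illustrative)
query = rng.standard_normal(d)         # decoder state
keys = rng.standard_normal((5, d))     # 5 encoder states
W_q = rng.standard_normal((d, d))
W_k = rng.standard_normal((d, d))
v = rng.standard_normal(d)

for score in (bahdanau_score(query, keys, W_q, W_k, v),
              luong_score(query, keys)):
    weights = np.exp(score - score.max())
    weights /= weights.sum()           # softmax over encoder positions
    context = weights @ keys           # attention-weighted sum of keys
    print(context.shape)
```

If a library's scoring step matches neither form, it may be using scaled dot-product attention (dot product divided by sqrt(d)), which is a third common variant.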
-
As developers we depend on the Grafana plugin repository for the discoverability of our plugins. Other projects, e.g. node-red, have found less restrictive plugin mechanisms by utilising e.g. `npm` and…
-
Hello, author. I sincerely hope you can answer me when you see this.
I urgently want to understand why Q, K, and V are given as inputs to multi-head attention and then fed into the three linear layers of each h…
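The structure being asked about is the standard one from the Transformer: Q, K, and V all come from the same input, and each head applies its own learned linear projections before scaled dot-product attention. A minimal NumPy sketch (random placeholder weights, shapes are illustrative assumptions):

```python
import numpy as np

def multi_head_attention(x, num_heads=2, seed=0):
    """Sketch of multi-head self-attention.

    Each head has its own three linear projections W_q, W_k, W_v, so the
    heads can attend to different subspaces of the same input x.
    """
    rng = np.random.default_rng(seed)
    seq_len, d_model = x.shape
    d_k = d_model // num_heads
    head_outputs = []
    for _ in range(num_heads):
        # The "three linears" per head: separate projections for Q, K, V.
        W_q = rng.standard_normal((d_model, d_k))
        W_k = rng.standard_normal((d_model, d_k))
        W_v = rng.standard_normal((d_model, d_k))
        Q, K, V = x @ W_q, x @ W_k, x @ W_v
        scores = Q @ K.T / np.sqrt(d_k)   # scaled dot-product scores
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
        head_outputs.append(weights @ V)
    # Concatenate the heads, then mix them with a final output projection.
    W_o = rng.standard_normal((d_model, d_model))
    return np.concatenate(head_outputs, axis=-1) @ W_o

x = np.random.default_rng(1).standard_normal((3, 8))  # (seq_len, d_model)
print(multi_head_attention(x).shape)  # (3, 8)
```

The separate projections are what distinguish the roles: W_q decides what each position is looking for, W_k what it offers for matching, and W_v what content it contributes to the weighted sum.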
-
## Jiphyeonjeon Intermediate Study Group
- Sunday, June 19, 2022, 9:00
- Presented by Lee Young-bin, Song Yi-hyun, Won Jae-sung, and Park Se-hoon
- Paper link: https://arxiv.org/abs/1706.03762
> ### Abstract
> The dominant sequence transduction models are based on complex recurrent…
-
In Shiny documents, it's possible to refer to external images using traditional Markdown syntax, but most other references to external resources don't work: for example ``, ``, and `` don't work, as d…
-
input :
Let's imagine a self-hosted, self-modifying, self-improving, self-expanding system based on GNU/Linux, Erlang, and Elixir, using langchainex to improve itself and using Coq to prove the improvements are c…
-
[ClaimListener](https://github.com/Ultracrepidarianism/Kingdom/blob/65e5a768b8bde10a60d0f2c7ecd583f271984cd0/src/main/java/ca/ultracrepidarianism/kingdom/listener/ClaimListener.java#L73C34-L73C34)
…