retnet Search Results - Githubissues

165 results
for retnet

Best match


microsoft/torchscale #79

about gamma/decay in RetNet

Hello, could someone enlighten me on the rationale behind this line of code, i.e. why "1 - 2 ** (-5 -" etc.? Thank you. https://github.com/microsoft/torchscale/blob/881d03079da7b0c52ba0a473…

rouniuyizu updated 11 months ago
2
microsoft/torchscale #77

Chunk recurrent representation incorrect results

I believe there is some kind of normalization mistake in chunk recurrent retention. Its output does not match the output of the simple recurrent and parallel retention. Recurrent retention also …

N0r9st updated 11 months ago
7
syncdoth/RetNet #10

Changelog of official implementation

Thanks for the well-written package! The official RetNet implementation has had several updates; see https://github.com/microsoft/unilm/blob/master/retnet/README.md#changelog

donglixp updated 11 months ago
5
fkodom/yet-another-retnet #17

Running benchmark_inference on the CPU

I am running scripts/benchmark_inference on the CPU (on an M2 Mac running macOS Ventura). There are several issues with the code. Could you please run the code on the CPU with a version of Torch which does…

erlebach updated 1 year ago
1
microsoft/torchscale #80

Question about RetNetRelPos

In the RetNet code at https://github.com/microsoft/torchscale/blob/main/torchscale/architecture/retnet.py#L25, `inv_freq` (called `angle` in this code) is created using `torch.linspace(0, 1, dim/2)`. But…

hyunwoongko updated 11 months ago
2
syncdoth/RetNet #31

Initialize word embedding layer

I was training a RetNet model using your codebase, but I found that the word embedding layers are not initialized, so the loss scale was very poor (the 7B model's initial loss was 3000+). I think we need…

hyunwoongko updated 11 months ago
7
fkodom/yet-another-retnet #13

Invalid precision when running train_project_gutenberg

I am running on a Mac and like the clarity of your code. My device is 'cpu', since I don't have CUDA on a Mac notebook. Running `retnet.py` works fine. However, when running `train_project_gutenbur…

erlebach updated 1 year ago
2
fkodom/yet-another-retnet #9

Have you ever tried RetNet for vision tasks?

Hi, thank you for your great work. The RetNet version you provided is the clearest and easiest to understand that I have seen. Have you ever tried using the RetNet module for visual t…

cnyvfang updated 1 year ago
4
Hannibal046/nanoRWKV #8

Will RetNet still be supported in the future?

https://www.zhihu.com/question/612761391/answer/3128755930 I saw this answer on Zhihu. Will RetNet still be supported going forward?

BrightXiaoHan updated 1 year ago
1
berlino/seq_icl #10

How to reproduce Figure 1?

Hi authors, thanks for the great work. I am very interested in it and would like to give RegBench and those sequence models a try. I wonder if you could further elaborate on how to repro…

Cranial-XIX updated 9 months ago
2
