retnet Search Results - Githubissues

160 results
for retnet

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/torchscale #52

Training & Inference examples for RetNet

Could you provide some Training & Inference examples for RetNet?

jhl-Det updated 1 year ago
1
Jamie-Stirling/RetNet #22

Q, k and D device difference

https://github.com/Jamie-Stirling/RetNet/blob/2acf026fc8435635051149d9bef793cae7f3d7af/src/retention.py#L45 Q and K are put onto any device because they are model parameters, while D is created in …

leffff updated 1 year ago
1
fkodom/yet-another-retnet #3

Changelog of official implementation

Thanks for the well-written package! The RetNet's official implementation had several updates at https://github.com/microsoft/unilm/blob/master/retnet/README.md#changelog .

donglixp updated 1 year ago
1
fkodom/yet-another-retnet #6

About activation function

https://github.com/fkodom/yet-another-retnet/blob/ee3979c7535b9f79a3020cb098d6b97f143bcd22/yet_another_retnet/retention.py#L16 I think this line should be F.silu rather than F.relu. Thanks for r…

Dongyeongkim updated 1 year ago
2
Jamie-Stirling/RetNet #10

Chunkwise retention giving different output

The implementation of chunkwise retention paradigm on the [chunkwise-real](/Jamie-Stirling/RetNet/tree/chunkwise-real) branch gives different outputs to the other two paradigms. It appears there ma…

Jamie-Stirling updated 1 year ago
4
microsoft/torchscale #58

Question about is_first_step and Retnet

In the code when `is_first_step` is `True` then activate_recurrent is set to `False` here: https://github.com/microsoft/torchscale/blob/main/torchscale/architecture/retnet.py#L362 I was wonderin…

tdomhan updated 1 year ago
2
microsoft/torchscale #42

the meaning of "incremental_state" in RetNet

Hi there~, Thanks for your great work RetNet. i have encountered a problem when I try to define "incremental_state". Could you provide me some usage about it or explain more? Thanks, Best regards.

jhl-Det updated 1 year ago
3
syncdoth/RetNet #11

ValueError: not enough values to unpack (expected 2, got 1)

Hey, Thank you for this great work! An error occurred when I used the model to generate text

pathoncyp updated 1 year ago
3
fkodom/yet-another-retnet #1

Throughput measurements of parallel and recurrence methods

Hi @fkodom , Thank you so much for sharing this work with the research community. I have one question please, I measure the throughput in the inference and it seems that the parallel method has …

Amshaker updated 1 year ago
3
syncdoth/RetNet #12

How to load my own model

I trained a model using train.py and got the checkpoint folder, how do I load this model for inference?

zhihui-shao updated 1 year ago
1

上一页 1...10 11 12 13 14 15 16...16 下一页

160 results for retnet

160 results
for retnet