retnet Search Results - Githubissues

160 results for retnet

bkarab03/PersonalNanoGPTProject #1

Changelog of official implementation

Thanks for the well-written package! The RetNet's official implementation had several updates at https://github.com/microsoft/unilm/blob/master/retnet/README.md#changelog .

donglixp updated 1 year ago
1
AkihikoWatanabe/paper_notes #889

Retentive Network: A Successor to Transformer for Large Lang…

# URL - https://arxiv.org/abs/2307.08621 # Affiliations - Yutao Sun, N/A - Li Dong, N/A - Shaohan Huang, N/A - Shuming Ma, N/A - Yuqing Xia, N/A - Jilong Xue, N/A - Jianyong Wang, N/A …

AkihikoWatanabe updated 5 months ago
1
facebookresearch/Detectron #703

How can I use softmax loss to train retinanet?

I'd like to use SoftmaxWithLoss instead of SoftmaxFocalLoss. `gated_prob, cls_focal_loss = model.net.SoftmaxWithLoss( [cls_lvl_logits, 'retnet_cls_labels_' + suffix], ['retnet_prob_{}'.fo…

mytxgmy updated 5 years ago
1
Jamie-Stirling/RetNet #18

Changelog of official implementation

Thanks for the well-written package! The RetNet's official implementation had several updates at https://github.com/microsoft/unilm/blob/master/retnet/README.md#changelog .

donglixp updated 1 year ago
2
myscience/retnet-pytorch #2

Changelog of official implementation

Thanks for the well-written package! The RetNet's official implementation had several updates at https://github.com/microsoft/unilm/blob/master/retnet/README.md#changelog .

donglixp updated 1 year ago
1
akshayatam/machine-translation-with-retnet #2

encoder_output isn't used in the RetNet decoder.

Hi, thanks for sharing your code! I'm also implementing an encoder-decoder model with a similar structure to yours. However, I'm confused about why the encoder_output isn't used in the RetNet deco…

Kimchangheon updated 8 months ago
1
mindspore-courses/step_into_llm #31

The recently released RetNet seems to be really popular

Could you cover RetNet? It seems to have a lot of potential.

zcm200605 updated 1 year ago
1
veya2ztn/fast_retention #2

Question about larger D

Thanks for the great work! I am just wondering what would happen if we had a higher D. This is because the RetNet configs that you can obtain from torchscale (and also mine) typically have `D=128` …

syncdoth updated 11 months ago
3
ollama/ollama #3023

Mamba State Space Models Integration

Mamba model support has been merged over at llama.cpp. Would it be possible to implement this in Ollama as well? Merged PR: https://github.com/ggerganov/llama.cpp/pull/5328 …

MarcellM01 updated 1 week ago
5
akshayatam/machine-translation-with-retnet #1

Quality discrepancy between different platforms: Ubuntu with…

Great work, thank you! I am encountering the following issue: When I follow your retnet_machine_translation.ipynb to train retnet on Ubuntu with CUDA, I achieve the same quality as you reported. Howev…

muranski updated 8 months ago
2
