-
### What feature or new tool do you think should be added to DevToys?
Tools like Copilot, ChatGPT or BingChat are truly helpful from a developer perspective.
### Why do you think this is needed?
Ha…
veler updated
6 months ago
-
Hi, I see that the RecurrentCache was renamed to Cache for gla model. However, it raised error as Cache does not have method “from_legacy_cache”.
OREYR updated
4 months ago
-
Thanks!
-
### 🐛 Describe the bug
I have a Mac M1 GPU and I've been trying to replicate the results in [this google colab notebook](https://colab.research.google.com/drive/1_X7O2BkFLvqyCdZzDZvV2MB0aAvYALLC) on …
-
/runs simply holds files for tensorboard data for given models. Moving /runs into each model (and probably naming it something like /tensorboard) would be superior.
-
Hello,
And thanks for torch quantum !...
I applied the new requirements.txt provided one hour ago... and I've a new problem during torchquantum importation :
File ~/miniconda3/envs/retnet_exper…
-
I want to try this NGH, but I have already trained a lot on normal RetNet.
Does NGH work with learned model?
-
Congratulations for such a nice work ! are there any benchmarks for direction comparison to RetNet in terms of throughput (or latency) ? Figure 4 in the paper is the only comparison to RetNet, among o…
-
Hello👋 ,
This is a draft outline for the Trends and New Architectures chapter. I think it'd be better to call it alternative rather than new. Below I'll give a brief overview of the chapter content…
-
The referenced line of code below is giving the following error if the data is already downloaded. Should be a simple fix.
![image](https://github.com/DRAGNLabs/301r_retnet/assets/10404106/5d29e22f…