-
Hi!
I've been working with your IH/IG implementation lately, and doing some experiments with it in an NLP context. What I have noticed is that when I increase the length of my input this is an adve…
-
Hi,
I am not sure if inference speed is mentioned anywhere. Couldn't find it in the paper or supplementary material. Could you share that?
Thanks!
-
Hello! I have some questions about the LQR used in the code, and I would really appreciate it if you could answer them.
In the “Init_PreviewControl_Paramter.m” file, there are
```
%Error System
…
-
I used the same data to run the same function five times, and the five running times were: 1282.27764 ms, 0.35153 ms, 0.15597 ms, 0.1487 ms, 0.14346 ms. The difference between 0.14346 ms and 0.35153 m…
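
To separate a one-off first-call cost from steady-state time, one option is to time every call and report the first call apart from the rest. This is a minimal Python sketch that assumes wall-clock timing is adequate. `time_calls` and `summarize` are hypothetical helper names, not part of any project, and the list in the usage lines holds the five timings quoted above.

```python
import time


def time_calls(fn, *args, repeats=5):
    """Call fn(*args) `repeats` times; return each duration in milliseconds."""
    durations_ms = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        durations_ms.append((time.perf_counter() - start) * 1000.0)
    return durations_ms


def summarize(durations_ms, warmup=1):
    """Report the first call separately from the remaining (steady-state) calls."""
    steady = durations_ms[warmup:]
    return {
        "first_call_ms": durations_ms[0],
        "steady_mean_ms": sum(steady) / len(steady),
        "steady_min_ms": min(steady),
        "steady_max_ms": max(steady),
    }


# Usage with the five timings quoted above (hard-coded, not re-measured).
reported_ms = [1282.27764, 0.35153, 0.15597, 0.1487, 0.14346]
stats = summarize(reported_ms)
print(stats)
```

Reporting the steady-state minimum and maximum next to the first-call time shows whether the remaining spread is small compared with the initial cost.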
-
I looked at how exactly LSTMs are trained with PPO2 and found that a lot of unnecessary data transformations happen:
1. Trajectories are sampled by the [Runner](https://github.com/hill-a/stable-basel…
-
Hello! Can this code run? I get errors when running it with PyTorch and cannot find a solution. Are there any special requirements beyond the environment you specified?
-
Thanks to @polymorphicengine, tidal-listener now builds into a relocatable binary, great! We should make use of this for the next tidal release.
We could distribute a ghci drop-in replaceme…
-
- [ ] [Transformer Models](https://docs.cohere.com/docs/transformer-models#the-softmax-layer)
# Transformer Models
**Description:**
- **Tokenization**
Tokenization is the most basic step. It consi…
-
This is a list of todos for BH.
Required to run anything:
**Phase1 for BH bring up - target 5/2**
**@abhullar-tt**
- [x] #8453 bump umd version up to latest
- [x] #8530 ARCH_NAME = blackhole
- [x] #856…
-
![QQ图片20240714011506](https://github.com/user-attachments/assets/19c84688-8322-4893-823b-d8daecfe847b)
(calm) (base) penghuan@ubuntu:~/code/SimSGT/regression$ sh script/pretrain_GEOM.sh
add args
…