-
Submitting Author: Jinning Wang (@jinningwang)
All current maintainers: Hantao Cui (@cuihantao), Jinning Wang (@jinningwang )
Package Name: ANDES
One-Line Description of Package: Power system trans…
-
Perhaps something like `PASS` to basically return whatever was input and `REPL` for removing punctuation.
Another option would be something like `CB` for check Unicode Block.
-
**Describe the bug**
when i run train,rlhf step 3;
```
Actor_Lr=9.65e-6
Critic_Lr=5e-6
#--data_path Dahoas/rm-static \
#--offload_reference_model \
deepspeed --master_port 12346 main_step3.py…
-
The aim here is to show:
* how to build training sets and run the models
* what models and methods are implemented
* Mention ascertainment bias https://github.com/gaynorr/Alph…
-
- [ ] Introduction
- [x] Astrophysical Motivation
- [x] Survey of the state of the art of High-contrast imaging design
- [x] Ray Tracers: What they are used for and how they work
…
-
用我们自己的SR Dataset 开始测试了, 58W张 720x720 的高清图, 数据分布非常好 :) 相信我 :)~
已经跑起来,开始train 了, 不过 train 起来是真的慢啊, MSE model 需要 27 天 :( 然后 GAN 估计还需要27 天
27 天啊, A100 x4 .
不过为了保证质量, options 文件做了点修改:
gt_size: 3…
-
### 🐛 Describe the bug
I am running example codes show in https://github.com/hpcaitech/ColossalAI/tree/main/examples/language/gpt/experiments/auto_parallel with Pytorch 2.0 (because I need to deploy…
wxthu updated
3 months ago
-
**Describe the bug**
I Attempt to train reward models of different size(3B/6B/30B), and found out that when PP > 1, two type of issues arise
3B/6B:
* TP=4, PP=1: ok
* TP=4, PP=2: the job hang…
zirui updated
1 month ago
-
**Describe the bug**
when i set "hybrid_engine" for making step3 training faster, the training progress is not stabilization, and often errors occur after just running or running a few steps
in st…
-
**Main Problem:**
This paper proposed time-sensitive and personalized query auto completion (QAC), named hybrid QAC. They handle the long-tail prefixes. Given a threshold N, we define a prefix p to b…