-
Dear author,
I have read your paper on MuJoCo experiments and I am particularly interested in the hyperparameters used for PPO_GRU and A2C_GRU. I would greatly appreciate it if you could provide me…
-
Hello,
I have been reading through #273 and #187 but I couldn't understand how to resume from a checkpoint because my logs don't have a .ckpt file in them.
![image](https://github.com/Eclectic-Sh…
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
binary
### TensorFlow version
2.14.0
### Custom code
Yes
### OS platform and distribution
Linux Ub…
-
https://github.com/stan-dev/rstan/pull/887 is huge and I think the PR thread will eventually become impossible to follow. So I am opening this issue where you can post anything that is missing in the …
-
### System Info
```Shell
accelerate: 0.28
deepspeed: 0.14
torch: 2.2.1
FSDP configs
fsdp_config:
fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
fsdp_backward_prefetch_policy: BACK…
-
VSCode added a build step that mangles all typescript private fields so they can't be reasonably accessed by Customize UI. This greatly reduces hooks that customize-ui can intercept and methods it can…
-
# 问题
转换完权重之后进行评估验证时出现下述问题
```shell
> number of parameters on (tensor, pipeline) model parallel rank (0, 0): 630167424
loading release checkpoint from /raid/LLM_train/Pai-Megatron-Patch/checkpoint…
-
## Keyword: sgd
There is no result
## Keyword: optimization
### Joint Information and Mechanism Design for Queues with Heterogeneous Users
- **Authors:** Authors: Nasimeh Heydaribeni, Achilleas Ana…
-
## Keyword: metric learning
### ShufaNet: Classification method for calligraphers who have reached the professional level
- **Authors:** Ge Yunfei, Diao Changyu, Li Min, Yu Ruohan, Qiu Linshan, Xu…
-
I training on custom data set based on Window 10.
I don't know what happened in the training.
Anyone can tell me about this?
-----------------------------------------------------
2021-03-30 22:…