-
Hi Users,
We have been informed that GitHub has recently deleted several comments and posts due to violations of their [Hate Speech and Discrimination Policy](https://docs.github.com/en/site-policy…
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/omnisafe/issues) and [Discussions](https://github.com/PKU-A…
-
Dear Marabou Developers,
I am currently using the Marabou library for a project and I have encountered an issue with the `network.solve()` method. When I call this method, it raises a `ValueError` …
-
-
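This issue collects the terminology and related notes for the Chinese version of each unit of this course.
Translators, please gather the important concepts and related vocabulary under this issue.
# Note
> The summary covers not only the original English terms, but also related terms that are hard to translate into Chinese, supplementary material, and so on.
# Format:
## Unit 1: XXXX @translators
### [Terminology]:
- Markov property:
- This means that the action taken by our agent*…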
-
-
I am using the large-v3 model.
After running for a while, it repeatedly outputs the same sentence, like this:
00:00:00->00:00:29:请不吝点赞 订阅 转发 打赏支持明镜与点点栏目
00:00:29->00:00:59:请不吝点赞 订阅 转发 打赏支持明镜与点点栏目
0…
-
This is the terminal log:
![image](https://github.com/yamauz/live-gpt/assets/134033385/e8fcd7bd-1690-4ebb-b0a9-1932329e322e)
Error transcribing audio (attempt 2): Error: HTTP error! sta…
-
P3O (Pairwise Proximal Policy Optimization) is a method from a recent Berkeley paper.
It introduces a new way to align LLMs to human preferences. The loss function is particularly cool as it directly operates on co…
-
Hello,
I've been exploring the implementation of Proximal Policy Optimization (PPO) in the [ppo_trainer.py](https://github.com/huggingface/trl/blob/main/trl/trainer/ppo_trainer.py) file, and I have…