-
I am trying to train a model using PPO, and the stable-baselines3[extra] library is also installed.
The issue occurs because the StochasticFrameSkip object does not have an action_space attribute, l…
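For reference, here is a minimal sketch of a stochastic frame-skip wrapper, assuming gymnasium and illustrative defaults for `n` and `stickprob` (not the repository's actual implementation). The relevant detail for this error is that subclassing `gym.Wrapper` is what forwards `action_space` and `observation_space` from the wrapped environment:

```python
import gymnasium as gym
import numpy as np


class StochasticFrameSkip(gym.Wrapper):
    """Repeat each action for n frames, occasionally keeping the previous ("sticky") action.

    Subclassing gym.Wrapper forwards attributes such as action_space and
    observation_space from the wrapped env; a plain class without this base
    would raise the missing-attribute error described above.
    """

    def __init__(self, env, n=4, stickprob=0.25):
        super().__init__(env)
        self.n = n
        self.stickprob = stickprob
        self.curac = None
        self.rng = np.random.default_rng()

    def reset(self, **kwargs):
        self.curac = None
        return self.env.reset(**kwargs)

    def step(self, action):
        total_reward = 0.0
        terminated = truncated = False
        for i in range(self.n):
            # On the first skipped frame, keep the previous action with probability stickprob.
            if self.curac is None or i > 0 or self.rng.random() > self.stickprob:
                self.curac = action
            obs, reward, terminated, truncated, info = self.env.step(self.curac)
            total_reward += reward
            if terminated or truncated:
                break
        return obs, total_reward, terminated, truncated, info
```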
-
# Description
I do not have a specific project in mind. It would be great if a mentor were interested in this topic. I would like to work on research into Quantum Reinforcement Learning algorithms …
-
https://datawhalechina.github.io/easy-rl/#/chapter5/chapter5
Description
-
### Is your feature request related to a problem? Please describe.
UAV path planning is a crucial and interesting problem to learn about.
### Describe the solution you'd like.
UAV Path Planning Algorith…
-
- lambdas: discrete action space [1, 2, 3, 4, 5, 6, 7, 8] VERSUS discrete action space [2, 4, 6, 7, 8] VERSUS continuous action space 1 - 8 (see the sketch after this list).
- DDQN versus PPO
- gap_to_optimality: 0.95 VERSUS 0.8 VERSUS 0.7 V…
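For concreteness, a minimal sketch of how the three lambda action-space variants could be declared with gymnasium spaces (variable names are illustrative, not from the codebase). Note that DDQN requires a discrete action space, so the continuous variant would only pair with PPO:

```python
import numpy as np
from gymnasium import spaces

# Variant A: full discrete set of lambdas; Discrete(8) indexes into the value list.
lambda_values_full = [1, 2, 3, 4, 5, 6, 7, 8]
action_space_a = spaces.Discrete(len(lambda_values_full))

# Variant B: reduced discrete set.
lambda_values_reduced = [2, 4, 6, 7, 8]
action_space_b = spaces.Discrete(len(lambda_values_reduced))

# Variant C: continuous lambda in [1, 8]; usable with PPO but not with DDQN.
action_space_c = spaces.Box(low=1.0, high=8.0, shape=(1,), dtype=np.float32)
```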
-
### 🐛 Describe the bug
Hello PyTorch team, I have recently been converting [OpenAI's TensorFlow RLHF code](https://github.com/openai/lm-human-preferences) to PyTorch. Given the same data and model, I was …
-
Add the [Phi-3.5-MoE-instruct](https://huggingface.co/microsoft/Phi-3.5-MoE-instruct) model.
> Phi-3.5-MoE is a lightweight, state-of-the-art open model built upon datasets used for Phi-3 - synthet…
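For reference, a minimal loading sketch along the lines of the Hugging Face model card, assuming a transformers version that supports the model via `trust_remote_code`; this only illustrates the model being requested, not the integration itself:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-3.5-MoE-instruct"

# Native support may vary by transformers version; the model card uses remote code.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

messages = [{"role": "user", "content": "Explain mixture-of-experts in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```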
-
These questions are specifically about utilizing discrete DMPs (implemented in this repository) for the reaching task, especially in the context of this paper (https://onlinelibrary.wiley.com/doi/abs/…
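For context, a minimal sketch of the standard discrete DMP formulation (Ijspeert-style) for a 1-D reaching movement; the repository's implementation and the linked paper may differ in details such as the forcing-term scaling:

```python
import numpy as np


def integrate_discrete_dmp(x0, g, weights, centers, widths,
                           tau=1.0, dt=0.001, alpha_s=4.0, K=100.0, D=None):
    """Euler-integrate a 1-D discrete DMP from start x0 toward goal g.

    Canonical system:       tau * ds/dt = -alpha_s * s
    Forcing term:           f(s) = (sum_i psi_i(s) w_i / sum_i psi_i(s)) * s * (g - x0)
    Transformation system:  tau * dv/dt = K * (g - x) - D * v + f(s)
                            tau * dx/dt = v
    """
    D = 2.0 * np.sqrt(K) if D is None else D  # critically damped by default
    x, v, s = float(x0), 0.0, 1.0
    path = [x]
    for _ in range(int(tau / dt)):
        psi = np.exp(-widths * (s - centers) ** 2)  # Gaussian basis functions of the phase s
        f = (psi @ weights) / (psi.sum() + 1e-10) * s * (g - x0)
        v += dt / tau * (K * (g - x) - D * v + f)
        x += dt / tau * v
        s += dt / tau * (-alpha_s * s)
        path.append(x)
    return np.array(path)


# Example: a reach from 0.0 to 1.0 with an untrained (zero) forcing term.
n_basis = 10
centers = np.exp(-4.0 * np.linspace(0, 1, n_basis))  # basis centers spread over s in (0, 1]
widths = np.full(n_basis, n_basis**1.5)
trajectory = integrate_discrete_dmp(0.0, 1.0, np.zeros(n_basis), centers, widths)
```

With all weights set to zero this reduces to a critically damped spring pulling `x` toward the goal `g`; learned weights shape the transient toward a demonstrated reaching trajectory.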