-
Hello!
Thanks so much for sharing the code!
I am new at inverse reinforcement learning. Now I am trying to apply the code to a customized environment without knowing anything about the reward fu…
ghost updated
8 months ago
-
Hey Team,
Thank you for this awesome work and for releasing the super clean training code! We are super interested in reproducing result on Mistral 7b model.
I have trained a Mistral model with …
-
Hello!
I'm very excited about using this library, however the README claims it is in a broken state, waiting for fixes in the CMA-ES repo.
I see the CMA-ES repo is more active, with many commits r…
-
-
### 一、BackGround 📚
飞桨新IR(PIR)功能建设已经基本完成,当前CI流水线上静态图依然是以飞桨旧IR运行,我们想将CI默认运行的IR切换至PIR,从而能顺利支持未来飞桨基于PIR下的代码提交与验证。但是当前依然有很多单测在PIR模式下的运行会存在问题,修复这些问题成为实现默认切换PIR的必要条件。
### 二、Task Introduction📚
对于存在问题的单测,我们已统一…
-
## 背景
众所周知,Paddle 是一个历史悠久的框架,使得 Paddle 能够久经考验,应对各种场景稳定运行。但历史的沉淀同样带来一个严重的问题,就是框架内 API 语义不清晰,多种 API 能够做同样或者类似的事情。得益于我们的公开 API 审查机制和 fluid 清理,公开 API 中类似问题较少,但框架内部仍存在大量历史遗留的内部 API 的使用,这些 API 的存在导致框架内部需…
-
## Bug description
Hi, I'm currently adapting the Inverse Reinforcement Learning algorithm to analyze the behavior of mice in our lab studies. For this, I have used the Maximum Causal Entropy (MCE) a…
-
Leave below as comments your memos that grapple with the topic of cyber inspired by the readings, movies & novels (at least one per quarter), your research, experiences, and imagination! Also add a th…
-
(I'll update this as I go along)
[Proof of completion](https://github.com/spamegg1/ScalaCapstone/blob/master/spec.png) (you have to scroll sideways a bit)
Originally written: Tuesday, June 9, 20…
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
binary
### TensorFlow version
2.14.0
### Custom code
Yes
### OS platform and distribution
Linux Ub…