-
I setup a critic server.
Auth by critic db and use cookie to keep session.
After then, I add critic as remote repo and fetch it.
It report an error with message "is this a git repository?"
I run tcpd…
-
Is there a Solid response to https://twitter.com/bling0/status/977750840659210242?s=19
> 2007: “Facebook is evil and a walled garden. It must share user info and the graph with other developers!” …
-
I have a program that runs for a very short time (
-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
I want to create synthetic test data. Using the OpenAI or Anthropic API is very exp…
-
Hello!
Sometimes I get error messages from critic to my email when it's updating a review from a tracked branch in a remote repository after someone pushed into that branch:
```
2018-02-16 12:06…
-
get subj error in line `policy_estimator = PolicyEstimator(learning_rate=0.001)`
**Continuous MountainCar Actor Critic Solution.ipynb**
python-3.5.2
tensorflow '1.0.0'
```
policy_estimator …
-
Error info:
File "/opt/conda/lib/python3.8/site-packages/deepspeed/runtime/hybrid_engine.py", line 99, in new_inference_container
File "/opt/conda/lib/python3.8/site-packages/deepspeed/module_…
-
-
Baseline PPO agent:
- Critic represents total reward
- Actor is trained to maximize critic
CBF PPO agent:
- Base critic represents nominal reward
- CBF critic represents safety reward
- Actor…
-
Hello!
I noticed that the maximum eposides can be controlled by MAX_EPISODES during training, and EVAL_INTERVAL determines the evaluation intervals; however, the evaluation process seems to determi…