-
Hi, friend, maybe you can try to use the llama architecture instead of the original Transformer?(You can refer to llama architecture in llama2.c)
-
Error info:
File "/opt/conda/lib/python3.8/site-packages/deepspeed/runtime/hybrid_engine.py", line 99, in new_inference_container
File "/opt/conda/lib/python3.8/site-packages/deepspeed/module_…
-
I'm trying to use a DDPG agent with actor and critic networks, and a TFUniform replay buffer, training on my custom environment.
I've extracted a training experience from the buffer using:
```
da…
-
I have load pdf as document, now want to generate test data from it, error is occuring.
---------------------------------------------------------------------------
ExceptionInRunner …
-
Please add as much of the following info as you can:
Title: Kiss of Death
Type (film/tv show): Film - romantic thriller
Film or show in which it appears: The Critic (1994-2001)
Is the pare…
-
-
Anyone know what's wrong of using tf-agent here which trigger the ValueError?
ValueError: Inputs to TanhNormalProjectionNetwork must match the sample_spec.dtype.
In call to configurable 'SacAgent'…
-
@ZhengyiLuo Hello!
I have found the comparison with VQ-latent space in your [project page](https://www.zhengyiluo.com/PULSE-Site/#compare_vq),
and there are some codes related to VQ-latent space s…
-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
I want to create synthetic test data. Using the OpenAI or Anthropic API is very exp…
-
Baseline PPO agent:
- Critic represents total reward
- Actor is trained to maximize critic
CBF PPO agent:
- Base critic represents nominal reward
- CBF critic represents safety reward
- Actor…