critic-info Search Results

1000+ results
for critic-info

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

JangYeongSil/JettaRLLLM #1

give you an advice

Hi, friend, maybe you can try to use the llama architecture instead of the original Transformer?(You can refer to llama architecture in llama2.c)

win10ogod updated 1 month ago
1
microsoft/DeepSpeedExamples #448

[DeepSpeedExamples/applications/DeepSpeed-Chat/] Error happe…

Error info: File "/opt/conda/lib/python3.8/site-packages/deepspeed/runtime/hybrid_engine.py", line 99, in new_inference_container File "/opt/conda/lib/python3.8/site-packages/deepspeed/module_…

GxjGit updated 11 months ago
2
tensorflow/agents #476

agent.train --> TypeError: call() missing 1 required positio…

I'm trying to use a DDPG agent with actor and critic networks, and a TFUniform replay buffer, training on my custom environment. I've extracted a training experience from the buffer using: ``` da…

Stentaur updated 2 years ago
5
explodinggradients/ragas #1325

Generate test data for 1 pdf

I have load pdf as document, now want to generate test data from it, error is occuring. --------------------------------------------------------------------------- ExceptionInRunner …

wanjeakshay updated 1 month ago
13
lynnandtonic/nestflix.fun #175

Add Kiss of Death from “The Critic” (Screenshots Added)

Please add as much of the following info as you can: Title: Kiss of Death Type (film/tv show): Film - romantic thriller Film or show in which it appears: The Critic (1994-2001) Is the pare…

HarleysTwin updated 2 years ago
1
csingh27sewts/Masterarbeit #62

Check why the cloth is pushed away towards the edge and solv…

csingh27 updated 3 years ago
5
tensorflow/agents #508

ValueError: Inputs to TanhNormalProjectionNetwork must match…

Anyone know what's wrong of using tf-agent here which trigger the ValueError? ValueError: Inputs to TanhNormalProjectionNetwork must match the sample_spec.dtype. In call to configurable 'SacAgent'…

lamhoson updated 4 years ago
1
ZhengyiLuo/PULSE #6

Question: Training with VQ-latent space

@ZhengyiLuo Hello! I have found the comparison with VQ-latent space in your [project page](https://www.zhengyiluo.com/PULSE-Site/#compare_vq), and there are some codes related to VQ-latent space s…

frredy99 updated 2 months ago
2
explodinggradients/ragas #1070

429 Request Error with Langchain Huggingface Endpoint

[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. I want to create synthetic test data. Using the OpenAI or Anthropic API is very exp…

jonas-nothnagel updated 3 months ago
3
dtch1997/safe-control-gym #1

Safety actor-critic

Baseline PPO agent: - Critic represents total reward - Actor is trained to maximize critic CBF PPO agent: - Base critic represents nominal reward - CBF critic represents safety reward - Actor…

dtch1997 updated 1 year ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for critic-info

1000+ results
for critic-info