actor-model Search Results

OpenRLHF/OpenRLHF #230

Actor-Critic-Model

If I understand the current PPO code correctly, this instantiates completely separate actor and critic models, without any layers shared between them. (But correct me in case that is wrong?) Instea…

mgerstgrasser updated 2 months ago

freeflowuniverse/crystallib #255

Actor generator that generates actors for models

timurgordon updated 2 months ago

pytorch/xla #8180

Model support for `soft_actor_critic` with Torch_XLA2

## Fix the model test for `soft_actor_critic.py` 1. setup env according to [Run a model under torch_xla2](https://github.com/pytorch/xla/blob/master/experimental/torch_xla2/docs/support_a_new_model…

ManfeiBai updated 2 weeks ago

THUDM/WebRL #7

Where are policy_lm and critic_lm?

scripts/config/main/webrl.yaml: defaults: - default - _self_ save_path: /workspace/WebRL/scripts/output run_name: "webrl" critic_lm# training policy_lm: /workspace/WebRL/webrl-glm-4-9…

zhengshf updated 4 days ago

alloystorm/dvvr #379

Clothing Simulation Pulling Bug 2024.11

This Bug seem interesting. it's occur in the Clothing Simulation > Mesh 1/2 > Inner radius + Top / Bottom Anchor Lock. the Inner Radius seem to be randomly change the Bug some Higher Value the B…

TretanJonas updated 18 hours ago

openmoh/openmohaa #536

Audio hissing during "Omaha Beach - The Landing - Starting" …

**Describe the bug** Loading the save for the initial d-day section during the landing craft ride the audio hisses on the left channel then the right channel before looping back to the left channel. …

joebonrichie updated 4 hours ago

opengeospatial/ogc-geosparql #579

Use Case 579 : 3D model to store volumetric survey plans

# Use Case 579 ## 3D model to store volumetric survey plans As a user of GeoSPARQL data, I need a 3D model to accurately store information from volumetric survey plans and to conduct analyses fo…

GordonSzczepina updated 2 weeks ago

fatbobman/CoreDataEvolution #1

Multiple actors calling each other can deadlock

TL;DR: The use of `context.performAndWait` can introduce deadlocks into programs which would not deadlock with normal actors. ## Background I was excited to use this library to solve some threadin…

theospears updated 5 days ago

microsoft/DeepSpeedExamples #922

Actor loss nan and Resizing model embedding

The model I use is GPT-2 124M. When resizing model embeddings during the training of STF and RW, I often encounter issues where the generated answers consist entirely of zeros. This causes both the lo…

ouyanmei updated 2 months ago

OpenRLHF/OpenRLHF #472

How do you connect different models using Ray.

For example, reward model has 8 GPU cards with TP and DP configuration. Actor model might have TP&PP&DP(just for example) occupying 64 GPUs. How do you connect Actor's last stage's output to reward m…

zpcalan updated 3 weeks ago

1000+ results for actor-model

1000+ results
for actor-model