distributed-reinforcement-learning Search Results

410 results
for distributed-reinforcement-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/transformers #20320

Loading model OOMs with more GPUS

### System Info - `transformers` version: 4.21.2 - Platform: Linux-5.10.135-122.509.amzn2.x86_64-x86_64-with-glibc2.2.5 - Python version: 3.8.5 - Huggingface_hub version: 0.10.0 - PyTorch versi…

Dahoas updated 1 year ago
6
openjournals/joss-reviews #5017

[REVIEW]: DeBIR: A Python Package for Dense Bi-Encoder Infor…

**Submitting author:** @ayuei (Vincent Nguyen) **Repository:** https://github.com/Ayuei/DeBEIR **Branch with paper.md** (empty if default branch): paper **Version:** v0.0.1 **Editor:** @arfon **Review…

editorialbot updated 1 year ago
99
NixOS/nixpkgs #72175

Package request: ray (python package)

**Project description** Ray is a fast and simple framework for building and running distributed applications. It is packaged with RLlib, a scalable reinforcement learning library. The project p…

rht updated 2 years ago
11
africamonkey/autopilot-cross-intersection #1

Reference papers

Hello Thank you for sharing your materials! And I am very happy with your modified Flow. In the past, I tried to install Flow from the official repo but it always had errors. With your repo, it is e…

TrinhTuanHung2021 updated 2 years ago
2
pytorch/pytorch #51631

DataParalllel to ONNX

## ❓ Questions and Help I am new to Pytorch and distributed learning. I am using mlagents to do deep reinforcement learning. Their source code does not support training with multiple GPUs. Therefor…

qiwu57kevin updated 2 years ago
2
LAION-AI/Open-Assistant #2102

Toxicity filters cause bias towards lies?

I've got a hypothesis about the tendancy of ChatGPT to agree with marketing lies over user opinion. I think it's because rude, crass, brutally honest opinions are marked as "toxic" and are therefore f…

bitplane updated 1 year ago
10
timoklein/alphazero-gym #11

Why does the policy loss not decrease but increase?

I executed the run_continuous.py file for the continuous agent and found that the policy loss increased approximately linearly with training episodes until it stabilized. Why is the policy loss not re…

cz11233 updated 1 year ago
5
sourcegraph/about #944

`git pull` ~3 weeks of changes to this repo takes me 3m11s

This is very painful. I wish we would stop committing images here. ``` $ git pull remote: Enumerating objects: 1347, done. remote: Counting objects: 100% (1347/1347), done. remote: Compressin…

slimsag updated 1 year ago
1
JuliaGPU/CUDA.jl #657

Taking gradient with Flux results in NaNs when using CUDA ar…

I am working on a reinforcement learning project using Flux.jl and CUDA.jl. When running one of my experiments, after several million steps `NaN`s pop up and propagate everywhere. I tracked down the i…

jonas-eschmann updated 1 year ago
2
neuropoly/neuro.polymtl.ca #80

List publications with label filtering

Example: https://mila.quebec/en/publications/ It would be nice to reuse the same code as in the Mila website. Not sure if that's 'easily' possible via RTD

jcohenadad updated 1 year ago
1

上一页 1...22 23 24 25 26 27 28...41 下一页

410 results for distributed-reinforcement-learning

410 results
for distributed-reinforcement-learning