[rllib] Continuous Bandit algorithm

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

https://ray.io

Apache License 2.0

34.18k stars 5.8k forks source link

[rllib] Continuous Bandit algorithm #8448

Closed Leonolovich closed 2 years ago

Leonolovich commented 4 years ago

What is your question?

I am trying to implement a multi-armed bandit solution for continuous action spaces. I know RLLib provides contextual bandit algorithms, but they use Discrete action spaces. Any thoughts on how to maybe use the DDPG algo for a bandit scenario?

Ray version and other system information (Python version, TensorFlow version, OS): Ray 0.9.0 Python 3.6 Centos7

stale[bot] commented 4 years ago

Hi, I'm a bot from the Ray team :)

To help human contributors to focus on more relevant issues, I will automatically add the stale label to issues that have had no activity for more than 4 months.

If there is no further activity in the 14 days, the issue will be closed!

If you'd like to keep the issue open, just leave any comment, and the stale label will be removed!
If you'd like to get more attention to the issue, please tag one of Ray's contributors.

You can always ask for help on our discussion forum or Ray's public slack channel.

stale[bot] commented 4 years ago

Hi again! The issue will be closed because there has been no more activity in the 14 days since the last message.

Please feel free to reopen or open a new issue if you'd still like it to be addressed.

Again, you can always ask for help on our discussion forum or Ray's public slack channel.

Thanks again for opening the issue!

stale[bot] commented 2 years ago