pagand / ORL_optimizer

offline RL optimizer
0 stars 0 forks source link

Model-based Confidence-aware Offline RL

coteaching

Contents

  1. Setup
  2. Usage
  3. Disclaimer

Setup

Clone the repository and build a conda environment from requirement.txt:

git clone https://github.com/pagand/ORL_optimizer
cd ORL_OPTIMIZER
conda create -n orl python=3.10
conda activate orl
pip install --upgrade pip
pip install -r requirements.txt

Now install the following extra packages:

pip install 'cython<3'
pip install scipy==1.12

Install the correct version of pytorch given your CUDA from start locally or previous versions. For CUDA 11.8

pip3 install torch  --index-url https://download.pytorch.org/whl/cu118

Install Jax (only required for the CORL):

pip install "jax[cuda11_cudnn86]"==0.4.7 -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

Usage

The code of combustion simulator is in Simulator/simrnn_model.py, the code of model-based offline RL framework, MORE, is in RL/primal_dual_ddpg.py.

cd MBORL
cd CORL
cd VesselModel
cd MORE

Disclaimer

This code is is heavily based on