massquantity / DBRL

Dataset Batch(offline) Reinforcement Learning for recommender system
143 stars 37 forks source link