[WIP] Synthetic simulation, environment, and dataset for RTB

aiueola commented 3 years ago

Type of change

Implemented synthetic simulation, environment, and dataset modules for Reinforcement Learning (RL) in Real-Time Bidding (RTB).

Please refer to the description in the following files. Also, feel free to ask any questions.

Constrained MDP definition in RL: _gym/env/rtb.py
How environment interacts with an RL agent and how each auction outcome is calculated: _gym/env/rtb.py
Ground-truth winning function and CTR/CVR definition: _gym/simulator/function.py
Parameters in the simulator and the environment: _gym/simulator/rtb_synthetic.py and _gym/env/rtb.py, respectively
Usage of the environment and the dataset modules: _gym/env/rtb.py and _gym/dataset/synthetic.py, respectively.
Quickstart code (rough version): examples/quickstart/rtb_synthetic.ipynb
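
For readers new to the setup, here is a minimal sketch of how a single auction outcome could be computed under a budget constraint. It is not the code in _gym/env/rtb.py. The function name `simulate_auction_step`, its arguments, and the second-price payment rule are all assumptions made for illustration.

```python
import random


def simulate_auction_step(bid_price, market_price, ctr, cvr, remaining_budget, rng):
    """Hypothetical outcome of one impression auction (illustrative only).

    Assumptions: the winner pays the market price (second-price rule), and a
    bid cannot win if that cost exceeds the remaining budget.
    """
    # Win only if the bid meets the market price and the budget covers the cost.
    win = bid_price >= market_price and market_price <= remaining_budget
    cost = market_price if win else 0.0
    # A click can occur only on a won impression; a conversion only after a click.
    click = win and rng.random() < ctr
    conversion = click and rng.random() < cvr
    return {
        "impression": int(win),
        "click": int(click),
        "conversion": int(conversion),
        "cost": cost,
        "remaining_budget": remaining_budget - cost,
    }


if __name__ == "__main__":
    rng = random.Random(0)
    print(simulate_auction_step(10.0, 5.0, 0.3, 0.1, 100.0, rng))
```

In an RL formulation, an agent would typically choose the bid (or a bid adjustment factor) at each step, and the remaining budget would act as the constraint of the constrained MDP.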

Please refer to the requirements here.
This is a work in progress; I have not yet performed error checks or debugging. I will also add some references later.
I would appreciate any suggestions, including more reasonable names and descriptions for the parameters, more efficient calculation procedures, coding style, etc.

k-kawakami213 commented 3 years ago

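@aiueola (cc @kojikawamura) Is there a description somewhere of how to verify that this works? I looked at the pull request but could not quite figure it out... I would like to actually run it, including the simulation.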

aiueola commented 3 years ago

@k-kawakami213 (cc: @kojikawamura) Thank you!

I just added the requirements, so please take a look at them.

Please refer to the requirements here.

Also, although it is still quite rough, I wrote up how to use the dataset module in an ipynb.

Quickstart code (rough version): examples/quickstart/rtb_synthetic.ipynb

For more detailed definitions of the arguments and so on, I would appreciate it if you could look here.

Usage of the environment and the dataset modules: _gym/env/rtb.py and _gym/dataset/synthetic.py, respectively.

Thank you in advance!