How to use a trained model?

JaCoderX commented 5 years ago

I am following the examples BTGym have and trained a simple model using the unreal_stacked_lstm example on EUR-USD data.

now for the purpose of learning, let's say the model looks good and I want to use it on new data and see how it interact with the environment, how do I do it? meaning how do I load the model and ask for an action prediction?

Kismuz commented 5 years ago

@JacobHanouna, refer to #49, #46, #40, #23

JaCoderX commented 5 years ago

@Kismuz I have read the issues you refereed to and tried to modify the 'unreal_stacked_lstm' example to see how the trained model behaves (on same example of course). but I'm struggling to make it work as I want.

I would say that my end goal here is to use Backtrader plot ability (Cerebro.plot()) after the launcher finish an epoch over the test data.

based on your 'unreal_stacked_lstm' example this is what I've tried changing:

I have changed MyDataset according to 'data_domain_api_intro' example. so I would have access to the target_period param

setting this param to non-zero duration forces separation to source/target domains (which can be thought of as creating top-level train/test subsets) with target data duration equal to target_period. Source data always precedes target one.

I changed also the data domain to 'BTgymCasualDataDomain' because it sound like it would make more sense to have the data being selected as a time sequence. so it look like this in the end:

MyDataset = BTgymCasualDataDomain(
    filename='./data/DAT_ASCII_EURUSD_M1_2017.csv',
    target_period={'days': 50, 'hours': 0, 'minutes': 0},  # use last 50 days of one year data as 'target domain'
                                                           # so we get [360 - holidays gaps - 50] 
                                                           # days of train data (exclude holidays)
    trial_params=dict(
        start_weekdays={0, 1, 2, 3, 4, 5, 6}, 
        sample_duration={'days': 30, 'hours': 0, 'minutes': 0}, # let each trial be 10 days long
        start_00=True,  # ajust trial beginning to the beginning of the day
        time_gap={'days': 15, 'hours': 0},  # tolerance param 
        test_period={'days': 6, 'hours': 0, 'minutes': 0},  # from those 10 reserve last 2 days for trial test data
    ),
    episode_params=dict(
        start_weekdays={0, 1, 2, 3, 4, 5, 6},
        sample_duration={'days': 0, 'hours': 23, 'minutes': 55},  # make every episode duration be 23:55 
        start_00=False, # do not ajust beginning time
        time_gap={'days': 0, 'hours': 10},
    )  
)

change trainer_config to disable learning and replay, as followed:

trainer_config = dict(
class_ref=Unreal,
kwargs=dict(
    opt_learn_rate=0,
    # opt_learn_rate=[1e-4, 1e-4], # random log-uniform  
    # opt_end_learn_rate=1e-5,
    # opt_decay_steps=50*10**6,
    model_gamma=0.99,
    model_gae_lambda=1.0,
    model_beta=0.05, # entropy reg
    rollout_length=20,
    time_flat=True, 
    use_value_replay=False, 
    model_summary_freq=10,
    episode_summary_freq=1,
    env_render_freq=2,    
)
)

When I run the script after modifications it seem that the launcher never finish working, just keeping the cycle of learning.

Again what I'm trying to achieve is to train the example model using the unreal example... done. Then use the train model on x last period (or even the whole data) and then see how well it preforms using Backtrader plotting which is very intuitive to read and to gain trading insights on the model behavior, using Backtrader Cerebro.plot() after the launcher finished running one time over the test data

JaCoderX commented 5 years ago

OK I think I got how it works. Once I found BTgymPlotter class I followed it in the code.

The launcher can run for how many epochs it wants and it would just generate new Backtrader style summaries in tensorboard (under images) for each epoch.

I really enjoy this project @Kismuz very well designed :)

Kismuz commented 5 years ago

@JacobHanouna,

Train/test routine:

see #72 and relevant part of #54;
note that setting mentioned above kwarg episode_train_test_cycle to (0, 1) results in 'test, don't train' behaviour; if you want to get entire data range backtest as single episode - set episode duration to match entire dataset test range, also set time_gap ~ episode duration;
- there is no need to use BTgymCasualDataDomain because it requires explicit setting of inner global_time variable and messing around with trials|episodes structure which is overkill if you not trying to implement some meta-learning algorithm.

Environment rendering:

indeed, summary output behaviour is controlled via trainer class configuration; actually it rather 'period' than 'frequency' meaning :-/
```
    model_summary_freq=10,
    episode_summary_freq=1,
    env_render_freq=2, 
```
note that setting those to low values results in some slow-down, thats is especially true for env_render_freq kwarg;
that's why rendering is only performed by master worker in train cluster (worker_0);
which btw means you only need single worker if you just want to test trained model with episode_train_test_cycle=[0,1]
environment rendering can be to configured to some degree; e.g. setting 'env_config' kwargs:
```
    render_size_episode=(12,16),
    render_dpi=75,
```
results in a bit bigger picture.
- see also: https://github.com/Kismuz/btgym/blob/master/examples/rendering_howto.ipynb

Kismuz / btgym

How to use a trained model? #77