Compared with the full environment refactor, this base refactor only touches:
Environment structure;
Documentation updates;
Supporting generator;
And as a base refactor, this PR keeps:
All data generation processes (size, default range, default distribution) unmodified;
All step, reward calculation, and other logic unmodified;
Overall, this base version PR only moves code and, as a consequence, does not change the logic. In future versions, we will refactor the environments step by step, guided by the full refactor version.
Description
Please refer to the full environment refactor PR: https://github.com/ai4co/rl4co/pull/166.
Motivation and Context
Please refer to the full environment refactor PR: https://github.com/ai4co/rl4co/pull/166.
Types of changes
For more details, please refer to the full environment refactor PR: https://github.com/ai4co/rl4co/pull/166.
Checklist