UoA-CARES/cares_reinforcement_learning
CARES Reinforcement Learning Package
11 stars
2 forks
issues
Rename PrioritizedMemoryBuffer to MemoryBuffer
#172
retinfai
closed
5 months ago
0
Rename PrioritizedReplayBuffer -> MemoryBuffer
#171
beardyFace
closed
5 months ago
0
Load model function
#170
rainingx683
closed
4 months ago
3
Plotter Improvements
#169
beardyFace
closed
5 months ago
0
Tidy plotter.py
#168
beardyFace
closed
5 months ago
0
Dev/winter clean
#167
beardyFace
closed
6 months ago
4
Winter Clean
#166
beardyFace
closed
6 months ago
0
Generalise Actor/Critic with MLP in common.py
#165
beardyFace
opened
6 months ago
2
Fix to LA3P methods
#164
beardyFace
closed
6 months ago
1
TQC CPU Crash
#163
rainingx683
opened
6 months ago
0
Fix weights tensor conversion
#162
retinfai
closed
6 months ago
1
AE Encoders TD3/SAC
#161
beardyFace
closed
6 months ago
1
Shift common.py into networks folder?
#160
beardyFace
closed
6 months ago
0
Bug - LA3PSAC
#159
beardyFace
closed
6 months ago
0
Bug - MAPERTD3/SAC not learning
#158
beardyFace
closed
6 months ago
0
Resolve fusion_variance issue
#157
dvalenciar
closed
6 months ago
0
Start_video now logs first frame
#156
kvan910
closed
7 months ago
0
Refactor/sac alpha lr
#155
qiaoting159753
closed
7 months ago
0
add LAPSAC algo
#154
h-yamani
closed
7 months ago
0
Priority Based SAC Algorithms
#153
h-yamani
closed
6 months ago
0
In the original MAPERTD3 algorithm, the parameter alpha is sent to memory; we do not utilize it either.
#152
h-yamani
closed
6 months ago
1
Updated names of predicted and real rewards
#151
h-yamani
closed
7 months ago
1
Fall Clean
#150
beardyFace
closed
7 months ago
0
TQC
#149
beardyFace
closed
7 months ago
0
Plotter -p option
#148
beardyFace
closed
7 months ago
0
Training to Algorithm Configuration Refactor
#147
beardyFace
closed
7 months ago
0
REDQ
#146
beardyFace
closed
7 months ago
0
PALTD3
#145
beardyFace
closed
7 months ago
0
Memory Sample Consecutive
#144
beardyFace
closed
7 months ago
0
MAPERTD3
#143
beardyFace
closed
7 months ago
1
Update - checkpoint_frequency
#142
rainingx683
closed
1 week ago
2
Implement - save training state for pause and restart training
#141
rainingx683
opened
7 months ago
0
Sample into Algorithm
#140
beardyFace
closed
7 months ago
0
minor errors with weight mse fixed
#139
beardyFace
closed
7 months ago
0
LA3PTD3
#138
beardyFace
closed
7 months ago
1
Added LAPTD3
#137
beardyFace
closed
7 months ago
1
Docs/add examples directory
#136
retinfai
opened
7 months ago
0
Implement - REDQ
#135
beardyFace
closed
7 months ago
0
Feature/per rd td3
#134
beardyFace
closed
7 months ago
1
merged with main
#133
beardyFace
closed
8 months ago
0
Feature/agent update statistics
#132
qiaoting159753
closed
8 months ago
0
Add an extra argument in TrainingConfig for training the world model
#131
qiaoting159753
closed
8 months ago
0
DYNA adapt to the new memory buffer, which has a new sample and sample_consecutive function.
#130
qiaoting159753
closed
7 months ago
7
train reward and dynamic prediction together
#129
qiaoting159753
closed
8 months ago
0
TypeError create memory
#128
qiaoting159753
closed
8 months ago
0
Fix a figure resize issue
#127
qiaoting159753
closed
8 months ago
0
batch size to 256
#126
beardyFace
closed
8 months ago
0
Algorithm/dueling td3
#125
beardyFace
closed
8 months ago
0
Feature/normalization for mbrl
#124
qiaoting159753
closed
8 months ago
0
Add number steps per train policy config
#123
retinfai
closed
8 months ago
0