issues
search
UoA-CARES
/
cares_reinforcement_learning
CARES Reinforcement Learning Package
9
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Parameter overrides for default Hidden Size
#180
beardyFace
opened
3 hours ago
1
Hidden layer size
#179
rainingx683
opened
22 hours ago
0
Save the memory buffer
#178
PKWadsy
opened
1 week ago
0
Fix TD3 actor hidden size
#177
rainingx683
closed
1 week ago
0
resolved #109
#176
beardyFace
closed
3 weeks ago
0
Bug/xcb opencv
#175
beardyFace
closed
3 weeks ago
0
Variations of AE
#174
beardyFace
opened
1 month ago
0
Variations of AEs
#173
beardyFace
opened
1 month ago
0
Rename PrioritizedMemoryBuffer to MemoryBuffer
#172
retinfai
closed
1 month ago
0
Rename PrioritizedReplayBuffer -> MemoryBuffer
#171
beardyFace
closed
1 month ago
0
Load model function
#170
rainingx683
opened
1 month ago
2
Plotter Improvements
#169
beardyFace
closed
1 month ago
0
Tidy plotter.py
#168
beardyFace
closed
1 month ago
0
Dev/winter clean
#167
beardyFace
closed
1 month ago
4
Winter Clean
#166
beardyFace
closed
1 month ago
0
Generalise Actor/Critic with MLP in common.py
#165
beardyFace
opened
1 month ago
1
Fix to LA3P methods
#164
beardyFace
closed
1 month ago
1
TQC CPU Crash
#163
rainingx683
opened
1 month ago
0
Fix weights tensor conversion
#162
retinfai
closed
2 months ago
1
AE Encoders TD3/SAC
#161
beardyFace
closed
1 month ago
1
Shift common.py into networks folder?
#160
beardyFace
closed
1 month ago
0
Bug - LA3PSAC
#159
beardyFace
closed
1 month ago
0
Bug - MAPERTD3/SAC not learning
#158
beardyFace
closed
2 months ago
0
Resolve fusion_variance issue
#157
dvalenciar
closed
2 months ago
0
Start_video now logs first frame
#156
kvan910
closed
2 months ago
0
Refactor/sac alpha lr
#155
qiaoting159753
closed
2 months ago
0
add LAPSAC algo
#154
h-yamani
closed
2 months ago
0
Priority Based SAC Algorithms
#153
h-yamani
closed
2 months ago
0
In the original MAPERTD3 algorithm, the parameter alpha is sent to memory, we do not utilize it, too.
#152
h-yamani
closed
2 months ago
1
Updated names of predicted and real rewards
#151
h-yamani
closed
2 months ago
1
Fall Clean
#150
beardyFace
closed
2 months ago
0
TQC
#149
beardyFace
closed
2 months ago
0
Plotter -p option
#148
beardyFace
closed
2 months ago
0
Training to Algorithm Configuration Refactor
#147
beardyFace
closed
2 months ago
0
REDQ
#146
beardyFace
closed
2 months ago
0
PALTD3
#145
beardyFace
closed
2 months ago
0
Memory Sample Consecutive
#144
beardyFace
closed
2 months ago
0
MAPERTD3
#143
beardyFace
closed
3 months ago
1
Update - checkpoint_frequency
#142
rainingx683
opened
3 months ago
1
Implement - save training state for pause and restart training
#141
rainingx683
opened
3 months ago
0
Sample into Algorithm
#140
beardyFace
closed
3 months ago
0
minor errors with weight mse fixed
#139
beardyFace
closed
3 months ago
0
LA3PTD3
#138
beardyFace
closed
3 months ago
1
Added LAPTD3
#137
beardyFace
closed
3 months ago
1
Docs/add examples directory
#136
retinfai
opened
3 months ago
0
Implement - REDQ
#135
beardyFace
closed
2 months ago
0
Feature/per rd td3
#134
beardyFace
closed
3 months ago
1
merged with main
#133
beardyFace
closed
3 months ago
0
Feature/agent update statistics
#132
qiaoting159753
closed
3 months ago
0
Add an extra argument in TrainingConfig for training the world model
#131
qiaoting159753
closed
3 months ago
0
Next