chufanchen / read-paper-and-code

0 stars 0 forks source link

Random Thoughts #168

Open chufanchen opened 5 months ago

chufanchen commented 5 months ago

Cycle Consistency + Decision Transformer

Like world model, we learn a forward and back dynamic model. Then we use these DMs to constraint decision transformer training using cycle consistency constraints.

chufanchen commented 5 months ago

I'm curious about performance between World Model(MLP-based and transformer-base) and decision transformer.

chufanchen commented 5 months ago

crl: offline continual learning + online rehearsal