thomashopkins32 / Minecraft-Virtual-Intelligence

MIT License
0 stars 0 forks source link

Dual learning cycles #17

Open thomashopkins32 opened 2 months ago

thomashopkins32 commented 2 months ago
  1. Online Learning Cycle (every step):

    • Pros: Rapid adaptation to new experiences, potentially faster learning in dynamic environments.
    • Cons: Higher computational overhead, potential for instability.
  2. Offline Learning Cycle (every n steps):

    • Pros: More stable learning, better utilization of past experiences, opportunity for more thorough optimization.
    • Cons: Delayed adaptation to new experiences.

This approach makes sense for several reasons:

Implementation considerations: