Closed OmaymaMahjoub closed 4 months ago
Implement Sebulba architecture with feedforward IPPO on Rware.
Integrate Sebulba's architecture due to its effectiveness in scenarios involving non-jitted/non-jax environments.
Enhance the existing Cleanba code to support marl algorithm and ensure compatibility with Mava's key components, including logger, evaluator, etc.
Before reviewing this PR, we need to merge #976 and #977 The early draft of sebulba ff-ippo can be found in #959 Action Item:
@Louay-Ben-nessir is working on maintaining this PR in a new one that will be added soon :)
What?
Implement Sebulba architecture with feedforward IPPO on Rware.
Why?
Integrate Sebulba's architecture due to its effectiveness in scenarios involving non-jitted/non-jax environments.
How?
Enhance the existing Cleanba code to support marl algorithm and ensure compatibility with Mava's key components, including logger, evaluator, etc.
Extra
Before reviewing this PR, we need to merge #976 and #977 The early draft of sebulba ff-ippo can be found in #959 Action Item: