ai4co / rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
https://rl4.co
MIT License

[BugFix] Fixing the hierarchical gating #189

Status: Closed (RoyalSkye closed this 5 months ago)

RoyalSkye commented 5 months ago

Description

Fixing the hierarchical gating implementation in MVMoE.
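
For readers unfamiliar with the term, the following is a minimal, framework-free sketch of what two-level (hierarchical) gating generally looks like. It is not the code changed in this PR and not the rl4co API: the function names (`hierarchical_gate`, `linear`, `softmax`), the weight layout, the choice of a dense fallback branch, and the top-k renormalization are all assumptions made for illustration. Experts and the dense branch are assumed to return a list of the same length as the input.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Bias-free linear map: one output score per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def hierarchical_gate(x, w_level1, w_level2, experts, dense, top_k=2):
    """Two-level gating sketch (hypothetical names, not the rl4co API).

    Level 1 chooses between the sparse expert branch (index 0) and a
    dense branch (index 1). Level 2 runs only if the sparse branch is
    chosen and mixes the top-k experts with renormalized gate weights.
    """
    p1 = softmax(linear(w_level1, x))
    if p1[1] > p1[0]:
        # Scale by the level-1 probability so the first gate contributes
        # to the output (in an autograd framework this is what would let
        # the gate receive gradients).
        return [p1[1] * y for y in dense(x)]

    p2 = softmax(linear(w_level2, x))
    # Indices of the top-k experts by level-2 gate probability.
    top = sorted(range(len(experts)), key=lambda i: p2[i], reverse=True)[:top_k]
    norm = sum(p2[i] for i in top)

    out = [0.0] * len(x)
    for i in top:
        weight = p1[0] * p2[i] / norm  # product of both gate levels
        out = [o + weight * yi for o, yi in zip(out, experts[i](x))]
    return out
```

In this sketch each expert output is weighted by the product of the two gate levels. Whether and how the first-level probability is multiplied into the output is one of the design choices such an implementation has to get right.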

Motivation and Context

With the current hierarchical gating implementation, training may become unstable for MoE-L models.
