h-zhao1997 / cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference
MIT License
259 stars 8 forks source link

fix unbalanced loss for gradient accumulation #19

Closed h-zhao1997 closed 3 months ago