Beckschen / LLaVolta

Efficient Multi-modal Models via Stage-wise Visual Context Compression
Apache License 2.0
28 stars, 2 forks