phellonchen / X-LLM

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
https://x-llm.github.io
Apache License 2.0
304 stars 17 forks source link

I-Adapter, V-Adapter and S-Adapter #14

Open ukamath opened 9 months ago

ukamath commented 9 months ago

The paper was unclear about how the encoded modalities are adapted through these adapters. Can you explain and point to the code where I-Adapter, V-Adapter, and S-Adapter are used and information about them, like the LoRA adapters or other networks?