-
Nice work! I can't wait to try it, and I wonder when the code will be released.
By the way, I don't know if you know this paper, "BOOSTING MULTIMODAL LARGE LANGUAGE MODELS WITH
VISUAL TOKENS WITH…
-
Hello! Could you please add SALMONN series models?
Title | Venue | Date | Code | Demo
-- | -- | -- | -- | --
[SALMONN: Towards Generic Hearing Abilities for Large Language Models](https://arxiv.o…
-
- [ ] [LLM-Agents-Papers/README.md at main · AGI-Edgerunners/LLM-Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers/blob/main/README.md?plain=1)
# LLM-Agents-Papers
## :writing_hand…
-
Original Repository: https://github.com/ml-explore/mlx-examples/
This issue lists examples from there that would be nice to have. We don't expect the models to work the moment they are translated to …
-
- [ ] [system-2-research/README.md at main · open-thought/system-2-research](https://github.com/open-thought/system-2-research/blob/main/README.md?plain=1)
# OpenThought - System 2 Research Links
He…
-
# HPT - Open Multimodal Large Language Models
[https://github.com/HyperGAI/HPT](https://github.com/HyperGAI/HPT)
[https://huggingface.co/HyperGAI/HPT](https://huggingface.co/HyperGAI/HPT)
[techni…
-
May I ask which attack method “QR” refers to?
-
**What would you like to be added/modified**:
A benchmark suite for multimodal large language models deployed at the edge using KubeEdge-Ianvs:
1. Modify and adapt the existing edge-cloud data c…
-
Dear Authors,
We'd like to add "GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning" to this repository, which has been accepted by NeurIPS 2024. [**Paper**](https:/…
-
Hi,
Thanks for your efforts on such a valuable collection!
Could you please add the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate"?
M…