-
# HPT - Open Multimodal Large Language Models
[https://github.com/HyperGAI/HPT](https://github.com/HyperGAI/HPT)
[https://huggingface.co/HyperGAI/HPT](https://huggingface.co/HyperGAI/HPT)
[techni…
-
### Model/Pipeline/Scheduler description
Existing methods for facial identity transfer for diffusion denoising image generation models face challenges in achieving high fidelity and detailed identity…
-
# Interesting papers
- Yan 2024 - An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion [링크](https://omages.github.io/)
- Diffusion을 통해서 64 x 64 크기의 '부품 이미지' (Object image)…
-
Hello,
Is there a way to use tiny-cnn for multimodal network? I'm thinking of combining audio and visual for better recognition.
Thanks a lot.
Nhat
-
What is the scope of data specifications created from this initiative? Is that its intent?
- Will there multimodal data specifications that include bicycle and pedestrian data (such as the[ share…
-
The CTN is a compilation of multimodal Austin transportation infrastructure. It provides an authoritative base map to align data production, management, and analysis across the organization.
![Clo…
-
**What would you like to be added/modified**:
A benchmark suite for multimodal large language models deployed at the edge using KubeEdge-Ianvs:
1. Modify and adapt the existing edge-cloud data c…
-
My goal is to build a unique multimodal WooCommerce search experience with Vespa multivectors and an hybrid ranking on text-BM25, text-vectors, and image-vectors.
For instance, E-commerce can use:
…
-
### System Info
- RTX 4090
- x86_64 GNU/Linux
- main branch
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Ta…
-
Hi @bhaba-ranjan,
Thanks for sharing the repository, I'm currently looking to reproduce this work. Are the default hyperparameters that are set in the multimodal.py file enough to reproduce your mo…