-
I admire your work and am very interested in following up on it. Will you make the pre-training code and training dataset public?
-
**Describe the bug**
An image in the PDF was extracted as several separate sub-image files instead of the single figure it should be.
**Files**
[mattergen.pdf](https://github.com/user-attachments/files/16831945/mat…
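The report is truncated, but for illustration, here is a minimal sketch of how this symptom typically arises, assuming extraction with PyMuPDF (an assumption; the issue does not name the extractor): a figure that appears as one image on the page may be stored as several image XObjects, and dumping XObjects one by one yields multiple sub-image files.

```python
# Minimal sketch, assuming PyMuPDF is the extractor (an assumption; the issue does not say).
# One visual figure can be composed of several embedded image XObjects, so a per-XObject
# dump like this produces multiple sub-image files for what looks like a single figure.
import fitz  # PyMuPDF

doc = fitz.open("mattergen.pdf")
for page_index, page in enumerate(doc):
    images = page.get_images(full=True)
    print(f"page {page_index}: {len(images)} embedded image object(s)")
    for img_index, img in enumerate(images):
        xref = img[0]                    # XObject reference number
        info = doc.extract_image(xref)   # raw image bytes plus metadata
        with open(f"page{page_index}_img{img_index}.{info['ext']}", "wb") as f:
            f.write(info["image"])
```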
-
## Title: Efficient Multimodal Large Language Models: A Survey
## Link: https://arxiv.org/abs/2405.10739
## Summary:
Over the past year, multimodal large language models (MLLMs) have shown remarkable performance on tasks such as visual question answering, visual understanding, and reasoning. However, their large model size and the high cost of training and inference have, in both industry and academia, …
-
### What happened?
I’m experiencing an issue when using the LiteLLM proxy to communicate with the qwen-vl-plus model for multimodal interactions. When I send an image URL directly to qwen-vl-plus, it pr…
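For context, a minimal sketch of the request shape in question, using the OpenAI-compatible endpoint that the LiteLLM proxy exposes; the base URL, API key, model alias, and image URL below are placeholders, not values from the report:

```python
# Sketch only: base_url, api_key, the "qwen-vl-plus" alias, and the image URL are
# placeholders that depend on how the LiteLLM proxy is configured.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:4000", api_key="sk-placeholder")

response = client.chat.completions.create(
    model="qwen-vl-plus",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image."},
                # Image passed by URL in the OpenAI-style vision message format,
                # which the proxy translates for the underlying provider.
                {"type": "image_url", "image_url": {"url": "https://example.com/sample.png"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```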
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
None
### Reproduction
None
### Expected behavior
None
### Others
Hi, Thank you for the fantastic wo…
-
With all the growing activity and focus on multimodal models, is this library restricted to tuning text-only LLMs?
Are there plans to support tuning vision models or, more generally, multimodal models?
-
Hi,
I saved the LLaVA model in 4-bit using generate.py from:
https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/CPU/PyTorch-Models/Model/llava
model = optimize_model(model) …
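For reference, a minimal sketch of the low-bit save/load flow around optimize_model, based on my reading of the ipex-llm examples; the stand-in model, the save directory, and the exact import paths are assumptions, not values from this report:

```python
# Sketch of the ipex-llm low-bit save/load flow; the stand-in OPT model, the
# "model_4bit/" directory, and low_bit="sym_int4" are assumptions for illustration.
from transformers import AutoModelForCausalLM
from ipex_llm import optimize_model
from ipex_llm.optimize import load_low_bit

# Stand-in model so the sketch runs on its own; in the issue the model object
# comes from the linked LLaVA generate.py instead.
model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

model = optimize_model(model, low_bit="sym_int4")  # 4-bit weight-only quantization
model.save_low_bit("model_4bit/")                  # write the quantized weights to disk

# Later: rebuild the architecture, then attach the saved 4-bit weights.
model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
model = load_low_bit(model, "model_4bit/")
```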
-
# URL
- https://arxiv.org/abs/2306.17842
# Affiliations
- Lijun Yu, N/A
- Yong Cheng, N/A
- Zhiruo Wang, N/A
- Vivek Kumar, N/A
- Wolfgang Macherey, N/A
- Yanping Huang, N/A
- David A. R…
-
**What would you like to be added/modified**:
Based on existing datasets, this issue aims to build a benchmark for domain-specific large models on KubeEdge-Ianvs. Namely, it aims to help all Edge AI a…
-
Papers that don't fit somewhere else right now but may be relevant in the future:
https://huggingface.co/papers/2409.18943
https://arxiv.org/pdf/2409.16493