multimodal-large-language-models Search Results

344 results
for multimodal-large-language-models


nariaki3551/library #132

MM-LLMs: Recent Advances in MultiModal Large Language Models

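- year: - journal: - url: - google scholar: - scispace: - cited: (day-month-year) ### Background ### What is it? ### What makes it better than prior work? ### What is the key to the technique or method? ### How was it shown to be effective? ### Is there any discussion? ### Next to read…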

nariaki3551 updated 1 month ago
1
paperswithlove/papers-we-read #15

HPT - Open Multimodal Large Language Models

# HPT - Open Multimodal Large Language Models [https://github.com/HyperGAI/HPT](https://github.com/HyperGAI/HPT) [https://huggingface.co/HyperGAI/HPT](https://huggingface.co/HyperGAI/HPT) [techni…

runhani updated 3 months ago
2
joaomdmoura/crewAI #464

Human Input in Agent Execution documentation: unintelligible…

See example output below. The example does not work - no "human input" is ever sought - and lacks any explanation of how the feature is supposed to be used, making it useless. ``` [DEBUG]: == Wor…

francisjervis updated 1 month ago
3
quic/ai-hub-models #58

[MODEL REQUEST] Kosmos-2.5

Kosmos-2.5 is a relatively small (1.37B params), generative model for machine reading of text-intensive images. **Details of model being requested** - Model name: Kosmos-2.5 - Source repo link: …

EwoutH updated 1 week ago
1
ollama/ollama #4257

Support for InternVL-Chat-V1.5

https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5 We introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary…

wwjCMP updated 2 weeks ago
2
arXiv/html_feedback #832

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal L…

### Description The layout of multiple subfigures in the paper is rendered incorrectly, with images in different sizes (that is, not following the subfigure width and height settings). ### (Optiona…

shanface33 updated 3 months ago
2
TaskingAI/TaskingAI #75

Feature Request: Integration of Multimodal LLMs

I want to suggest a significant enhancement that could vastly expand the capabilities of TaskingAI - the integration of multimodal Large Language Models (LLMs), particularly those akin to GPT-4V, whic…

CaseyJordan897 updated 1 month ago
1
InternLM/lmdeploy #1958

[Bug] After starting internvl-v1-5 with lmdeploy serve, output always runs to the maximum length

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. ### Describe the bug 1. The session length is inconsistent,…

sunzx8 updated 5 hours ago
4
pytorch/pytorch #118069

Quantising a multimodal large language PyTorch model to run …

### 🚀 The feature, motivation and pitch I'm working on a PoC that tries to extract as much information from an image as possible. Currently, this capability is only supported on servers/computers wit…

magicianfromriga updated 4 weeks ago
1
ledge8/KI-Workshop #59

Add source

https://heise.de/-9722941

ledge8 updated 3 weeks ago
12
