multimodal-large-language-models Search Results

443 results
for multimodal-large-language-models


pytorch/pytorch #77764

General MPS op coverage tracking issue

### This issue is to have a centralized place to list and track work on adding support to new ops for the MPS backend. [**PyTorch MPS Ops Project**](https://github.com/users/kulinseth/projects/1/vi…

albanD updated 15 hours ago
1549
BradyFU/Awesome-Multimodal-Large-Language-Models #140

Please update the content

https://github.com/baaivision/Emu Only the paper of Emu1 is listed. I think the Emu2 paper should be listed as well. And the repo should refer to Emu1 and Emu2 directly. https://github.com/BradyFU…

SetoKaiba updated 7 months ago
1
fly51fly/aicoco #3

爱可可's 24-hour trending shares

Selected Weibo content

fly51fly updated 3 weeks ago
1906
Vasyanator/google_translate_plus #1

load failure

Using [camenduru/text-generation-webui-colab](https://github.com/camenduru/text-generation-webui-colab), I used the following code to download the extension: !aria2c --console-log-level=error -c -x 16 -s…

flotufox updated 1 week ago
2
zjunlp/EasyEdit #243

About OK-VQA dataset

The paper ‘Can We Edit Multimodal Large Language Models’ says that the accuracies of the base models (blip2, minigpt4) on OK-VQA are all 100, and I'm a little confused. Does the pretrained model have such s…

yaohui120 updated 6 months ago
2
soohoonc/llms #6

Add section on future of llms

We want to see where the field is headed!

soohoonc updated 5 months ago
2
iburenko/multimodal-reading-group #2

[Paper Suggestion] Eyes Wide Shut? Exploring the Visual Shor…

Is vision good enough for language? Recent advancements in multimodal models primarily stem from the powerful reasoning abilities of large language models (LLMs). However, the visual component typical…

lbuess updated 6 months ago
4
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation #1

Consider adding our work into your great repo.

Hi! This is a great repo to record the LLM-based generation/editing. Would you like to add our CVPR work to your repo and survey? Towards Language-Driven Video Inpainting via Multimodal Large …

lxtGH updated 6 months ago
1
w3c/webrtc-charter #84

Representing Meetings' Transcripts and Minutes

## Introduction Hello. I would like to propose that exploring representations of meetings' transcripts and minutes be in-scope for the WebRTC Working Group. This work item would involve designing a…

AdamSobieski updated 3 months ago
1
open-webui/open-webui #1589

Support multimodal models like vision

**Is your feature request related to a problem? Please describe.** I wish to integrate multimodal models. **Describe the solution you'd like** Support models like NousResearch/Obsidian-3B-V…

fire updated 6 months ago
1
