multimodal-datasets Search Results

979 results for multimodal-datasets


OpenGVLab/LLaMA-Adapter #10

Results on more multimodal datasets

Hi, thanks for this great work! I noticed in your paper you mentioned you're evaluating on more multimodal datasets, like VQAv2 and OKVQA. Do you have any results for those now, or any timeline for wh…

sachit-menon updated 1 year ago
1
InhwanBae/LMTrajectory #9

Image captioning clarification and pretrained model

Hello, thank you for your work! I have a few questions about it. 1. The BLIP-2 model is used to create captions of images to be used as prompts for the LMTraj-SUP model. As far as I understan…

vittoriacav updated 2 months ago
3
FlagOpen/FlagEmbedding #1181

Visualized BGE based on BAAI/bge-base-zh-v1.5

Is there a version of **Visualized BGE based on BAAI/bge-base-zh-v1.5**? And how does the performance of BAAI/bge-visualized-m3 compare with ChineseCLIP?

hoshinory updated 3 weeks ago
2
pytorch/torchtitan #650

[Multimodal] Adding OBELICS DataLoader

Hi! I’ve started developing the Multimodal DataLoader. After taking a (deep) look at this whole multimodal universe, I would like to discuss a couple of things before continuing. I’m using the [to…

TJ-Solergibert updated 1 month ago
8
csce585-mlsystems/Phishing-Detection #1

Instructions for Designing Your Experiments and Creating a M…

#### Specific Task: For this project, your main challenge is improving phishing detection by developing a real-time, multimodal system based on transformers and other features like URLs and metadata.…

pooyanjamshidi updated 2 months ago
1
modelscope/ms-swift #2514

Using multimodal datasets to train ovis1_6-gemma2-9b, an err…

Loading checkpoint shards: 0%| | 0/5 [00:00

c-x-l-w updated 2 days ago
1
Ucas-HaoranWei/GOT-OCR2.0 #121

Training on Chinese handwriting made the original model perform worse

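🙂🙏 Thanks for open-sourcing this! After training on my own data, the results actually got worse. Could you help me figure out what the problem is? Thanks in advance. **1. Training data** My data consists of single-line images, which I combined into multi-line images (2 to 10 lines, chosen at random). There are 10,000 synthesized images in total, and they are all grayscale. ![output_document_1](https://github.com/user-attachments/assets/a266c966-6476-449…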

edyang updated 1 month ago
5
tattle-made/Uli #594

Participatory Approaches to Building Datasets on Abuse

### Description: Automated approaches to abuse detection rely on annotated datasets. At least at present, unsupervised machine learning alone cannot detect abuse across languages. To fill the gap of …

tarunima updated 1 month ago
3
AkihikoWatanabe/paper_notes #1491

MM-Embed: Universal Multimodal Retrieval with Multimodal LLM…

# URL - https://arxiv.org/abs/2411.02571 # Authors - Sheng-Chieh Lin - Chankyu Lee - Mohammad Shoeybi - Jimmy Lin - Bryan Catanzaro - Wei Ping # Abstract - State-of-the-art retrieval mod…

AkihikoWatanabe updated 3 weeks ago
1
samapriya/awesome-gee-community-datasets #301

[Dataset Title/Name]: LGHAP v2 : a global gap-free aerosol o…

### Contact Details _No response_ ### Dataset description A Long-term Gap-free High-resolution Air Pollutants concentration dataset (abbreviated as LGHAP) is of great significance for environmental…

Wolf-Bigby updated 1 day ago
3
