-
Hi, thanks for this great work! I noticed in your paper you mentioned you're evaluating on more multimodal datasets, like VQAv2 and OKVQA. Do you have any results for those now, or any timeline for wh…
-
Hello, thank you for your work!
I have few questions about your work.
1. The BLIP-2 model is used to create captions of images to be used as prompts for the LMTraj-SUP model. As far as I understan…
-
Is there any versions for the model of **Visualized BGE based on BAAI/bge-base-zh-v1.5**?And how does the BAAI/bge-visualized-m3 performance compared with ChineseCLIP?
-
Hi!
I’ve started developing the Multimodal DataLoader. After taking a (deep) look at this whole multimodal universe, I would like to discuss a couple of things before continuing. I’m using the [to…
-
#### Specific Task:
For this project, your main challenge is improving phishing detection by developing a real-time, multimodal system based on transformers and other features like URLs and metadata.…
-
Loading checkpoint shards: 0%| | 0/5 [00:00
-
🙂🙏 感谢开源!
我用自己的数据训练之后效果还差了,帮忙看看什么问题呢,感谢先。
**1. 训练数据**
我的数据是一行一行的图片,然后合成了一张,多行(2~10行随机),共有1万张合成图片,图片是灰度图。
![output_document_1](https://github.com/user-attachments/assets/a266c966-6476-449…
-
### Description:
Automated approaches to abuse detection rely on annotated datasets. At least at present, unsupervised machine learning alone cannot detect abuse across languages. To fill the gap of …
-
# URL
- https://arxiv.org/abs/2411.02571
# Authors
- Sheng-Chieh Lin
- Chankyu Lee
- Mohammad Shoeybi
- Jimmy Lin
- Bryan Catanzaro
- Wei Ping
# Abstract
- State-of-the-art retrieval mod…
-
### Contact Details
_No response_
### Dataset description
A Long-term Gap-free High-resolution Air Pollutants concentration dataset (abbreviated as LGHAP) is of great significance for environmental…