-
Can you specify how exactly can i test the model i.e given an image with question the model is expected to return answers with confidence.
-
Hi there, I can roughly reproduce the result of COCO, Flickr30, OKVQA, and VQAv2, but not the results on imagenet1k. the acc@1 is only 0.02, and the acc@5 is only 0.04. Is this a personal problem for …
-
## Proposed Change
Look into how we could easily create and supply errors in demo mode.
## Why Should We Prioritize?
Currently there isn't an easily written detox solution for implementing errors f…
-
Hi Joshua,
I tried to use chipQA to train and test its performance on a CSIQ-VQA database, which is a 480p resolution database.
1 - I run chipqa_yuv.py get the feature file
2 - I run python clean…
-
This thread contains the discussion of the implementation of LaTr with one of the authors of the same paper
The earlier discussion with the first author is mentioned [here](https://github.com/uakar…
-
### System Info
- `transformers` version: 4.46.0
- Platform: Linux-5.15.0-97-generic-x86_64-with-glibc2.35
- Python version: 3.12.3
- Huggingface_hub version: 0.26.1
- Safetensors version: 0.4.…
-
I generated the `submit_predict.json` and submited it to GQA evaluation server. However, I got an accuracy of 0 in test phase, but the result in dev phase makes sense. Is it possible that I predict al…
-
- [ ] [Defining AGI: Exploring Six Key Principles for an Operational Definition](https://arxiv.org/html/2311.02462v2)
# Defining AGI: Exploring Six Key Principles for an Operational Definition
## Sn…
-
## ❓ Questions and Help
您好,我使用bottom up attention(来源:https://github.com/airsplay/py-bottom-up-attention,我对它的理解是用faster rcnn在VG数据集上预训练)对整个coco2014数据集做测试,获得了gt-box和每个box对应的label,每个box用一个2048维的向量表示视觉特征,…
-
# 💻 cs
## 📚 mask (total: 9)
### 📃 Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays
- **Authors:** Xinxu Wei, Haohan Bai, Xianshi …