easy-vqa Search Results

135 results for easy-vqa

PKU-YuanGroup/MoE-LLaVA #39

[Discussion] Implementation of Qwen1.5 for the project

### Discussion First, I wish you a nice Chinese New Year. I am currently catching up with your progress in integrating Qwen1.5 into this project. Since Qwen1.5 shares a similar struc…

cydiachen updated 9 months ago
19
ManifoldRG/NEKO_Archive #1

Dataset Availability Analysis

The datasets provided in the original GATO paper are varied and numerous. We need a preliminary analysis of what data is available, what data has equivalents, and what data is not clearly source ab…

harshsikka updated 1 year ago
5
haotian-liu/LLaVA #593

[Question] GIF files in OCR_VQA dataset

### Question I noticed during the fine-tuning phase that in the OCR_VQA dataset, there are many GIF files, but they have all been changed to JPG in the JSON. Can these files be directly modified by c…

July-zh updated 1 year ago
6
AI-metrics/AI-metrics #62

Add augmentations used by the image recognition methods

To make comparing different image recognition methods easier, it would help if the tables and charts included the augmentations used by the papers. Image recognition can be made easier by augm…

tarvaina updated 1 year ago
1
gradio-app/gradio #3383

Demo crushes given many images + text in unwrapped only when…

### Describe the bug Hi. I have the following app in huggingface: [link](https://huggingface.co/spaces/nlphuji/whoops_explorer). It's a 25 rows * 4 columns of images. I have 500 images, but when …

yonatanbitton updated 9 months ago
13
fly51fly/aicoco #3

Teacher Aikeke's (爱可可老师) 24-hour trending shares

Selected Weibo content

fly51fly updated 1 month ago
1906
flathub/org.prismlauncher.PrismLauncher #19

ReplayMod can't render because ffmpeg doesn't support rawvid…

In the Prism Launcher flatpak, it seems like ffmpeg doesn't support format `rawvideo`: ``` [april@tadaima ~]$ flatpak run --command="ffmpeg" org.prismlauncher.PrismLauncher -formats [...] File f…

april83c updated 8 months ago
7
kohjingyu/gill #17

Multimodal generation in one pass

Hi, thanks for sharing this awesome work. As I was trying your system more and more, a few questions popped up in my mind: 1. In my experience, I am seeing instances where the LLM starts generat…

avipartho updated 1 year ago
5
tattle-made/kosh-v2 #72

Claim extraction from Images

The various challenges involved in making sense of an image found on social media are summarized by this image ![Screenshot 2023-12-04 at 15-13-05 Tech Interventions against Online Harms](https://gith…

dennyabrain updated 10 months ago
20
guilk/VLC #7

VQA trained model weight available?

Hi, could you please release the trained model weights for VQA? Currently, the links for VQA in the pre-trained models section point to the JSON file instead of the ckpt file for the model weights. Than…

LiuJoffrey updated 1 year ago
1
