vision-language-pretraining Search Results

189 results
for vision-language-pretraining


haotian-liu/LLaVA #943

[Question] Finetune with chinese-clip

### Question ## motivation: I am trying to use chinese-clip in place of clip. ## environment ```bash $ uname -a Linux localhost.localdomain 3.10.0-1160.80.1.el7.x86_64 #1 SMP Tue Nov 8 15:48:59 UTC…

chenchun0629 updated 6 months ago
6
bigshanedogg/survey #23

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

## Problem statement 1. performance bottleneck in knowledge-based VQA due to a two-phase architecture consisting of knowledge retrieval from external sources and training question answering task in super…

bigshanedogg updated 2 years ago
1
LLaVA-VL/LLaVA-NeXT #218

Where can I get the pretrained model for finetuning of LLaVA…

Hi, I am trying to finetune LLaVA-NeXT with my custom dataset, using the "finetune_clip.sh" shell file. I made some edits to the script for my convenience and to suit my task so far, like this: ``` …

Bleking updated 7 hours ago
44
LLaVA-VL/LLaVA-NeXT #79

LLaVa-NeXT-Video is added to 🤗 Transformers!

Hey all! The video models are all supported in Transformers now and will be part of the v4.42 release. Feel free to check out the model checkpoints [here](https://huggingface.co/collections/llava-h…

zucchini-nlp updated 1 month ago
28
Lareina2441/LLaVA-Med #1

The author's monologue...

UserWarning: Failed to initialize NumPy: _ARRAY_API not found (Triggered internally at ../torch/csrc/utils/tensor_numpy.cpp:84.) device: torch.device = torch.device("cpu"), Models: ['llavamed']

Lareina2441 updated 4 days ago
35
UChicago-Thinking-Deep-Learning-Course/Readings-Responses #15

Week 9 - Possibility Readings

Post a reading of your own that uses deep learning for social science analysis and understanding, with a focus on Solving Problems & Creating Digital Doubles - in this case, we want you to look for ex…

bhargavvader updated 3 years ago
10
LAION-AI/Open-Assistant #3144

Curate SFT-9 dataset mixes

Iterate on the SFT-8 dataset mixes to create pretraining and final SFT mixes for SFT-9. This requires investigating the quality and usefulness of the datasets. Community input welcome below. See the `…

olliestanley updated 1 year ago
10
scene-verse/SceneVerse #14

Details of point cloud alignment and how to bring in custom …

Hi authors, Great work! Can you please share more details about the pointcloud alignment mentioned in your paper below: > To ensure cohesion across various sources, we conduct preprocessing ste…

zubair-irshad updated 2 months ago
5
masakhane-io/masakhane-reading-group #1

Papers Voting

In this issue you can either: - Add papers that you think are interesting to read and discuss (please stick to the format). - vote: should be done using :+1: on comments Example: https://githu…

jaderabbit updated 4 years ago
22
eg-nlp-community/nlp-reading-group #1

Papers voting

In this issue you can either: - **Add papers** that you think are interesting to read and discuss (please stick to the format). - **vote**: should be done using :+1: on comments

hadyelsahar updated 4 years ago
77
