-
Hello,
After going through your data, I noticed that you only labeled objects that have boxes; background regions such as sky or water are not labeled.
Therefore, I am curious how your data can be used for semantic s…
-
### News
- Conferences
- [CVPR 2023](https://cvpr2023.thecvf.com/)
- Date/Venue: June 18-22, Vancouver Convention Center
- Main conference and Expo: June 20-22; Workshops and Tutorials: June 18-19
- Korean booths: L…
-
When I run
```
torchrun --nproc-per-node=8 run.py --data DocVQA_TEST --model Qwen2-VL-2B-Instruct --verbose
```
the following error occurs:
```
[{'role': 'user', 'content': [{'type': 'image', 'image': '/vlmeval/images/Doc…
-
Hi @anas-awadalla
As described in #124, "Our training took place on 32 80GB A100s. We trained on 5M samples from MMC4 and 10M from LAION 2B."
I am interested in the details of loss during trai…
-
Hello Meta GenAI team (cc @ruanslv),
With regards to the 70B model, I'm currently looking into the implementation of the GQA architecture -- specifically after noticing the 8192 x 1024 layer shapes…
-
# 💻 cs
## 📚 mask (total: 9)
### 📃 Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays
- **Authors:** Xinxu Wei, Haohan Bai, Xianshi …
-
Post your response to our challenge questions.
First, write down two intuitions you have about broad content patterns you will discover about your data as encoded within a pre-trained or fine-tuned…
-
SHOW-O unifies multimodal understanding and generation in a single Transformer architecture: it introduces a **discrete denoising process** for image generation, applies causal attention to LLM (text) tasks and full attention to image-generation tokens, and thus needs no separate specialized models (a rough sketch of such a mixed attention mask follows this list).
- On text-to-image generation it can match SD1.5 in quality, though there is still room for improvement;
- SHOW-O supports many task types, such as visual question answering, image inpainting, image extrapolation, and mixed-modality generation, with no need for…
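A minimal sketch of what such a mixed causal/full attention mask could look like, assuming a text prefix followed by image tokens; `mixed_attention_mask` is an illustrative helper, not SHOW-O's actual code:

```
import torch

def mixed_attention_mask(num_text: int, num_image: int) -> torch.Tensor:
    """Illustrative mask: text tokens attend causally; image tokens attend
    to the full text prefix and to every other image token (full attention).
    True means "allowed to attend"."""
    n = num_text + num_image
    mask = torch.zeros(n, n, dtype=torch.bool)
    # Causal attention within the text prefix.
    mask[:num_text, :num_text] = torch.ones(num_text, num_text).tril().bool()
    # Image tokens see all text tokens and all image tokens.
    mask[num_text:, :] = True
    return mask

print(mixed_attention_mask(num_text=4, num_image=3).int())
```

The printed mask shows the lower-triangular block for text and the fully dense rows for image tokens described above.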
-
Hello, I'm trying to understand how SAM works. I am interested in extracting the **image embeddings** created by **ImageEncoderViT**. Also, I'm interested in the output after combining _image embeddin…
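In case it helps frame the question, here is a minimal sketch of pulling the image embedding out via the `SamPredictor` wrapper from the official `segment_anything` package (checkpoint path and input image are placeholders): `set_image` runs `ImageEncoderViT` once, `get_image_embedding` returns the cached embedding, and `predict` then combines it with prompt embeddings in the mask decoder.

```
import cv2
import numpy as np
import torch
from segment_anything import sam_model_registry, SamPredictor

# Checkpoint and image paths are placeholders.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
sam.to("cuda" if torch.cuda.is_available() else "cpu")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)                  # runs ImageEncoderViT once

embedding = predictor.get_image_embedding() # image embedding, (1, 256, 64, 64)
print(embedding.shape)

# The mask decoder combines this embedding with prompt embeddings, e.g. a point prompt:
masks, scores, logits = predictor.predict(
    point_coords=np.array([[256, 256]]),    # example point, placeholder coordinates
    point_labels=np.array([1]),             # 1 = foreground point
    multimask_output=True,
)
```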
-
Post your questions here about: [“Language Learning with Large Language Models”](https://docs.google.com/document/d/1vCRoU_g9yYwG31uZMdAVK8iNL5Jj8BB4iwcvarTq06E/edit?usp=sharing) and “Digital Doubles …