vision-language-learning Search Results

1000+ results
for vision-language-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ymLeiFDU/CLIP-Lung #3

A question about network architecture

Hello,I would like to ask why you use ViT-B/16 as a text encoder. Why not use NLP models as a text encoder? Thank you very much.

Dijkstra111111 updated 3 months ago
1
kuisailab/ai-media-group #2

complete the missing "Research Highlights" and "Publications…

- [x] Computer Vision Research Highlights - [ ] Computer Vision Publications - [ ] Computational Biology and Medicine Research Highlights - [ ] Computational Biology and Medicine Publications - [x…

denizyuret updated 3 years ago
2
changh95/WeeklySpatialAI #1

2024.07.24 - #1 - FutureMapping, GLIM, DeepSLAM, Co-RAL, SOL…

# Interesting papers - [Davison 2018 - FutureMapping: The Computational Structure of Spatial AI Systems](https://arxiv.org/abs/1803.11288) - Imperial College London의 Dyson Robotics Lab 교수님이신 A…

changh95 updated 1 week ago
5
gauravpandeyDL/Feature-List #17

Healthcare Capabilities

#### **Healthcare Capabilities in AI** --- **1. AI Model Development** - **Capabilities:** - Crafting bespoke AI models tailored for healthcare applications. - Leveraging dee…

gauravpandeydigilantern updated 1 month ago
5
ReScience/ReScience #27

Reviewer application

If you want to become a reviewer for ReScience, please post your information here. The format is: ``` [name](github account link) Scientific expertise - Language expertise ORCID: [xxxx](http…

rougier updated 1 week ago
163
yyf17/awesome-embodied-intelligent #1

SoundSpace

# [sound-spaces](https://github.com/facebookresearch/sound-spaces) [Project: RLR-Audio-Propagation](https://github.com/facebookresearch/rlr-audio-propagation) [Audio Sensor](https://github.com/f…

yyf17 updated 2 years ago
1
denoland/deno #21610

Implement WebNN (Web Neural Network) API

Draft Spec: https://www.w3.org/TR/webnn/ From the spec: > At the heart of neural networks is a computational graph of mathematical operations. These operations are the building blocks of modern ma…

littledivy updated 3 days ago
4
coderplex-org/resources #3

AI Resources Structure

The current resources list aren't structured, I'm proposing the following structure for learning ai: sections: - Mathematics for AI/ML - Machine Learning …

VineethKanaparthi updated 6 years ago
1
e4exp/paper_manager_abstract #343

Seeing Out of tHe bOx: End-to-End Pre-training for Vision-La…

- https://arxiv.org/abs/2104.03135 - CVPR 2021 本研究では、畳み込みニューラルネットワーク（CNN）とトランスフォーマー（Transformer）の共同学習により、何百万もの画像とテキストのペアからクロスモーダルな位置合わせを学習することを目的とした視覚言語事前学習（VLPT）を研究しています。従来の手法では、画像の顕著な領域を抽出し、その…

e4exp updated 3 years ago
6
salesforce/LAVIS #237

[BLIP2] How to perform stage 1 Vision-Language Representati…

I think that stage 1 learning, that means visual-language representation learning with those three objectives mentioned in the article is not yet implemented. Am I right? Not implemented `load_pre…

klima7 updated 1 year ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for vision-language-learning

1000+ results
for vision-language-learning