-
Thank you for the nice work.
While training ViCLIP, I would like to clarify my understanding of this paper.
If the vision transformer is not pre-trained, e.g. with the MAE method, then it means that it only align…
-
Just curious whether there is a plan to support ALBERT and Swin/ViT. Currently I am playing with a model for multimodal learning which involves language models like ALBERT and visual transformers like…
-
Paper
[Learning to Prompt for Vision-Language Models](https://arxiv.org/abs/2109.01134#) (a.k.a. CoOp)
**Summary**
Like CLIP, this is one of the contrastive learning methodologies for VLMs. Across 11 data…
-
## Bing
Merging Large Language Model (LLM) weights is a complex process that involves several steps and concepts. Here's a high-level overview:
- Obtain the Models: You need the full-precision model…
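The high-level steps above can be sketched in code. This is a minimal illustration of the simplest merging strategy, element-wise linear interpolation of two checkpoints with the same architecture; plain dicts of floats stand in for real full-precision state dicts, and the function name `merge_weights` is a hypothetical helper, not an API from any specific library.

```python
# Minimal sketch of linear weight merging ("model souping"), assuming both
# checkpoints share identical parameter names and shapes.
# Plain dicts of floats stand in for real state dicts here.

def merge_weights(state_a, state_b, alpha=0.5):
    """Return the element-wise interpolation: alpha * a + (1 - alpha) * b."""
    if state_a.keys() != state_b.keys():
        raise ValueError("Checkpoints must have identical parameter names")
    return {k: alpha * state_a[k] + (1 - alpha) * state_b[k] for k in state_a}

model_a = {"layer.weight": 1.0, "layer.bias": 0.0}
model_b = {"layer.weight": 3.0, "layer.bias": 2.0}
merged = merge_weights(model_a, model_b, alpha=0.5)
# merged == {"layer.weight": 2.0, "layer.bias": 1.0}
```

With real models you would apply the same interpolation tensor-by-tensor over the loaded full-precision state dicts; more elaborate schemes (task vectors, SLERP, TIES) replace the simple weighted average but follow the same per-parameter structure.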
-
## What's the problem?
Wordplay has no audio output, such as sounds, noise, music, or other audio media, other than rudimentary screen reader support via browsers and operating systems. This is a m…
-
A running issue to keep track of other tasks and datasets with biases, which may be amenable to a similar methodology.
- [Stance Detection Benchmark: How Robust Is Your Stance Detection?](https://…
-
See example output below. The example does not work - no "human input" is ever sought - and lacks any explanation of how the feature is supposed to be used, making it useless.
```
[DEBUG]: == Wor…
-
### Title of the resource
Text to Video Prompt Engineering Intensive
### Resource type
None
### Authors, editors and contributors
Emily Genatowski
### Topics (keywords)
AI, Large Language Model…
-
Hi EvelynFan,
This is interesting work for me. I want to know: if I have 101 blendshapes, can I use the blendshapes instead of the vertices during training?
-
Thank you very much for your work; it was very interesting! But I'm curious about how the prototypes are learned.
The article says that prototypes can be learned, represented as a linear layer in the code. Ho…