cross-modal-retrieval Search Results

235 results
for cross-modal-retrieval

Best match

FlagOpen/FlagEmbedding #659

bge-visual produces different embedding values on every run

Using bge-m3: candi_emb_1 = model.encode(text="The Mid-Hudson Bridge, spanning the Hudson River between Poughkeepsie and Highland.", image="./imgs/wiki_candi_1.jpg")

charliedream1 updated 7 months ago
6
e4exp/paper_manager_abstract #287

Perspectives and Prospects on Transformer Architecture for C…

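- https://arxiv.org/abs/2103.04037 - 2021 The Transformer architecture brought a fundamental change to computational linguistics, a field that recurrent neural networks had dominated for many years. Its success has also dramatically changed cross-modal tasks involving language and vision, and many researchers are already working on this problem. In this paper, the most important mile…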

e4exp updated 3 years ago
7
postech-cv-multimodal/fire #1

09/22 Meeting Minutes

## FIRE (Fine-grained Image-text Retrieval with Explicit focus on semantic objects) - Existing baseline methods use image/text representations that focus on semantic objects only implicitly - Adding extra modules to the image/text to E…

aqaqsubin updated 10 months ago
1
fly51fly/aicoco #3

Teacher 爱可可's 24-hour trending shares

Selected Weibo posts

fly51fly updated 3 weeks ago
1906
long8v/PTIR #165

🍅 짭짤이 (salty tomato) paper collection (CLIP)

A place to collect papers I have mostly only skimmed. I had been organizing them in Notion, but moved them here because adding links there was difficult.

long8v updated 3 months ago
18
ArrowLuo/CLIP4Clip #72

How to download "cross_pytorch_model.bin" as pretrained weig…

1. Using the parameters given in the README, training does not converge. In the log I see: 1. 05/20/2022 00:18:31 - INFO - Weight doesn't exsits. xxx/modules/cross-base/cross_pytorch_model.bin 2. 05/20/2022 00:18:42 - INFO - Weights from pretrained model…

AAUfoa updated 8 months ago
12
meta-introspector/meta-meme #79

kwality

jmikedupont2 updated 1 year ago
62
LinWeizheDragon/FLMR #16

question about details of finetuning script

hi lin, i managed to write a finetuning script, could you help me check it? i also got confused about some details, listed below (also marked with NOTE in code comments), could you illustrate somehow? …

Maxlinn updated 4 months ago
9
KevinLight831/CTP #1

Questions about paper

Thanks for sharing great work and dataset! I have two questions about the paper. First of all, I think that the authors mainly follow the losses and architecture of ALBEF. But CTP does not use the ITM loss…

jaeseokbyun updated 1 year ago
4
data-liberation/data-liberation-resources #2

PDF data extraction

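Table detection > which regions are tables and which are not (i.e., text or charts). Table structure recognition > which parts are the table name, caption, header, rows and columns, and cell grid structure. Table data semantic extraction > table interpretation: rediscovering the meaning of the tabular structure. This includes: (a) functional analysis: deter…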

wanghaisheng updated 4 years ago
49
