long8v / PTIR

Paper Today I Read

[68] Iterative Scene Graph Generation #75

Open long8v opened 1 year ago

long8v commented 1 year ago
image

paper

TL;DR

Details

image

Architecture

image

Conditional Positional Encodings

image

Conditional Queries

image

Result

image

Harmonic Recall (hR) is an evaluation metric proposed by the authors; it combines recall (R) and mean recall (mR). They did not evaluate AP at all. Bold move!
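A minimal sketch of how such a combined metric could be computed, assuming hR is the plain harmonic mean of R@K and mR@K (the name suggests this, but these notes do not spell out the formula). The function name `harmonic_recall` and the input numbers are hypothetical and are not results from the paper.

```python
def harmonic_recall(recall: float, mean_recall: float) -> float:
    """Harmonic mean of recall and mean recall (assumed definition of hR)."""
    total = recall + mean_recall
    if total == 0:
        # Both inputs are zero, so the harmonic mean is defined as zero here.
        return 0.0
    return 2 * recall * mean_recall / total


# Illustrative values only: a model with high R but low mR is penalized,
# because the harmonic mean is pulled toward the smaller of the two.
balanced = harmonic_recall(0.5, 0.5)
skewed = harmonic_recall(0.9, 0.1)
print(balanced, skewed)
```

Under this assumption the metric rewards models that do well on both frequent predicates (which dominate R) and rare predicates (which weigh equally in mR), rather than on only one of them.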

bipartite matching

The ground-truth relations are padded with "no relation" entries, and the matching is defined as the graph (assignment) that minimizes the total joint matching cost. (Is this really necessary? Hmm...)

image image
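The padding-and-matching step can be sketched as follows. This is a toy illustration, not the paper's implementation: it brute-forces all assignments instead of using the Hungarian algorithm, and the cost `1 - p(label)` over predicate labels only is a stand-in for the joint matching cost. `NO_RELATION`, `pad_ground_truth`, `match`, and `toy_cost` are all hypothetical names.

```python
from itertools import permutations

NO_RELATION = "no_relation"  # hypothetical label used for padding


def pad_ground_truth(gt, num_predictions):
    """Pad the ground-truth list with NO_RELATION up to the number of predictions."""
    if len(gt) > num_predictions:
        raise ValueError("more ground-truth relations than predictions")
    return list(gt) + [NO_RELATION] * (num_predictions - len(gt))


def toy_cost(pred_probs, label):
    """Stand-in cost: low when the prediction puts high probability on the label."""
    return 1.0 - pred_probs.get(label, 0.0)


def match(predictions, gt, pair_cost=toy_cost):
    """Return the one-to-one assignment with the minimum total cost (brute force)."""
    padded = pad_ground_truth(gt, len(predictions))
    n = len(predictions)
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n)):
        # perm[i] is the index of the padded ground truth assigned to prediction i.
        cost = sum(pair_cost(predictions[i], padded[perm[i]]) for i in range(n))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    assignment = [(i, padded[j]) for i, j in enumerate(best_perm)]
    return assignment, best_cost


# Three predictions (predicate probabilities) and two ground-truth relations.
predictions = [
    {"on": 0.7, "holding": 0.2, NO_RELATION: 0.1},
    {"on": 0.1, "holding": 0.1, NO_RELATION: 0.8},
    {"on": 0.2, "holding": 0.6, NO_RELATION: 0.2},
]
assignment, total_cost = match(predictions, ["holding", "on"])
print(assignment, total_cost)
```

Brute force is only practical for a handful of queries. With a realistic number of queries, a polynomial-time solver such as the Hungarian algorithm would be the usual choice.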

Our loss!

image

Implementation Details

Ablation

number of queries

image

Increasing num_queries makes less of a difference than I expected.

Effect of refinement

image