long8v / PTIR

Paper Today I Read
19 stars 0 forks source link

[167] Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation #186

Open long8v opened 1 month ago

long8v commented 1 month ago
image

paper, code, dataset

TL;DR

Details

image

annotation

image

Pick-a-Pic Dataset

PickScore

$s$ : score $x$ : prompt $y_1, y_2$: image

in-batch negative도 해봤는데 별로 성능이 안좋았다고 함 trainingdms 4000 step, lr 3e-6, bs 128, warmup 500 step 8 A100으로 1시간도 안걸렸다고 함.

Result

long8v commented 1 month ago
image

preference가 alignment에 많이 맞혀있는듯