gwleeee / PaperReview


Human Pose as Compositional Tokens #10

Open gwleeee opened 1 year ago



https://github.com/gengzigang/pct

https://arxiv.org/abs/2303.11638

https://sites.google.com/view/pctpose

CVPR 2023

Abstract

image

image

Pose as Compositional Tokens (PCT)

image

1. Learning compositional tokens

image

2. Human pose estimation

image Pose is inferred using the learned codebook and decoder.
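
The inference step noted above (predict tokens, look them up in the learned codebook, decode to a pose) can be sketched in plain Python. This is a minimal illustration, not the authors' implementation: the names `lookup_tokens`, `infer_pose`, and `toy_decoder` are hypothetical, the hard argmax lookup is an assumption, and the decoder is a stand-in for the learned network.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def lookup_tokens(logits: Sequence[Sequence[float]],
                  codebook: Sequence[Vector]) -> List[Vector]:
    """For each token slot, return the codebook entry with the highest score.

    Assumption: `logits[m][k]` is the score of codebook entry k for slot m.
    """
    features = []
    for slot_logits in logits:
        if len(slot_logits) != len(codebook):
            raise ValueError("expected one logit per codebook entry")
        best = max(range(len(codebook)), key=lambda k: slot_logits[k])
        features.append(list(codebook[best]))
    return features


def infer_pose(logits: Sequence[Sequence[float]],
               codebook: Sequence[Vector],
               decoder: Callable[[List[Vector]], List[Tuple[float, float]]]):
    """Classify tokens, fetch their features, then decode them into a pose."""
    return decoder(lookup_tokens(logits, codebook))


def toy_decoder(features: List[Vector]) -> List[Tuple[float, float]]:
    """Hypothetical stand-in: reads one (x, y) pair from each token feature.

    A real decoder is a learned network that maps all tokens to all joints.
    """
    return [(f[0], f[1]) for f in features]


# Toy example: 3 codebook entries, 2 token slots.
codebook = [[0.0, 0.0], [1.0, 2.0], [3.0, 4.0]]
logits = [[0.1, 0.9, 0.0], [0.2, 0.1, 0.7]]
pose = infer_pose(logits, codebook, toy_decoder)
```

In this toy setup the first slot selects entry 1 and the second slot selects entry 2, so `pose` holds the two decoded points. The backbone and classification head that would produce `logits` from an image are left out.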

Experiments

  1. Datasets and metrics

    • Dataset
      • 2D: COCO, MPII / occlusion: CrowdPose, OCHuman, SyncOCC
      • 3D: H36M
    • Evaluation metrics: OKS-based AP, PCKh, MPJPE
  2. Implementation details

    • Training uses datasets that provide GT bounding boxes
    • backbone: Swin Transformer V2 (ImageNet-1k)
      • the backbone is frozen and only the head is trained
    • Augmentation via MMPose (scale, rotation, flip, dropout, color jitter)
    • AdamW + warm restart
  3. COCO, MPII

    image

    image

  4. Occlusion

    image

    image

  5. Number of codebook entries

    image
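
For reference, the metrics listed under "Datasets and metrics" can be written down compactly. The sketch below is a simplified illustration using the standard definitions, not the official evaluation code: it ignores keypoint visibility flags, the function names are hypothetical, and `area` and `k` follow the COCO convention (object area and per-keypoint constants).

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, ...]


def mpjpe(pred: Sequence[Point], gt: Sequence[Point]) -> float:
    """Mean per-joint position error: average Euclidean distance."""
    dists = [math.dist(p, g) for p, g in zip(pred, gt)]
    return sum(dists) / len(dists)


def pckh(pred: Sequence[Point], gt: Sequence[Point],
         head_size: float, alpha: float = 0.5) -> float:
    """Fraction of joints whose error is within alpha * head_size."""
    hits = [math.dist(p, g) <= alpha * head_size for p, g in zip(pred, gt)]
    return sum(hits) / len(hits)


def oks(pred: Sequence[Point], gt: Sequence[Point],
        area: float, k: Sequence[float]) -> float:
    """Object keypoint similarity: mean of exp(-d^2 / (2 * area * k_i^2)).

    Simplification: every keypoint is treated as labeled.
    """
    terms = [
        math.exp(-math.dist(p, g) ** 2 / (2 * area * ki ** 2))
        for p, g, ki in zip(pred, gt, k)
    ]
    return sum(terms) / len(terms)


# Toy example with two joints; the second one is off by 5 pixels.
pred = [(0.0, 0.0), (3.0, 4.0)]
gt = [(0.0, 0.0), (0.0, 0.0)]
print(mpjpe(pred, gt))                 # 2.5
print(pckh(pred, gt, head_size=8.0))   # 0.5
```

AP on COCO is then computed by thresholding OKS at several values and averaging precision across them, which this sketch does not cover.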

image

image
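
The "AdamW + warm restart" entry in the implementation details does not name the schedule. A common choice is cosine annealing with warm restarts (SGDR), and the sketch below assumes that form. The function name `warm_restart_lr`, the fixed cycle length, and the learning-rate values are illustrative assumptions, not settings from the paper.

```python
import math


def warm_restart_lr(step: int, base_lr: float,
                    min_lr: float = 0.0, cycle_len: int = 10) -> float:
    """Cosine annealing with warm restarts, using a fixed cycle length.

    Within a cycle the rate decays from base_lr toward min_lr along a
    half cosine; at the start of each new cycle it jumps back to base_lr.
    """
    t_cur = step % cycle_len  # position inside the current cycle
    cosine = 1.0 + math.cos(math.pi * t_cur / cycle_len)
    return min_lr + 0.5 * (base_lr - min_lr) * cosine


# Hypothetical values, chosen only to show the shape of the schedule.
schedule = [warm_restart_lr(s, base_lr=1e-3, cycle_len=10) for s in range(25)]
```

Here `schedule[0]`, `schedule[10]`, and `schedule[20]` all equal the base rate, which is the "restart". In PyTorch, `torch.optim.lr_scheduler.CosineAnnealingWarmRestarts` paired with `torch.optim.AdamW` provides the same kind of schedule.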