Holistic-Motion2D / Tender

The official code for Tender
Apache License 2.0
35 stars 0 forks source link

Holistic-Motion2D

This is the official code release of Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space by Yuan Wang*, Zhao Wang*, Junhao Gong*, Di Huang, Tong He, Wanli Ouyang, Jile Jiao, Xuetao Feng, Qi Dou, Shixiang Tang†, Dan Xu.

📝 Changelog

We present the first time a large-scale human motion benchmark, Holistic-Motion2D, including over 1M in-the-wild motion sequences, each paired with high-quality whole-body or partial pose annotations and textual descriptions.

image-20240817150254870

image-20240817150543506

image-20240817160339303

image-20240817160401230

Dataset Collection and Processing

image-20240817150513111

All data will be downloaded on Open-Data Lab:

Path Size Files Format Description
Holistic-Motion-2D-dataset 118.15 GB 1,464,278 Main folder
├  kpfiles 118.06 GB 400,790 Sequence of key points for character motion
├ ├  UCF101 5.03 GB 11,391 Pickle Whole-body key-points for UCF101
├ ├  CAER 153.81 MB 3,542 Pickle Facial key-points for CAER
├ ├  K400 55.54 GB 152,798 Pickle Whole-body key-points for Kinetics-400
├ ├  InternVid 44.02 GB 85,665 Pickle Whole-body key-points for InternVid
├ ├  K700 0 0 Pickle Whole-body key-points for Kinetics-700
├ ├  IDEA400 6.33 GB 12,025 Pickle Whole-body key-points for IDEA400
├ ├  sthv2 900.19 MB 106,661 Pickle Hand key-points for Something-to-Something-v2
├ ├  UBody 3.21 GB 5,195 Pickle Whole-body key-points for UBody
├ ├  DFEW 1.68 GB 15,524 Pickle Facial key-points for DFEW
├  texts 101.05 MB 1,063,488 Caption for character motion video
├ ├  UCF101 4.68 MB 24,711 TXT Texts for UCF101
├ ├  CAER 32.16 KB 4,574 TXT Texts for CARE
├ ├  K400 40.6 MB 215,479 TXT Texts for Kinetics-400
├ ├  InternVid 20.52 MB 421,894 TXT Texts for InternVid
├ ├  K700 22.75 MB 141,611 TXT Texts for Kinetics-700
├ ├  IDEA400 2.96 MB 12,025 TXT Texts for IDEA400
├ ├  sthv2 8.05 MB 220,848 TXT Texts for Something-to-Something-v2
├ ├  UBody 1.01 MB 5,974 TXT Texts for UBody
├ ├  DFEW 456.1 KB 16,372 TXT Texts for DFEW

2D text-driven whole-body motion generation model

image-20240817173409065

MDM MLD T2M-GPT Tender(Ours)

Downstream Applications

License