hanwenzhu / dreamhoi

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
https://DreamHOI.github.io
Other
29 stars 2 forks source link

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors

Website | Paper

Thomas Zhu, Ruining Li*, Tomas Jakab* (*equal advising)
Visual Geometry Group, University of Oxford

Teaser figure We present DreamHOI, a novel method for zero-shot synthesis of human-object interactions (HOIs), enabling a 3D human model to realistically interact with any given object based on a textual description.

Installation

Please carefully follow docs/installation.md to set up your system for DreamHOI.

Running

To run our pipeline, run

python main.py \
  --num_iterations 1 \
  --tag sit-ball \
  --smpl_texture /path/to/smpl/texture.png \
  --smpl_shape /path/to/smpl/shape.npy \
  --smpl_gender female \
  --smpl_variant smplh \
  --prompt "A photo of a person sitting on a ball, high detail, photography" \
  --prompt_human "A photo of a person, high detail, photography" \
  --negative_prompt "missing limbs, missing legs, missing arms" \
  --negative_prompt_human "ball, missing limbs, missing legs, missing arms" \
  --mesh_path /path/to/ball.obj \
  --mesh_normalize \
  --mesh_translation 0.0 0.0 -0.3 \
  --mesh_scale 0.3 \
  --mesh_rotation_deg 0.0 \
  --mesh_tilt_deg 0.0 \
  --checkpoint_interval 1000 \
  --use_wandb \
  --openpose_dir /path/to/openpose \
  --openpose_bin /path/to/openpose/build/examples/openpose/openpose.bin \
  --nerf_init_args ... \
  --nerf_refit_args ...

where:

main.py is a wrapper around our pipeline, and you can modify our pipeline by directly modifying it.

After running, DreamHOI outputs:

Note that as in threestudio, the generated meshes are Z-up, and you can transform back to Y-up by a -π/2 rotation about the X axis.

Extra configurations

You may also add

Tips to improve generation

License

DreamHOI is released under MIT License. Some parts of this project uses third-party software. See LICENSE for their respective notices and licenses.

Citation

@article{zhu2024dreamhoi,
  title   = {{DreamHOI}: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors},
  author  = {Thomas Hanwen Zhu and Ruining Li and Tomas Jakab},
  journal = {arXiv preprint arXiv:2409.08278},
  year    = {2024}
}