ASD with SD
(Stable Diffusion). Feel free to change the prompt accordingly.
sh scripts/single-prompt-benchmark/asd_sd_nerf.sh
ASD with MV
(MVDream). Feel free to change the prompt accordingly.
sh scripts/single-prompt-benchmark/asd_mv_nerf.sh
The following 3D generator
architectures are available:
Network | Description | File |
---|---|---|
Hyper-iNGP | iNGP with text-conditioned linear layers,adopted from ATT3D | geometry, background |
3DConv-net | A StyleGAN generator that outputs voxels with 3D convolution, code adopted from CC3D | geometry, architecture |
Triplane-Transformer | Transformer-based 3D Generator, with Triplane as the output structure, adopted from LRM | geometry, architecture |
The following corpus
datasets are available:
Corpus | Description | File |
---|---|---|
MG15 | 15 text pormpts from Magic3D project page | json |
DF415 | 415 text pormpts from DreamFusion project page | json |
AT2520 | 2520 text pormpts from ATT3D experiments | json |
DL17k | 17k text pormpts from Instant3D release | json |
CP100k | 110k text pormpts from Cap3D dataset | json |
Run the following script to start training
Hyper-iNGP
with SD
on MG15
sh scripts/multi-prompt-benchmark/asd_sd_hyper_iNGP_MG15.sh
3DConv-net
with SD
on DF415
sh scripts/multi-prompt-benchmark/asd_sd_3dconv_net_DF415.sh
3DConv-net
with SD
on AT2520
sh scripts/multi-prompt-benchmark/asd_sd_3dconv_net_AT2520.sh
Triplane-Transformer
with MV
on DL17k
sh scripts/multi-prompt-benchmark/asd_mv_triplane_transformer_DL17k.sh
3DConv-net
with SD
on CP100k
scripts/multi-prompt-benchmark/asd_sd_3dconv_net_CP100k.sh
Create a directory to save the checkpoints
mkdir pretrained/3d_checkpoints
The checkpoints of the âŦī¸ experiments are available. Save the corresponding .pth
file to 3d_checkpoint
, then run the scripts as below.
Hyper-iNGP
with SD
on MG15
. The ckpt in Google Drivesh scripts/multi_prompts_benchmark_evaluation/asd_sd_hyper_iNGP_MG15.sh
3DConv-net
with SD
on DF415
. The ckpt in Google Drivesh scripts/multi_prompts_benchmark_evaluation/asd_sd_3dconv_net_DF415.sh
3DConv-net
with SD
on AT2520
. The ckpt in Google Drivesh scripts/multi_prompts_benchmark_evaluation/asd_sd_3dconv_net_AT2520.sh
Triplane-Transformer
with MV
on DL17k
. The ckpt in Google Drivesh scripts/multi_prompts_benchmark_evaluation/asd_mv_triplane_transformer_DL17k.sh
3DConv-net
with SD
on CP100k
. The ckpt in Google Drivesh scripts/multi_prompts_benchmark_evaluation/asd_sd_3dconv_net_CP100k.sh
The rendered images and videos are saved in outputs/<experiment_name>/save/<num_iter>
directory. Compute the metrics with CLIP via
python evaluation/CLIP/evaluation_amortized.py --result_dir <video_dir>
custom/amortized/models/geometry
, check out the other code in that directory for reference.custom/amortized/models/geometry/__init__.py
system.geometry_type
argument, check out the other code in the configs/multi-prompt_benchmark
directory for reference.threestudio/models/guidance
, take a look at the other code in that directory or other guidance for reference.threestudio/models/guidance/__init__.py
system.guidance_type
argument, take a look at the other code in the configs/multi-prompt_benchmark
directory for reference.load
directory system.prompt_processor.prompt_library
argument to set up the corpus, take other commands in the scripts
directory for referenceYou can also add your modules for data
, renderer
, prompt_processor
, etc.
If you find this paper helpful, please cite
@article{ma2024scaledreamer,
title={ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation},
author={Ma, Zhiyuan and Wei, Yuxiang and Zhang, Yabin and Zhu, Xiangyu and Lei, Zhen and Zhang, Lei},
journal={arXiv preprint arXiv:2407.02040},
year={2024}
}