princeton-vl / infinigen

Infinite Photorealistic Worlds using Procedural Generation
https://infinigen.org
BSD 3-Clause "New" or "Revised" License
5.14k stars 430 forks source link

Error on Creating high quality videos #224

Closed TomTomTommi closed 2 months ago

TomTomTommi commented 2 months ago

Describe the bug

I successfully run the Hello World demo. But following the Example Commands to generate high quality videos,

python -m infinigen.datagen.manage_jobs --output_folder outputs/my_videos --num_scenes 500     --pipeline_config slurm monocular_video cuda_terrain opengl_gt     --cleanup big_files --warmup_sec 60000 --config video high_quality_terrain

but an error occur:

WARNING:root:SMB_AUTH envvar is not set, smb_client upload will not work. Ignore this message if not using upload
WARNING:infinigen.datagen.job_funcs:process_mesh_path=PosixPath('/home/jj323/PycharmProjects/infinigen/infinigen/datagen/customgt/build/customgt') does not exist, if opengl_gt is enabled it will fail
/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
  setattr(self, word, getattr(machar, word).flat[0])
/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
  return self._float_to_str(self.smallest_subnormal)
Using get_slurm_banned_nodes()=[]
outputs/my_videos 04/23 05:59PM -> 04/23 05:59PM
Traceback (most recent call last):
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/runpy.py", line 196, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/runpy.py", line 86, in _run_code
    exec(code, run_globals)
  File "/home/jj323/PycharmProjects/infinigen/infinigen/datagen/manage_jobs.py", line 812, in <module>
    main(args)
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/config.py", line 1605, in gin_wrapper
    utils.augment_exception_message_and_reraise(e, err_str)
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/utils.py", line 41, in augment_exception_message_and_reraise
    raise proxy.with_traceback(exception.__traceback__) from None
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/config.py", line 1582, in gin_wrapper
    return fn(*new_args, **new_kwargs)
  File "/home/jj323/PycharmProjects/infinigen/infinigen/datagen/manage_jobs.py", line 675, in main
    manage_datagen_jobs(all_scenes, elapsed=(now-start_time).total_seconds())
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/config.py", line 1605, in gin_wrapper
    utils.augment_exception_message_and_reraise(e, err_str)
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/utils.py", line 41, in augment_exception_message_and_reraise
    raise proxy.with_traceback(exception.__traceback__) from None
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/site-packages/gin/config.py", line 1582, in gin_wrapper
    return fn(*new_args, **new_kwargs)
  File "/home/jj323/PycharmProjects/infinigen/infinigen/datagen/manage_jobs.py", line 608, in manage_datagen_jobs
    control_state = compute_control_state(args, totals, elapsed, num_concurrent)
  File "/home/jj323/PycharmProjects/infinigen/infinigen/datagen/manage_jobs.py", line 567, in compute_control_state
    num_concurrent = int(os.environ[NUM_CONCURRENT_ENVVAR])
  File "/home/jj323/anaconda3/envs/infinigen/lib/python3.10/os.py", line 679, in __getitem__
    raise KeyError(key) from None
KeyError: 'INFINIGEN_NUMCONCURRENT_TARGET'
  In call to configurable 'manage_datagen_jobs' (<function manage_datagen_jobs at 0x7f2ff5133130>)
  In call to configurable 'main' (<function main at 0x7f2ff51330a0>)

All the commands in ConfiguringInfinigen.md have the same error.

Platform

araistrick commented 2 months ago

Hello - are you running on a slurm cluster? if so just set export INFINIGEN_NUM_CONCURRENT_TARGET=100 or something similar to specify how many jobs you wish to run in paralell. I need to add better docs / eliminate this variable.

If you are not using a slurm cluster, please replace slurm with local_128GB or some other option, and read ConfiguringInfinigen.md for more on what this means.