Kai-46 / nerfplusplus

improves over nerf in 360 capture of unbounded scenes
BSD 2-Clause "Simplified" License
911 stars 101 forks source link

Process 1 terminated with the following error #49

Open shuimoqingyin opened 1 year ago

shuimoqingyin commented 1 year ago

2022-11-18 23:46:13,697 [INFO] root: tat_training_Truck step: 0 resolution: 1.000000 level_0/loss: 0.064675 level_0/pnsr: 11.892565 level_1/loss: 0.064430 level_1/pnsr: 11.909071 iter_time: 0.250360 Exception in thread Thread-1: Traceback (most recent call last): File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/threading.py", line 916, in _bootstrap_inner self.run() File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/site-packages/tensorboardX/event_file_writer.py", line 202, in run data = self._queue.get(True, queue_wait_duration) File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/multiprocessing/queues.py", line 108, in get res = self._recv_bytes() File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/multiprocessing/connection.py", line 216, in recv_bytes buf = self._recv_bytes(maxlength) File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/multiprocessing/connection.py", line 407, in _recv_bytes buf = self._recv(4) File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/multiprocessing/connection.py", line 383, in _recv raise EOFError EOFError

Traceback (most recent call last): File "ddp_train_nerf.py", line 604, in train() File "ddp_train_nerf.py", line 599, in train join=True) File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/site-packages/torch/multiprocessing/spawn.py", line 200, in spawn return start_processes(fn, args, nprocs, join, daemon, start_method='spawn') File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/site-packages/torch/multiprocessing/spawn.py", line 158, in start_processes while not context.join(): File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/site-packages/torch/multiprocessing/spawn.py", line 119, in join raise Exception(msg) Exception:

-- Process 1 terminated with the following error: Traceback (most recent call last): File "/~~/anaconda3/envs/nerfplusplus/lib/python3.6/site-packages/torch/multiprocessing/spawn.py", line 20, in _wrap fn(i, *args) File "/~~/nerfplusplus-master/ddp_train_nerf.py", line 488, in ddp_train_nerf idx = what_val_to_log % len(val_ray_samplers) ZeroDivisionError: integer division or modulo by zero

how to realise the problem,could you help me

CYGG1123 commented 1 year ago

You can refer to #2 to solve your problem, or just edit the argument from "validation" to "test" like below:

image