Closed moyu026 closed 1 year ago
Does it start now? It will take some time to compile the torch extensions
No, it just prints some info and stops running
When you hit Ctrl+C, what does it print out?
when I hit Ctrl+C, it shows me that open3d is not installed, but my pachages have open3d
(NKSR) l@node1:/mnt/disk2/workspace/l/NKSR-public$ python train.py configs/points2surf/train.yaml
07-06 15:25:29 (train.py:67) [INFO] Intelligent GPU selection: 0
Tensorboard logger, version number = 1
Global seed set to 0
/mnt/disk2/.conda/l/envs/NKSR/lib/python3.10/site-packages/pytorch_lightning/trainer/connectors/accelerator_connector.py:446: LightningDeprecationWarning: Setting Trainer(gpus=1)
is deprecated in v1.7 and will be removed in v2.0. Please use Trainer(accelerator='gpu', devices=1)
instead.
rank_zero_deprecation(
Auto select gpus: [0]
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
HPU available: False, using: 0 HPUs
^C07-06 15:25:30 (o3d.py:10) [ERROR] Open3D not installed! You can try either the following 2 options:
pip install python-pycg[full] -f https://pycg.s3.ap-northeast-1.amazonaws.com/packages/index.html
pip install python-pycg[all]
Traceback (most recent call last):
File "/mnt/disk2/workspace/l/NKSR-public/train.py", line 258, in
Package Version
absl-py 1.4.0 addict 2.4.0 aiohttp 3.8.4 aiosignal 1.3.1 ansi2html 1.8.0 antlr4-python3-runtime 4.9.3 appdirs 1.4.4 asttokens 2.2.1 async-timeout 4.0.2 attrs 23.1.0 backcall 0.2.0 ca-certificates 2021.4.13 cachetools 5.3.1 calmsize 0.1.3 certifi 2023.5.7 charset-normalizer 3.1.0 click 8.1.3 cmake 3.26.3 comm 0.1.3 ConfigArgParse 1.5.3 contourpy 1.0.7 cycler 0.11.0 dash 2.11.1 dash-core-components 2.0.0 dash-html-components 2.0.0 dash-table 5.0.0 debugpy 1.6.7 decorator 5.1.1 docker-pycreds 0.4.0 executing 1.2.0 fastjsonschema 2.17.1 filelock 3.12.0 fire 0.5.0 Flask 2.2.5 flatten-dict 0.4.2 fonttools 4.40.0 frozenlist 1.3.3 fsspec 2023.5.0 gitdb 4.0.10 GitPython 3.1.31 google-auth 2.21.0 google-auth-oauthlib 1.0.0 grpcio 1.56.0 idna 3.4 ipykernel 6.23.2 ipython 8.13.2 ipywidgets 8.0.6 itsdangerous 2.1.2 jedi 0.18.2 Jinja2 3.1.2 joblib 1.2.0 jsonschema 4.17.3 jupyter_client 8.2.0 jupyter_core 5.3.0 jupyterlab-widgets 3.0.7 kiwisolver 1.4.4 lightning-lite 1.8.0 lightning-utilities 0.3.0 lit 16.0.6 Markdown 3.4.3 MarkupSafe 2.1.2 matplotlib 3.7.1 matplotlib-inline 0.1.6 mpmath 1.3.0 multidict 6.0.4 nbformat 5.5.0 nest-asyncio 1.5.6 networkx 3.1 ninja 1.11.1 nksr 1.0.3+pt20cu117 numpy 1.25.0 oauthlib 3.2.2 omegaconf 2.3.0 open3d 0.16.1+c65c7ef packaging 23.1 pandas 2.0.3 parso 0.8.3 pathtools 0.1.2 pexpect 4.8.0 pickleshare 0.7.5 Pillow 9.5.0 pip 23.1.2 platformdirs 3.5.3 plotly 5.14.1 plyfile 0.9 prompt-toolkit 3.0.38 protobuf 4.23.3 psutil 5.9.5 ptyprocess 0.7.0 pure-eval 0.2.2 pyasn1 0.5.0 pyasn1-modules 0.3.0 pybind11 2.10.4 Pygments 2.15.1 pykdtree 1.3.7.post0 pyntcloud 0.3.1 pynvml 11.5.0 pyparsing 3.1.0 pyquaternion 0.9.9 pyrsistent 0.19.3 python-dateutil 2.8.2 python-pycg 0.5.2 pytorch-lightning 1.8.0 pytz 2023.3 PyYAML 6.0 pyzmq 25.1.0 randomname 0.2.1 requests 2.31.0 requests-oauthlib 1.3.1 retrying 1.3.4 rsa 4.9 scikit-learn 1.2.2 scipy 1.11.1 sentry-sdk 1.26.0 setproctitle 1.3.2 setuptools 68.0.0 six 1.16.0 smmap 5.0.0 stack-data 0.6.2 sympy 1.12 tenacity 8.2.2 tensorboard 2.13.0 tensorboard-data-server 0.7.0 termcolor 2.3.0 threadpoolctl 3.1.0 torch 2.0.0+cu117 torch-scatter 2.1.1 torchmetrics 0.11.4 torchvision 0.15.0+cu117 tornado 6.3.2 tqdm 4.65.0 traitlets 5.9.0 triton 2.0.0 typing_extensions 4.7.0 tzdata 2023.3 urllib3 1.26.16 usd-core 23.5 wandb 0.15.4 wcwidth 0.2.6 Werkzeug 2.2.3 wheel 0.40.0 widgetsnbextension 4.0.7 yarl 1.9.2
I change open3d version and this problem is solved,but it still doesn't start training
(NKSR) l@node1:/mnt/disk2/workspace/l/NKSR-public$ python train.py configs/points2surf/train.yaml
07-07 15:12:21 (train.py:67) [INFO] Intelligent GPU selection: 0
Tensorboard logger, version number = 1
Global seed set to 0
/mnt/disk2/.conda/l/envs/NKSR/lib/python3.10/site-packages/pytorch_lightning/trainer/connectors/accelerator_connector.py:446: LightningDeprecationWarning: Setting Trainer(gpus=1)
is deprecated in v1.7 and will be removed in v2.0. Please use Trainer(accelerator='gpu', devices=1)
instead.
rank_zero_deprecation(
Auto select gpus: [0]
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
HPU available: False, using: 0 HPUs
^CTraceback (most recent call last):
File "/mnt/disk2/workspace/l/NKSR-public/train.py", line 259, in
I want to use 'python train.py configs/points2surf/train.yaml' to train Points2Surf dataset, It just prints some info but doesn't start training. 07-06 09:33:42 (train.py:67) [INFO] Intelligent GPU selection: 0 Tensorboard logger, version number = 1 Global seed set to 0 /mnt/disk2/.conda/liudengjin/envs/NKSR/lib/python3.10/site-packages/pytorch_lightning/trainer/connectors/accelerator_connector.py:446: LightningDeprecationWarning: Setting
Trainer(gpus=1)
is deprecated in v1.7 and will be removed in v2.0. Please useTrainer(accelerator='gpu', devices=1)
instead. rank_zero_deprecation( Auto select gpus: [0] GPU available: True (cuda), used: True TPU available: False, using: 0 TPU cores IPU available: False, using: 0 IPUs HPU available: False, using: 0 HPUs