Closed AFESDTTM closed 5 months ago
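Also, it isn't using the GPU.
Hello, the cause of the "warning no valid point" message is here. It probably comes from annotation errors in the dataset itself and can be ignored during normal training.
As for the GPU not being used, I'm not sure what causes that either. Did you run python train.py --config config/REAL/camera_real.yaml directly?
It may be fixable by specifying the GPU index. For example, to use GPU 0: python train.py --config config/REAL/camera_real.yaml --gpus 0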
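If the `--gpus` flag alone does not seem to select the intended card, a common general-purpose workaround is to limit which devices the process can see with the `CUDA_VISIBLE_DEVICES` environment variable. The sketch below has not been verified against this repository, and the helper name `build_launch` is hypothetical. It only builds the command and environment; the visible devices are renumbered from 0 inside the child process, so `--gpus 0` is passed.

```python
import os
import sys


def build_launch(gpu_ids, config, script="train.py"):
    """Return (cmd, env) that exposes only the given physical GPUs to the child.

    `build_launch` is a hypothetical helper, not part of the repository.
    """
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(g) for g in gpu_ids)
    # Inside the child process the visible devices are renumbered from 0,
    # so the first (and here only) visible GPU is index 0.
    cmd = [sys.executable, script, "--config", config, "--gpus", "0"]
    return cmd, env


cmd, env = build_launch([1], "config/REAL/camera_real.yaml")
print("CUDA_VISIBLE_DEVICES=" + env["CUDA_VISIBLE_DEVICES"], " ".join(cmd))
# To actually launch: subprocess.run(cmd, env=env, check=True)
```

Watching `nvidia-smi` while the job runs shows which physical card is really in use.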
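If you have any other problems, feel free to keep asking.
Also, if you run out of GPU memory, you can adjust the batch size in the config file.
OK, thank you. I'll give it another try.
Hello, I found that when I specify cards 1 and 2, it actually runs on card 0. Do you know why, or which part of the code needs to be changed?
Hello, we have not tested multi-GPU training, because our code does not implement torch.nn.DataParallel or torch.nn.parallel.DistributedDataParallel, so specifying multiple GPUs may cause problems. All of our results were obtained on a single 24 GB RTX 3090. If you don't have enough GPU memory, you can adjust syn_bs and real_bs in the config. In our tests, reducing them somewhat while keeping the 3:1 ratio should not change the results much.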
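As a rough illustration of the 3:1 advice above, the sketch below derives `syn_bs` from a chosen `real_bs`. It reads the ratio as syn_bs:real_bs, following the order in which the two keys are named. The helper `scaled_batch_sizes` is hypothetical, and the example values are illustrative, not the repository defaults.

```python
def scaled_batch_sizes(real_bs, ratio=3):
    """Return (syn_bs, real_bs) with syn_bs : real_bs == ratio : 1.

    Hypothetical helper; the 3:1 reading (synthetic : real) is an assumption.
    """
    if real_bs < 1:
        raise ValueError("real_bs must be at least 1")
    return ratio * real_bs, real_bs


# Illustrative values only; pick the largest pair that fits in GPU memory.
for real_bs in (4, 2, 1):
    syn_bs, rb = scaled_batch_sizes(real_bs)
    print(f"syn_bs: {syn_bs}, real_bs: {rb}")
```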
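OK, thank you.
You're welcome to post your experimental results here, so that I can get a sense of how reproducible our experiments are on other machines.
Hello, roughly how long does one training epoch take on a 3090?
How do I train and test on the CAMERA dataset?
Is it enough to train on NOCS and then test directly on REAL and CAMERA?
OK, thank you for your reply.
Is that the only thing to change? I changed it and ran the test, and this is what it showed.
Hello, did you manage to test on the CAMERA dataset?
Hello, a question about the CAMERA dataset?
It was caused by the depth image names in our dataset not matching the names used in the author's code.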
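One way to spot a naming mismatch like this before running the test is to check, for every sample prefix, whether the files the loader expects actually exist. In the sketch below, `_color.png` is the suffix that appears in the traceback in this thread. `_depth.png` is only a placeholder assumption, so replace it with whatever suffix the dataset code really loads. The helper `find_missing` is hypothetical.

```python
from pathlib import Path


def find_missing(prefixes, suffixes=("_color.png", "_depth.png")):
    """Return (prefix, suffix) pairs whose file is not present on disk.

    "_depth.png" is an assumed placeholder; check the dataset code for the
    suffix it actually uses.
    """
    missing = []
    for prefix in prefixes:
        for suffix in suffixes:
            if not Path(str(prefix) + suffix).is_file():
                missing.append((str(prefix), suffix))
    return missing


# Example with a hypothetical sample prefix:
for prefix, suffix in find_missing(["data/camera/val/00000/0000"]):
    print("missing:", prefix + suffix)
```

If every sample is missing the same suffix, the files were most likely written under a different name during preprocessing.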
Hello, did you manage to test on the CAMERA dataset? Is the dataset you downloaded the one linked from the author's homepage? I got the following errors:
Test [717/19220][5]: : 4%|████▉ | 717/19220 [00:27<10:34, 29.18it/s]
warning: No data
Test [846/19220][1]: : 4%|█████▊ | 845/19220 [00:31<10:24, 29.41it/s]
warning: No data
Test [922/19220][3]: : 5%|██████▎ | 921/19220 [00:34<09:58, 30.56it/s]
libpng error: IDAT: CRC error
Test [938/19220][3]: : 5%|██████▍ | 938/19220 [00:35<11:22, 26.80it/s]
Traceback (most recent call last):
  File "/root/autodl-tmp/AG-Pose-main/test.py", line 114, in <module>
    test_func(model, dataloder, save_path)
  File "/root/autodl-tmp/AG-Pose-main/utils/solver.py", line 225, in test_func
    for i, data in enumerate(dataloder):
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/dataloader.py", line 681, in __next__
    data = self._next_data()
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/dataloader.py", line 1356, in _next_data
    return self._process_data(data)
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/dataloader.py", line 1402, in _process_data
    data.reraise()
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/_utils.py", line 461, in reraise
    raise exception
TypeError: Caught TypeError in DataLoader worker process 2.
Original Traceback (most recent call last):
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/_utils/worker.py", line 302, in _worker_loop
    data = fetcher.fetch(index)
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/_utils/fetch.py", line 49, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/root/miniconda3/envs/ag/lib/python3.9/site-packages/torch/utils/data/_utils/fetch.py", line 49, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/root/autodl-tmp/AG-Pose-main/provider/nocs_dataset.py", line 210, in __getitem__
    rgb = cv2.imread(image_path + '_color.png')[:, :, :3]
TypeError: 'NoneType' object is not subscriptable
Have you run into these problems?
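I haven't run into that. I downloaded the data from DPDN. After downloading, it needs to be preprocessed with the preprocessing code in DPDN.
That's the data_processing.py file, right? I've already finished the preprocessing. Testing on the REAL dataset works fine, but the CAMERA dataset gives the errors above.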
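The `libpng error: IDAT: CRC error` line just before the crash suggests that at least one PNG is damaged, for example by an incomplete download or extraction. `cv2.imread` returns `None` for a file it cannot read, which then fails at the `[:, :, :3]` indexing. This is a guess from the log, not a confirmed diagnosis. The standard-library sketch below checks the CRC of every chunk in a PNG so that bad files can be found up front. The helper name `png_is_intact` is hypothetical.

```python
import struct
import zlib
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_is_intact(path):
    """Return True if the file is a PNG whose chunk CRCs all match, up to IEND.

    This checks chunk checksums only; it does not decode the image data.
    """
    path = Path(path)
    if not path.is_file():
        return False
    data = path.read_bytes()
    if not data.startswith(PNG_SIGNATURE):
        return False
    pos = len(PNG_SIGNATURE)
    # Each chunk: 4-byte length, 4-byte type, data, 4-byte CRC over type+data.
    while pos + 12 <= len(data):
        (length,) = struct.unpack(">I", data[pos:pos + 4])
        chunk_type = data[pos + 4:pos + 8]
        end = pos + 8 + length
        if end + 4 > len(data):
            return False  # truncated chunk
        (stored_crc,) = struct.unpack(">I", data[end:end + 4])
        if zlib.crc32(data[pos + 4:end]) != stored_crc:
            return False  # checksum mismatch
        if chunk_type == b"IEND":
            return True
        pos = end + 4
    return False  # ran out of data before reaching IEND
```

Running it over every `*_color.png` in the CAMERA test split (for example with `Path(root).rglob("*_color.png")`) lists the files that need to be downloaded or extracted again.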
Hello, are your CAMERA test results very different from those in the original paper? Did you see any "no data" warnings during testing?
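The "no data" warnings I saw were caused by the processed depth images having names that differ from the CAMERA depth image names used in the author's code. My CAMERA test results were about 4 points lower, which may be a problem with how the model was loaded.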
There is something to watch out for when testing REAL and CAMERA one after the other in the same experiment: https://github.com/Leeiieeo/AG-Pose/issues/11#issuecomment-2181848287
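OK, thank you for your help.
Hello, I'd like to ask what "warning no valid point" means when it appears during training. I downloaded the model to my local machine. Did I download the wrong model, or is there some other cause?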