Error when running on a CPU machine: InvalidArgumentError (see above for traceback): Default MaxPoolingOp only supports NHWC on device type CPU

tang1485 commented 3 years ago

Running bash run_opt_rgb.sh fails with InvalidArgumentError (see above for traceback): Default MaxPoolingOp only supports NHWC on device type CPU. This is a CPU-only machine with the CPU build of TensorFlow installed.

prepare datas
start MTCNN
MTCNN detect
hello
2020-11-05 15:07:19.810071: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
=========================
has run MTCNN: 1 / 1
start detect 86pt 3D lmk
hello
has run 86pt lmk: 1 / 1
start detect 68pt 2D lmk
hello
WARNING:tensorflow:From /root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/util/tf_should_use.py:118: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use `tf.global_variables_initializer` instead.
W1105 15:07:21.785537 140196717344576 tf_logging.py:126] From /root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/util/tf_should_use.py:118: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use `tf.global_variables_initializer` instead.
load lm: /data/hifi3dface/hifi3dface/test_data/RGB/test1/single_img//prepare_rgb/lmk_3D_86pts_ori.txt
2020-11-05 15:07:24.326655: E tensorflow/core/common_runtime/executor.cc:660] Executor failed to create kernel. Invalid argument: Default MaxPoolingOp only supports NHWC on device type CPU
         [[Node: max_pool = MaxPool[T=DT_FLOAT, data_format="NCHW", ksize=[1, 1, 2, 2], padding="VALID", strides=[1, 1, 2, 2], _device="/job:localhost/replica:0/task:0/device:CPU:0"](Relu_23)]]
Traceback (most recent call last):
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1322, in _do_call
    return fn(*args)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1307, in _run_fn
    options, feed_dict, fetch_list, target_list, run_metadata)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1409, in _call_tf_sessionrun
    run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Default MaxPoolingOp only supports NHWC on device type CPU
         [[Node: max_pool = MaxPool[T=DT_FLOAT, data_format="NCHW", ksize=[1, 1, 2, 2], padding="VALID", strides=[1, 1, 2, 2], _device="/job:localhost/replica:0/task:0/device:CPU:0"](Relu_23)]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "run_data_preparation.py", line 342, in <module>
    app.run(main)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/absl/app.py", line 300, in run
    _run_main(main, args)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/absl/app.py", line 251, in _run_main
    sys.exit(main(argv))
  File "run_data_preparation.py", line 321, in main
    prepare_test_data_RGB(FLAGS.img_dir, FLAGS.out_dir)
  File "run_data_preparation.py", line 217, in prepare_test_data_RGB
    pb_path, img_dir, lmk3D_ori_txt_path, lmk2D_ori_txt_path
  File "run_data_preparation.py", line 84, in detect_2Dlmk_all_imgs
    np.array([lmk3D]), np.array([img]), sess
  File "/data/hifi3dface/hifi3dface/data_prepare/detect_2D_landmark.py", line 226, in detect_2Dlmk68
    heatmap = sess.run(outputs, {inputs: test_img})
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 900, in run
    run_metadata_ptr)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1135, in _run
    feed_dict_tensor, options, run_metadata)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1316, in _do_run
    run_metadata)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1335, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Default MaxPoolingOp only supports NHWC on device type CPU
         [[Node: max_pool = MaxPool[T=DT_FLOAT, data_format="NCHW", ksize=[1, 1, 2, 2], padding="VALID", strides=[1, 1, 2, 2], _device="/job:localhost/replica:0/task:0/device:CPU:0"](Relu_23)]]

Caused by op 'max_pool', defined at:
  File "run_data_preparation.py", line 342, in <module>
    app.run(main)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/absl/app.py", line 300, in run
    _run_main(main, args)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/absl/app.py", line 251, in _run_main
    sys.exit(main(argv))
  File "run_data_preparation.py", line 321, in main
    prepare_test_data_RGB(FLAGS.img_dir, FLAGS.out_dir)
  File "run_data_preparation.py", line 217, in prepare_test_data_RGB
    pb_path, img_dir, lmk3D_ori_txt_path, lmk2D_ori_txt_path
  File "run_data_preparation.py", line 70, in detect_2Dlmk_all_imgs
    tf.import_graph_def(graph_def, name="")
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/util/deprecation.py", line 432, in new_func
    return func(*args, **kwargs)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/importer.py", line 513, in import_graph_def
    _ProcessNewOps(graph)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/importer.py", line 303, in _ProcessNewOps
    for new_op in graph._add_new_tf_operations(compute_devices=False):  # pylint: disable=protected-access
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3540, in _add_new_tf_operations
    for c_op in c_api_util.new_tf_operations(self)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3540, in <listcomp>
    for c_op in c_api_util.new_tf_operations(self)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3428, in _create_op_from_tf_operation
    ret = Operation(c_op, self)
  File "/root/anaconda3/envs/py36/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 1718, in __init__
    self._traceback = self._graph._extract_stack()  # pylint: disable=protected-access

InvalidArgumentError (see above for traceback): Default MaxPoolingOp only supports NHWC on device type CPU
         [[Node: max_pool = MaxPool[T=DT_FLOAT, data_format="NCHW", ksize=[1, 1, 2, 2], padding="VALID", strides=[1, 1, 2, 2], _device="/job:localhost/replica:0/task:0/device:CPU:0"](Relu_23)]]

data prepare failed

cyj907 commented 3 years ago

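We have only tested on GPU. CPU is most likely not supported, since the rasterization part is written in CUDA. If you want to run on CPU, you will need to rewrite every operation that does not support CPU.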
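
As a starting point for that rewrite, the sketch below lists the nodes of a frozen graph whose `data_format` attribute is `NCHW`, which is the attribute on the `max_pool` node in the traceback. It assumes the TensorFlow 1.x API seen in the log. `find_nchw_nodes` and `load_graph_def` are hypothetical helper names, not part of hifi3dface, and the demo uses stand-in nodes instead of the real .pb file.

```python
from types import SimpleNamespace


def find_nchw_nodes(nodes):
    """Return (name, op) for every node whose data_format attribute is NCHW.

    Works on graph_def.node from a TF 1.x GraphDef, or on any objects that
    expose .name, .op and a mapping .attr whose values carry a bytes field .s.
    """
    hits = []
    for node in nodes:
        # Test membership first so that a missing key is never created.
        if "data_format" in node.attr and node.attr["data_format"].s == b"NCHW":
            hits.append((node.name, node.op))
    return hits


def load_graph_def(pb_path):
    """Read a frozen graph from disk (assumes TensorFlow 1.x is installed)."""
    import tensorflow as tf  # imported lazily; not needed for the demo below

    graph_def = tf.GraphDef()
    with tf.gfile.GFile(pb_path, "rb") as f:
        graph_def.ParseFromString(f.read())
    return graph_def


# Stand-in nodes that mimic the two layouts; not taken from the real model.
fake_nodes = [
    SimpleNamespace(name="Relu_23", op="Relu", attr={}),
    SimpleNamespace(
        name="max_pool",
        op="MaxPool",
        attr={"data_format": SimpleNamespace(s=b"NCHW")},
    ),
    SimpleNamespace(
        name="other_pool",
        op="MaxPool",
        attr={"data_format": SimpleNamespace(s=b"NHWC")},
    ),
]

nchw_nodes = find_nchw_nodes(fake_nodes)
print(nchw_nodes)
```

With a real model, `find_nchw_nodes(load_graph_def(pb_path).node)` would give the list of nodes to look at first. This covers only the layout problem reported in this issue; the CUDA rasterization mentioned above is a separate obstacle.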

JacksonL1 commented 3 years ago

Changing the networks in the two functions detect_2Dlmk_all_imgs and face_seg to run on the GPU fixes it.

qianxinchun commented 3 years ago

That still does not work for me. Have you managed to run it entirely on CPU?

JacksonL1 commented 3 years ago

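I did not get it running on CPU. As cyj907 replied, that would require a rewrite, so I switched those networks to the GPU instead. I will look up the exact places I changed for you tomorrow.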

JacksonL1 commented 3 years ago

In the two functions above I changed only the following, and after that it ran for me. If it still fails, please post the error message. Change

with tf.Graph().as_default():

to

with tf.Graph().as_default(), tf.device('/device:XLA_GPU:0'):
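
The device string does not have to be hard-coded. Below is a minimal sketch, assuming TensorFlow 1.x and a frozen .pb graph as in the traceback. `pick_device`, `local_device_names` and `load_frozen_graph` are hypothetical helper names, not functions from hifi3dface.

```python
def pick_device(device_names):
    """Prefer a plain GPU, then an XLA GPU, otherwise fall back to the CPU."""
    for wanted in ("/device:GPU:", "/device:XLA_GPU:"):
        for name in device_names:
            if name.startswith(wanted):
                return name
    return "/device:CPU:0"


def local_device_names():
    """Names of the devices TensorFlow can see (assumes TF 1.x is installed)."""
    from tensorflow.python.client import device_lib

    return [d.name for d in device_lib.list_local_devices()]


def load_frozen_graph(pb_path, device):
    """Import a frozen graph while a device scope is active, as in the fix above."""
    import tensorflow as tf  # imported lazily; not needed for the demo below

    graph = tf.Graph()
    with graph.as_default(), tf.device(device):
        graph_def = tf.GraphDef()
        with tf.gfile.GFile(pb_path, "rb") as f:
            graph_def.ParseFromString(f.read())
        tf.import_graph_def(graph_def, name="")
    return graph


# Demo with literal device names, so it runs without TensorFlow installed.
chosen = pick_device(["/device:CPU:0", "/device:XLA_GPU:0"])
print(chosen)
```

In the real code this would be called as `load_frozen_graph(pb_path, pick_device(local_device_names()))`. If only a CPU is present the helper returns `/device:CPU:0`, and the NCHW max-pool error from the original report is expected to come back.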