bytedance / byteps

A high performance and generic framework for distributed DNN training

pip install error #217

Closed jasperzhong closed 4 years ago

jasperzhong commented 4 years ago

Describe the bug

$ pip3 install byteps
Collecting byteps
  Using cached https://files.pythonhosted.org/packages/90/b3/180bd882fe262482e20da1169eeda5cd8c2442eaddee0601fa01abf4d6a8/byteps-0.2.1.tar.gz
Building wheels for collected packages: byteps
  Running setup.py bdist_wheel for byteps: started
  Running setup.py bdist_wheel for byteps: finished with status 'error'
  Complete output from command /usr/bin/python3 -u -c "import setuptools, tokenize;__file__='/tmp/pip-build-cncjh9xt/byteps/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" bdist_wheel -d /tmp/tmp5dg6gdovpip-wheel- --python-tag cp36:
  /usr/lib/python3.6/distutils/dist.py:261: UserWarning: Unknown distribution option: 'long_description_content_type'
    warnings.warn(msg)
  running bdist_wheel
  running build
  running build_py
  creating build
  creating build/lib.linux-x86_64-3.6
  creating build/lib.linux-x86_64-3.6/byteps
  copying byteps/__init__.py -> build/lib.linux-x86_64-3.6/byteps
  copying byteps/__version__.py -> build/lib.linux-x86_64-3.6/byteps
  creating build/lib.linux-x86_64-3.6/byteps/server
  copying byteps/server/__init__.py -> build/lib.linux-x86_64-3.6/byteps/server
  creating build/lib.linux-x86_64-3.6/byteps/_keras
  copying byteps/_keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/_keras
  copying byteps/_keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/_keras
  creating build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/__init__.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/ops.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
  creating build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/__init__.py -> build/lib.linux-x86_64-3.6/byteps/common
  creating build/lib.linux-x86_64-3.6/byteps/keras
  copying byteps/keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/keras
  copying byteps/keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/keras
  creating build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/tensorflow/compression.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/tensorflow/__init__.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/tensorflow/ops.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/tensorflow/util.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  creating build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/compression.py -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/__init__.py -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/ops.py -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/cross_barrier.py -> build/lib.linux-x86_64-3.6/byteps/torch
  creating build/lib.linux-x86_64-3.6/byteps/misc
  copying byteps/misc/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc
  creating build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
  copying byteps/misc/imagenet18/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
  running egg_info
  writing byteps.egg-info/PKG-INFO
  writing dependency_links to byteps.egg-info/dependency_links.txt
  writing top-level names to byteps.egg-info/top_level.txt
  reading manifest file 'byteps.egg-info/SOURCES.txt'
  reading manifest template 'MANIFEST.in'
  warning: no previously-included files found matching '.git/*'
  warning: no directories found matching '3rdparty/ps-lite'
  writing manifest file 'byteps.egg-info/SOURCES.txt'
  copying byteps/server/queue.h -> build/lib.linux-x86_64-3.6/byteps/server
  copying byteps/server/server.cc -> build/lib.linux-x86_64-3.6/byteps/server
  copying byteps/server/server.h -> build/lib.linux-x86_64-3.6/byteps/server
  copying byteps/mxnet/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/adapter.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/ops.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/ops.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/tensor_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/tensor_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/mxnet/util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
  copying byteps/common/common.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/common.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/communicator.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/communicator.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/core_loops.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/core_loops.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/cpu_reducer.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/cpu_reducer.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/global.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/global.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/logging.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/logging.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/nccl_manager.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/nccl_manager.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/operations.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/operations.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/ready_table.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/ready_table.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/scheduled_queue.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/scheduled_queue.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/shared_memory.cc -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/common/shared_memory.h -> build/lib.linux-x86_64-3.6/byteps/common
  copying byteps/tensorflow/ops.cc -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/tensorflow/ops.h -> build/lib.linux-x86_64-3.6/byteps/tensorflow
  copying byteps/torch/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/adapter.h -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/handle_manager.cc -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/handle_manager.h -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/ops.cc -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/ops.h -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/torch
  copying byteps/torch/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/torch
  running build_ext
  /tmp/pip-build-cncjh9xt/byteps/setup.py:236: UserWarning:

   No RDMA header file detected. Will disable RDMA for compilation!

    warnings.warn("\n\n No RDMA header file detected. Will disable RDMA for compilation! \n\n")
  error: [Errno 2] No such file or directory: '3rdparty/ps-lite': '3rdparty/ps-lite'

  ----------------------------------------
  Running setup.py clean for byteps
Failed to build byteps
Installing collected packages: byteps
  Running setup.py install for byteps: started
    Running setup.py install for byteps: finished with status 'error'
    Complete output from command /usr/bin/python3 -u -c "import setuptools, tokenize;__file__='/tmp/pip-build-cncjh9xt/byteps/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-26sn7f0c-record/install-record.txt --single-version-externally-managed --compile --user --prefix=:
    /usr/lib/python3.6/distutils/dist.py:261: UserWarning: Unknown distribution option: 'long_description_content_type'
      warnings.warn(msg)
    running install
    running build
    running build_py
    creating build
    creating build/lib.linux-x86_64-3.6
    creating build/lib.linux-x86_64-3.6/byteps
    copying byteps/__init__.py -> build/lib.linux-x86_64-3.6/byteps
    copying byteps/__version__.py -> build/lib.linux-x86_64-3.6/byteps
    creating build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/__init__.py -> build/lib.linux-x86_64-3.6/byteps/server
    creating build/lib.linux-x86_64-3.6/byteps/_keras
    copying byteps/_keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/_keras
    copying byteps/_keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/_keras
    creating build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/__init__.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
    creating build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/__init__.py -> build/lib.linux-x86_64-3.6/byteps/common
    creating build/lib.linux-x86_64-3.6/byteps/keras
    copying byteps/keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/keras
    copying byteps/keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/keras
    creating build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/compression.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/__init__.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/ops.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/util.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    creating build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/compression.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/__init__.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cross_barrier.py -> build/lib.linux-x86_64-3.6/byteps/torch
    creating build/lib.linux-x86_64-3.6/byteps/misc
    copying byteps/misc/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc
    creating build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
    copying byteps/misc/imagenet18/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
    running egg_info
    writing byteps.egg-info/PKG-INFO
    writing dependency_links to byteps.egg-info/dependency_links.txt
    writing top-level names to byteps.egg-info/top_level.txt
    reading manifest file 'byteps.egg-info/SOURCES.txt'
    reading manifest template 'MANIFEST.in'
    warning: no previously-included files found matching '.git/*'
    warning: no directories found matching '3rdparty/ps-lite'
    writing manifest file 'byteps.egg-info/SOURCES.txt'
    copying byteps/server/queue.h -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/server.cc -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/server.h -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/mxnet/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/adapter.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/tensor_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/tensor_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/common/common.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/common.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/communicator.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/communicator.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/core_loops.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/core_loops.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/cpu_reducer.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/cpu_reducer.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/global.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/global.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/logging.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/logging.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/nccl_manager.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/nccl_manager.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/operations.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/operations.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/ready_table.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/ready_table.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/scheduled_queue.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/scheduled_queue.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/shared_memory.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/shared_memory.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/tensorflow/ops.cc -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/ops.h -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/torch/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/adapter.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/handle_manager.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/handle_manager.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/torch
    running build_ext
    /tmp/pip-build-cncjh9xt/byteps/setup.py:236: UserWarning:

     No RDMA header file detected. Will disable RDMA for compilation!

      warnings.warn("\n\n No RDMA header file detected. Will disable RDMA for compilation! \n\n")
    error: [Errno 2] No such file or directory: '3rdparty/ps-lite': '3rdparty/ps-lite'

    ----------------------------------------
Stderr:   Failed building wheel for byteps
Command "/usr/bin/python3 -u -c "import setuptools, tokenize;__file__='/tmp/pip-build-cncjh9xt/byteps/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-26sn7f0c-record/install-record.txt --single-version-externally-managed --compile --user --prefix=" failed with error code 1 in /tmp/pip-build-cncjh9xt/byteps/

To Reproduce Steps to reproduce the behavior:

  1. See error

Environment (please complete the following information):

c5n.xlarge

jasperzhong commented 4 years ago

It seems that this is because byteps-0.2.1.tar.gz does not include 3rdparty/ps-lite.
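One way to confirm this diagnosis is to list the contents of the downloaded source archive and look for the directory. The snippet below is only a minimal sketch: the helper name `sdist_contains` is made up for illustration, and it assumes the archive is a gzip-compressed tarball whose members all sit under a single top-level `<name>-<version>/` folder, as is usual for sdists.

```python
import tarfile


def sdist_contains(sdist_path, subdir):
    """Return True if any member of the sdist lives at or under `subdir`.

    `sdist_contains` is a hypothetical helper, not part of pip or byteps.
    """
    needle = subdir.strip("/")
    with tarfile.open(sdist_path, "r:gz") as archive:
        for name in archive.getnames():
            # Members are assumed to be prefixed with "<name>-<version>/",
            # so drop the first path component before comparing.
            _, _, relative = name.partition("/")
            if relative == needle or relative.startswith(needle + "/"):
                return True
    return False
```

Calling `sdist_contains("byteps-0.2.1.tar.gz", "3rdparty/ps-lite")` on a local copy of the archive should return `False` if the directory really was left out of the release.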

ymjiang commented 4 years ago

We will fix this ASAP.

ymjiang commented 4 years ago

@zhongyuchen We just published a new pip release. Can you please try pip3 install byteps==0.2.2?

jasperzhong commented 4 years ago

It works on a g4dn instance, which has a GPU and MXNet, but fails on the c5n.xlarge. I want the c5n.xlarge to be a server node.

Do I have to install one of TensorFlow/MXNet/PyTorch on the server node?

I have set the following environment variables on the c5n.xlarge:

export BYTEPS_WITHOUT_PYTORCH=1
export BYTEPS_WITHOUT_TENSORFLOW=1
export BYTEPS_WITHOUT_MXNET=1
The directory '/home/ubuntu/.cache/pip/http' or its parent directory is not owned by the current user and the cache has been disabled. Please check the permissions and owner of that directory. If executing pip with sudo, you may want sudo's -H flag.
The directory '/home/ubuntu/.cache/pip' or its parent directory is not owned by the current user and caching wheels has been disabled. check the permissions and owner of that directory. If executing pip with sudo, you may want sudo's -H flag.
Collecting byteps==0.2.2
  Downloading https://files.pythonhosted.org/packages/71/74/fb0c1edcc8ebf58a7000446536924458a940a43b5072eac91942155f1330/byteps-0.2.2.tar.gz (231kB)
Installing collected packages: byteps
  Running setup.py install for byteps: started
    Running setup.py install for byteps: finished with status 'error'
    Complete output from command /usr/bin/python3 -u -c "import setuptools, tokenize;__file__='/tmp/pip-build-p12hg9z0/byteps/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-lrsfn6ih-record/install-record.txt --single-version-externally-managed --compile:
    /usr/lib/python3.6/distutils/dist.py:261: UserWarning: Unknown distribution option: 'long_description_content_type'
      warnings.warn(msg)
    running install
    running build
    running build_py
    creating build
    creating build/lib.linux-x86_64-3.6
    creating build/lib.linux-x86_64-3.6/byteps
    copying byteps/__init__.py -> build/lib.linux-x86_64-3.6/byteps
    copying byteps/__version__.py -> build/lib.linux-x86_64-3.6/byteps
    creating build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/__init__.py -> build/lib.linux-x86_64-3.6/byteps/server
    creating build/lib.linux-x86_64-3.6/byteps/_keras
    copying byteps/_keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/_keras
    copying byteps/_keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/_keras
    creating build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/__init__.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.py -> build/lib.linux-x86_64-3.6/byteps/mxnet
    creating build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/__init__.py -> build/lib.linux-x86_64-3.6/byteps/common
    creating build/lib.linux-x86_64-3.6/byteps/keras
    copying byteps/keras/callbacks.py -> build/lib.linux-x86_64-3.6/byteps/keras
    copying byteps/keras/__init__.py -> build/lib.linux-x86_64-3.6/byteps/keras
    creating build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/compression.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/__init__.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/ops.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/util.py -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    creating build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/compression.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/__init__.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.py -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cross_barrier.py -> build/lib.linux-x86_64-3.6/byteps/torch
    creating build/lib.linux-x86_64-3.6/byteps/misc
    copying byteps/misc/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc
    creating build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
    copying byteps/misc/imagenet18/__init__.py -> build/lib.linux-x86_64-3.6/byteps/misc/imagenet18
    running egg_info
    writing byteps.egg-info/PKG-INFO
    writing dependency_links to byteps.egg-info/dependency_links.txt
    writing top-level names to byteps.egg-info/top_level.txt
    reading manifest file 'byteps.egg-info/SOURCES.txt'
    reading manifest template 'MANIFEST.in'
    warning: no previously-included files found matching '.git/*'
    writing manifest file 'byteps.egg-info/SOURCES.txt'
    copying byteps/server/queue.h -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/server.cc -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/server/server.h -> build/lib.linux-x86_64-3.6/byteps/server
    copying byteps/mxnet/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/adapter.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ops.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/tensor_util.cc -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/tensor_util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/mxnet/util.h -> build/lib.linux-x86_64-3.6/byteps/mxnet
    copying byteps/common/common.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/common.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/communicator.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/communicator.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/core_loops.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/core_loops.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/cpu_reducer.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/cpu_reducer.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/global.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/global.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/logging.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/logging.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/nccl_manager.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/nccl_manager.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/operations.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/operations.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/ready_table.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/ready_table.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/scheduled_queue.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/scheduled_queue.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/shared_memory.cc -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/common/shared_memory.h -> build/lib.linux-x86_64-3.6/byteps/common
    copying byteps/tensorflow/ops.cc -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/tensorflow/ops.h -> build/lib.linux-x86_64-3.6/byteps/tensorflow
    copying byteps/torch/adapter.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/adapter.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cuda_util.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/cuda_util.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/handle_manager.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/handle_manager.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ops.h -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ready_event.cc -> build/lib.linux-x86_64-3.6/byteps/torch
    copying byteps/torch/ready_event.h -> build/lib.linux-x86_64-3.6/byteps/torch
    running build_ext
    /tmp/pip-build-p12hg9z0/byteps/setup.py:236: UserWarning:

     No RDMA header file detected. Will disable RDMA for compilation!

      warnings.warn("\n\n No RDMA header file detected. Will disable RDMA for compilation! \n\n")
    rm -rf zeromq-4.1.4.tar.gz zeromq-4.1.4
    wget https://raw.githubusercontent.com/mli/deps/master/build/zeromq-4.1.4.tar.gz && tar --no-same-owner -zxf zeromq-4.1.4.tar.gz
    --2020-03-11 09:23:17--  https://raw.githubusercontent.com/mli/deps/master/build/zeromq-4.1.4.tar.gz
    Resolving raw.githubusercontent.com (raw.githubusercontent.com)... 151.101.248.133
    Connecting to raw.githubusercontent.com (raw.githubusercontent.com)|151.101.248.133|:443... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: 1400012 (1.3M) [application/octet-stream]
    Saving to: ‘zeromq-4.1.4.tar.gz’

         0K .......... .......... .......... .......... ..........  3% 44.8M 0s
        50K .......... .......... .......... .......... ..........  7% 56.1M 0s
       100K .......... .......... .......... .......... .......... 10% 65.1M 0s
       150K .......... .......... .......... .......... .......... 14% 56.9M 0s
       200K .......... .......... .......... .......... .......... 18% 68.3M 0s
       250K .......... .......... .......... .......... .......... 21% 81.2M 0s
       300K .......... .......... .......... .......... .......... 25% 71.4M 0s
       350K .......... .......... .......... .......... .......... 29% 56.8M 0s
       400K .......... .......... .......... .......... .......... 32% 94.5M 0s
       450K .......... .......... .......... .......... .......... 36% 81.7M 0s
       500K .......... .......... .......... .......... .......... 40% 73.6M 0s
       550K .......... .......... .......... .......... .......... 43% 92.7M 0s
       600K .......... .......... .......... .......... .......... 47% 80.1M 0s
       650K .......... .......... .......... .......... .......... 51%  162M 0s
       700K .......... .......... .......... .......... .......... 54% 93.0M 0s
       750K .......... .......... .......... .......... .......... 58% 97.5M 0s
       800K .......... .......... .......... .......... .......... 62% 97.7M 0s
       850K .......... .......... .......... .......... .......... 65% 96.5M 0s
       900K .......... .......... .......... .......... .......... 69%  154M 0s
       950K .......... .......... .......... .......... .......... 73% 95.3M 0s
      1000K .......... .......... .......... .......... .......... 76%  167M 0s
      1050K .......... .......... .......... .......... .......... 80%  141M 0s
      1100K .......... .......... .......... .......... .......... 84%  121M 0s
      1150K .......... .......... .......... .......... .......... 87%  102M 0s
      1200K .......... .......... .......... .......... .......... 91%  154M 0s
      1250K .......... .......... .......... .......... .......... 95%  104M 0s
      1300K .......... .......... .......... .......... .......... 98%  138M 0s
      1350K .......... .......                                    100%  144M=0.02s

    2020-03-11 09:23:17 (87.4 MB/s) - ‘zeromq-4.1.4.tar.gz’ saved [1400012/1400012]

    cd zeromq-4.1.4 && export CFLAGS=-fPIC && export CXXFLAGS=-fPIC && ./configure -prefix=/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps --with-libsodium=no --with-libgssapi_krb5=no && make && make install
    checking for a BSD-compatible install... /usr/bin/install -c
    checking whether build environment is sane... yes
    checking for a thread-safe mkdir -p... /bin/mkdir -p
    checking for gawk... gawk
    checking whether make sets $(MAKE)... yes
    checking whether make supports nested variables... yes
    checking whether UID '0' is supported by ustar format... yes
    checking whether GID '0' is supported by ustar format... yes
    checking how to create a ustar tar archive... gnutar
    checking whether make supports nested variables... (cached) yes
    checking for gcc... gcc
    checking whether the C compiler works... yes
    checking for C compiler default output file name... a.out
    checking for suffix of executables...
    checking whether we are cross compiling... no
    checking for suffix of object files... o
    checking whether we are using the GNU C compiler... yes
    checking whether gcc accepts -g... yes
    checking for gcc option to accept ISO C89... none needed
    checking whether gcc understands -c and -o together... yes
    checking for style of include used by make... GNU
    checking dependency style of gcc... gcc3
    checking for gcc option to accept ISO C99... none needed
    checking for g++... g++
    checking whether we are using the GNU C++ compiler... yes
    checking whether g++ accepts -g... yes
    checking dependency style of g++... gcc3
    checking for a sed that does not truncate output... /bin/sed
    checking for gawk... (cached) gawk
    checking for pkg-config... /usr/bin/pkg-config
    checking pkg-config is at least version 0.9.0... yes
    checking for xmlto... no
    checking for asciidoc... no
    checking build system type... x86_64-unknown-linux-gnu
    checking host system type... x86_64-unknown-linux-gnu
    checking how to print strings... printf
    checking for a sed that does not truncate output... (cached) /bin/sed
    checking for grep that handles long lines and -e... /bin/grep
    checking for egrep... /bin/grep -E
    checking for fgrep... /bin/grep -F
    checking for ld used by gcc... /usr/bin/ld
    checking if the linker (/usr/bin/ld) is GNU ld... yes
    checking for BSD- or MS-compatible name lister (nm)... /usr/bin/nm -B
    checking the name lister (/usr/bin/nm -B) interface... BSD nm
    checking whether ln -s works... yes
    checking the maximum length of command line arguments... 1572864
    checking whether the shell understands some XSI constructs... yes
    checking whether the shell understands "+="... yes
    checking how to convert x86_64-unknown-linux-gnu file names to x86_64-unknown-linux-gnu format... func_convert_file_noop
    checking how to convert x86_64-unknown-linux-gnu file names to toolchain format... func_convert_file_noop
    checking for /usr/bin/ld option to reload object files... -r
    checking for objdump... objdump
    checking how to recognize dependent libraries... pass_all
    checking for dlltool... dlltool
    checking how to associate runtime and link libraries... printf %s\n
    checking for ar... ar
    checking for archiver @FILE support... @
    checking for strip... strip
    checking for ranlib... ranlib
    checking command to parse /usr/bin/nm -B output from gcc object... ok
    checking for sysroot... no
    checking for mt... mt
    checking if mt is a manifest tool... no
    checking how to run the C preprocessor... gcc -E
    checking for ANSI C header files... yes
    checking for sys/types.h... yes
    checking for sys/stat.h... yes
    checking for stdlib.h... yes
    checking for string.h... yes
    checking for memory.h... yes
    checking for strings.h... yes
    checking for inttypes.h... yes
    checking for stdint.h... yes
    checking for unistd.h... yes
    checking for dlfcn.h... yes
    checking for objdir... .libs
    checking if gcc supports -fno-rtti -fno-exceptions... no
    checking for gcc option to produce PIC... -fPIC -DPIC
    checking if gcc PIC flag -fPIC -DPIC works... yes
    checking if gcc static flag -static works... yes
    checking if gcc supports -c -o file.o... yes
    checking if gcc supports -c -o file.o... (cached) yes
    checking whether the gcc linker (/usr/bin/ld -m elf_x86_64) supports shared libraries... yes
    checking whether -lc should be explicitly linked in... no
    checking dynamic linker characteristics... GNU/Linux ld.so
    checking how to hardcode library paths into programs... immediate
    checking whether stripping libraries is possible... yes
    checking if libtool supports shared libraries... yes
    checking whether to build shared libraries... yes
    checking whether to build static libraries... yes
    checking how to run the C++ preprocessor... g++ -E
    checking for ld used by g++... /usr/bin/ld -m elf_x86_64
    checking if the linker (/usr/bin/ld -m elf_x86_64) is GNU ld... yes
    checking whether the g++ linker (/usr/bin/ld -m elf_x86_64) supports shared libraries... yes
    checking for g++ option to produce PIC... -fPIC -DPIC
    checking if g++ PIC flag -fPIC -DPIC works... yes
    checking if g++ static flag -static works... yes
    checking if g++ supports -c -o file.o... yes
    checking if g++ supports -c -o file.o... (cached) yes
    checking whether the g++ linker (/usr/bin/ld -m elf_x86_64) supports shared libraries... yes
    checking dynamic linker characteristics... (cached) GNU/Linux ld.so
    checking how to hardcode library paths into programs... immediate
    checking whether the C compiler works... yes
    checking whether we are using Intel C compiler... no
    checking whether we are using Sun Studio C compiler... no
    checking whether we are using clang C compiler... no
    checking whether we are using gcc >= 4 C compiler... yes
    checking whether the C++ compiler works... yes
    checking whether we are using Intel C++ compiler... no
    checking whether we are using Sun Studio C++ compiler... no
    checking whether we are using clang C++ compiler... no
    checking whether we are using gcc >= 4 C++ compiler... yes
    checking whether to enable debugging information... no
    checking whether to enable code coverage... no
    checking if TIPC is available and supports nonblocking connect... no
    checking for pthread_create in -lpthread... yes
    checking for clock_gettime in -lrt... yes
    checking whether C++ compiler supports -fvisibility=hidden... yes
    checking whether C++ compiler supports dso visibility... yes
    checking for asciidoc... no
    checking for xmlto... no
    checking whether to build documentation... no
    checking whether to install manpages... yes
    checking for suitable polling system... using epoll
    checking for ANSI C header files... (cached) yes
    checking errno.h usability... yes
    checking errno.h presence... yes
    checking for errno.h... yes
    checking time.h usability... yes
    checking time.h presence... yes
    checking for time.h... yes
    checking for unistd.h... (cached) yes
    checking limits.h usability... yes
    checking limits.h presence... yes
    checking for limits.h... yes
    checking stddef.h usability... yes
    checking stddef.h presence... yes
    checking for stddef.h... yes
    checking for stdlib.h... (cached) yes
    checking for string.h... (cached) yes
    checking arpa/inet.h usability... yes
    checking arpa/inet.h presence... yes
    checking for arpa/inet.h... yes
    checking netinet/tcp.h usability... yes
    checking netinet/tcp.h presence... yes
    checking for netinet/tcp.h... yes
    checking netinet/in.h usability... yes
    checking netinet/in.h presence... yes
    checking for netinet/in.h... yes
    checking sys/socket.h usability... yes
    checking sys/socket.h presence... yes
    checking for sys/socket.h... yes
    checking sys/time.h usability... yes
    checking sys/time.h presence... yes
    checking for sys/time.h... yes
    checking ifaddrs.h usability... yes
    checking ifaddrs.h presence... yes
    checking for ifaddrs.h... yes
    checking sys/uio.h usability... yes
    checking sys/uio.h presence... yes
    checking for sys/uio.h... yes
    checking sys/eventfd.h usability... yes
    checking sys/eventfd.h presence... yes
    checking for sys/eventfd.h... yes
    checking whether SO_PEERCRED is declared... yes
    checking whether LOCAL_PEERCRED is declared... no
    checking for stdbool.h that conforms to C99... no
    checking for _Bool... no
    checking for an ANSI C-conforming const... yes
    checking for inline... inline
    checking for size_t... yes
    checking for ssize_t... yes
    checking whether time.h and sys/time.h may both be included... yes
    checking for uint32_t... yes
    checking for working volatile... yes
    checking "with_norm_ext = no"... checking how to enable additional warnings for C++ compiler... -Wall
    checking how to turn warnings to errors in C++ compiler... -Werror
    checking how to enable strict standards compliance in C++ compiler... -pedantic
    checking return type of signal handlers... void
    checking for perror... yes
    checking for gettimeofday... yes
    checking for clock_gettime... yes
    checking for memset... no
    checking for socket... yes
    checking for getifaddrs... yes
    checking for freeifaddrs... yes
    checking for fork... no
    checking alloca.h usability... yes
    checking alloca.h presence... yes
    checking for alloca.h... yes
    checking whether SOCK_CLOEXEC is supported... yes
    checking whether SO_KEEPALIVE is supported... yes
    checking whether TCP_KEEPCNT is supported... yes
    checking whether TCP_KEEPIDLE is supported... yes
    checking whether TCP_KEEPINTVL is supported... yes
    checking whether TCP_KEEPALIVE is supported... no
    checking that generated files are newer than configure... done
    configure: creating ./config.status
    config.status: creating Makefile
    config.status: creating src/libzmq.pc
    config.status: creating doc/Makefile
    config.status: creating builds/msvc/Makefile
    config.status: creating builds/redhat/zeromq.spec
    config.status: creating src/platform.hpp
    config.status: executing depfiles commands
    config.status: executing libtool commands
    make[1]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    Making all in doc
    make[2]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[2]: Nothing to be done for 'all'.
    make[2]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[2]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
      CXX      src/libzmq_la-address.lo
      CXX      src/libzmq_la-clock.lo
      CXX      src/libzmq_la-curve_client.lo
      CXX      src/libzmq_la-devpoll.lo
      CXX      src/libzmq_la-dist.lo
      CXX      src/libzmq_la-dealer.lo
      CXX      src/libzmq_la-gssapi_server.lo
      CXX      src/libzmq_la-io_object.lo
      CXX      src/libzmq_la-ipc_address.lo
      CXX      src/libzmq_la-mailbox.lo
      CXX      src/libzmq_la-fq.lo
      CXX      src/libzmq_la-epoll.lo
      CXX      src/libzmq_la-err.lo
      CXX      src/libzmq_la-mtrie.lo
      CXX      src/libzmq_la-ip.lo
      CXX      src/libzmq_la-ipc_connecter.lo
      CXX      src/libzmq_la-null_mechanism.lo
      CXX      src/libzmq_la-mechanism.lo
      CXX      src/libzmq_la-msg.lo
      CXX      src/libzmq_la-options.lo
      CXX      src/libzmq_la-io_thread.lo
      CXX      src/libzmq_la-pgm_receiver.lo
      CXX      src/libzmq_la-own.lo
      CXX      src/libzmq_la-metadata.lo
      CXX      src/libzmq_la-object.lo
      CXX      src/libzmq_la-pgm_sender.lo
      CXX      src/libzmq_la-pair.lo
      CXX      src/libzmq_la-lb.lo
      CXX      src/libzmq_la-gssapi_mechanism_base.lo
      CXX      src/libzmq_la-plain_server.lo
      CXX      src/libzmq_la-plain_client.lo
      CXX      src/libzmq_la-norm_engine.lo
      CXX      src/libzmq_la-poller_base.lo
      CXX      src/libzmq_la-gssapi_client.lo
      CXX      src/libzmq_la-pgm_socket.lo
      CXX      src/libzmq_la-poll.lo
      CXX      src/libzmq_la-kqueue.lo
      CXX      src/libzmq_la-pull.lo
      CXX      src/libzmq_la-proxy.lo
      CXX      src/libzmq_la-pub.lo
      CXX      src/libzmq_la-pipe.lo
      CXX      src/libzmq_la-ipc_listener.lo
      CXX      src/libzmq_la-push.lo
      CXX      src/libzmq_la-curve_server.lo
      CXX      src/libzmq_la-router.lo
      CXX      src/libzmq_la-raw_decoder.lo
      CXX      src/libzmq_la-raw_encoder.lo
      CXX      src/libzmq_la-socks_connecter.lo
      CXX      src/libzmq_la-socks.lo
      CXX      src/libzmq_la-tcp_address.lo
      CXX      src/libzmq_la-tcp_connecter.lo
      CXX      src/libzmq_la-stream_engine.lo
      CXX      src/libzmq_la-socket_base.lo
      CXX      src/libzmq_la-select.lo
      CXX      src/libzmq_la-session_base.lo
      CXX      src/libzmq_la-signaler.lo
      CXX      src/libzmq_la-stream.lo
      CXX      src/libzmq_la-rep.lo
      CXX      src/libzmq_la-reaper.lo
      CXX      src/libzmq_la-sub.lo
      CXX      src/libzmq_la-random.lo
      CXX      src/libzmq_la-req.lo
      CXX      src/libzmq_la-ctx.lo
      CXX      src/libzmq_la-tcp.lo
      CXX      src/libzmq_la-tcp_listener.lo
      CXX      src/libzmq_la-v1_encoder.lo
      CXX      src/libzmq_la-tipc_connecter.lo
      CXX      src/libzmq_la-zmq.lo
      CXX      src/libzmq_la-tipc_listener.lo
      CXX      src/libzmq_la-tipc_address.lo
      CXX      src/libzmq_la-v2_encoder.lo
      CXX      src/libzmq_la-v1_decoder.lo
      CXX      src/libzmq_la-v2_decoder.lo
      CXX      src/libzmq_la-zmq_utils.lo
      CXX      perf/inproc_thr.o
      CXX      tools/curve_keygen.o
      CXX      src/libzmq_la-xpub.lo
      CXX      src/libzmq_la-xsub.lo
      CXX      perf/local_thr.o
      CXX      perf/inproc_lat.o
      CXX      perf/remote_lat.o
      CXX      src/libzmq_la-trie.lo
      CXX      src/libzmq_la-thread.lo
      CXX      perf/local_lat.o
      CXX      perf/remote_thr.o
      CXXLD    libzmq.la
    ar: `u' modifier ignored since `D' is the default (see `U')
      CXXLD    curve_keygen
      CXXLD    remote_thr
      CXXLD    inproc_thr
      CXXLD    local_thr
      CXXLD    local_lat
      CXXLD    remote_lat
      CXXLD    inproc_lat
    make[2]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    make[1]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    make[1]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    Making install in doc
    make[2]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[3]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[3]: Nothing to be done for 'install-exec-am'.
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/share/man/man3'
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/share/man/man7'
     /usr/bin/install -c -m 644 zmq.7 zmq_tcp.7 zmq_pgm.7 zmq_inproc.7 zmq_ipc.7 zmq_null.7 zmq_plain.7 zmq_curve.7 zmq_tipc.7 '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/share/man/man7'
     /usr/bin/install -c -m 644 zmq_bind.3 zmq_unbind.3 zmq_connect.3 zmq_disconnect.3 zmq_close.3 zmq_ctx_new.3 zmq_ctx_term.3 zmq_ctx_get.3 zmq_ctx_set.3 zmq_ctx_shutdown.3 zmq_msg_init.3 zmq_msg_init_data.3 zmq_msg_init_size.3 zmq_msg_move.3 zmq_msg_copy.3 zmq_msg_size.3 zmq_msg_data.3 zmq_msg_close.3 zmq_msg_send.3 zmq_msg_recv.3 zmq_send.3 zmq_recv.3 zmq_send_const.3 zmq_msg_get.3 zmq_msg_set.3 zmq_msg_more.3 zmq_msg_gets.3 zmq_getsockopt.3 zmq_setsockopt.3 zmq_socket.3 zmq_socket_monitor.3 zmq_poll.3 zmq_errno.3 zmq_strerror.3 zmq_version.3 zmq_sendmsg.3 zmq_recvmsg.3 zmq_proxy.3 zmq_proxy_steerable.3 zmq_z85_encode.3 '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/share/man/man3'
     /usr/bin/install -c -m 644 zmq_z85_decode.3 zmq_curve_keypair.3 zmq_has.3 '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/share/man/man3'
    make[3]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[2]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4/doc'
    make[2]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    make[3]: Entering directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib'
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include'
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/pkgconfig'
     /bin/bash ./libtool   --mode=install /usr/bin/install -c   libzmq.la '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib'
     /usr/bin/install -c -m 644 include/zmq.h include/zmq_utils.h '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include'
     /usr/bin/install -c -m 644 src/libzmq.pc '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/pkgconfig'
    libtool: install: /usr/bin/install -c .libs/libzmq.so.5.0.0 /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/libzmq.so.5.0.0
    libtool: install: (cd /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib && { ln -s -f libzmq.so.5.0.0 libzmq.so.5 || { rm -f libzmq.so.5 && ln -s libzmq.so.5.0.0 libzmq.so.5; }; })
    libtool: install: (cd /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib && { ln -s -f libzmq.so.5.0.0 libzmq.so || { rm -f libzmq.so && ln -s libzmq.so.5.0.0 libzmq.so; }; })
    libtool: install: /usr/bin/install -c .libs/libzmq.lai /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/libzmq.la
    libtool: install: /usr/bin/install -c .libs/libzmq.a /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/libzmq.a
    libtool: install: chmod 644 /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/libzmq.a
    libtool: install: ranlib /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib/libzmq.a
    libtool: finish: PATH="/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/snap/bin:/sbin" ldconfig -n /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib
    ----------------------------------------------------------------------
    Libraries have been installed in:
       /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib

    If you ever happen to want to link against installed libraries
    in a given directory, LIBDIR, you must either use libtool, and
    specify the full pathname of the library, or use the `-LLIBDIR'
    flag during linking and do at least one of the following:
       - add LIBDIR to the `LD_LIBRARY_PATH' environment variable
         during execution
       - add LIBDIR to the `LD_RUN_PATH' environment variable
         during linking
       - use the `-Wl,-rpath -Wl,LIBDIR' linker flag
       - have your system administrator add LIBDIR to `/etc/ld.so.conf'

    See any operating system documentation about shared libraries for
    more information, such as the ld(1) and ld.so(8) manual pages.
    ----------------------------------------------------------------------
     /bin/mkdir -p '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/bin'
      /bin/bash ./libtool   --mode=install /usr/bin/install -c curve_keygen '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/bin'
    libtool: install: /usr/bin/install -c .libs/curve_keygen /tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/bin/curve_keygen
    make[3]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    make[2]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    make[1]: Leaving directory '/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/zeromq-4.1.4'
    rm -rf zeromq-4.1.4.tar.gz zeromq-4.1.4
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include -MM -MT build/customer.o src/customer.cc >build/customer.d
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include -MM -MT build/postoffice.o src/postoffice.cc >build/postoffice.d
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include -MM -MT build/van.o src/van.cc >build/van.d
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -pthread -lrt -c src/postoffice.cc -o build/postoffice.o
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -pthread -lrt -c src/customer.cc -o build/customer.o
    g++ -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -pthread -lrt -c src/van.cc -o build/van.o
    In file included from ./include/ps/internal/customer.h:14:0,
                     from src/customer.cc:4:
    ./include/ps/internal/message.h: In destructor ‘ps::Customer::~Customer()’:
    ./include/ps/internal/message.h:101:8: warning: ‘*((void*)(& msg)+88).ps::Control::barrier_group’ may be used uninitialized in this function [-Wmaybe-uninitialized]
     struct Control {
            ^~~~~~~
    ./include/ps/internal/message.h:101:8: warning: ‘*((void*)(& msg)+88).ps::Control::msg_sig’ may be used uninitialized in this function [-Wmaybe-uninitialized]
    ./include/ps/internal/message.h:136:8: warning: ‘msg.ps::Meta::key’ may be used uninitialized in this function [-Wmaybe-uninitialized]
     struct Meta {
            ^~~~
    ./include/ps/internal/message.h:136:8: warning: ‘msg.ps::Meta::val_len’ may be used uninitialized in this function [-Wmaybe-uninitialized]
    ./include/ps/internal/message.h:136:8: warning: ‘msg.ps::Meta::option’ may be used uninitialized in this function [-Wmaybe-uninitialized]
    ar crv build/libps.a build/customer.o build/postoffice.o build/van.o
    a - build/customer.o
    a - build/postoffice.o
    a - build/van.o
    g++ -std=c++0x -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -MM -MT tests/test_benchmark tests/test_benchmark.cc >tests/test_benchmark.d -pthread -lrt
    g++ -std=c++0x -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -MM -MT tests/test_ipc_benchmark tests/test_ipc_benchmark.cc >tests/test_ipc_benchmark.d -pthread -lrt
    g++ -std=c++0x -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -o tests/test_benchmark tests/test_benchmark.cc build/libps.a -Wl,-rpath,/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib -L/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib -lzmq -pthread -pthread -lrt
    g++ -std=c++0x -std=c++14 -msse2 -fPIC -O3 -ggdb -Wall -finline-functions -I./src -I./include -I/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/include  -o tests/test_ipc_benchmark tests/test_ipc_benchmark.cc build/libps.a -Wl,-rpath,/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib -L/tmp/pip-build-p12hg9z0/byteps/3rdparty/ps-lite/deps/lib -lzmq -pthread -pthread -lrt
    x86_64-linux-gnu-gcc -pthread -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -std=c++11 -fPIC -O2 -Wall -fopenmp -mf16c -mavx -I/usr/include/python3.6m -c build/temp.linux-x86_64-3.6/test_compile/test_cpp_flags.cc -o build/temp.linux-x86_64-3.6/test_compile/test_cpp_flags.o
    x86_64-linux-gnu-gcc -pthread -shared -Wl,-O1 -Wl,-Bsymbolic-functions -Wl,-Bsymbolic-functions -Wl,-z,relro -Wl,-Bsymbolic-functions -Wl,-z,relro -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 build/temp.linux-x86_64-3.6/test_compile/test_cpp_flags.o -o build/temp.linux-x86_64-3.6/test_compile/test_cpp_flags.so
    x86_64-linux-gnu-gcc -pthread -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -I/usr/include/python3.6m -c build/temp.linux-x86_64-3.6/test_compile/test_link_flags.cc -o build/temp.linux-x86_64-3.6/test_compile/test_link_flags.o
    x86_64-linux-gnu-gcc -pthread -shared -Wl,-O1 -Wl,-Bsymbolic-functions -Wl,-Bsymbolic-functions -Wl,-z,relro -Wl,-Bsymbolic-functions -Wl,-z,relro -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,--version-script=byteps.lds -fopenmp build/temp.linux-x86_64-3.6/test_compile/test_link_flags.o -o build/temp.linux-x86_64-3.6/test_compile/test_link_flags.so
    building 'byteps.server.c_lib' extension
    creating build/temp.linux-x86_64-3.6/byteps
    creating build/temp.linux-x86_64-3.6/byteps/server
    creating build/temp.linux-x86_64-3.6/byteps/common
    x86_64-linux-gnu-gcc -pthread -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -DEIGEN_MPL2_ONLY=1 -I3rdparty/ps-lite/include -I/usr/local/nccl/include -I/usr/include/python3.6m -c byteps/server/server.cc -o build/temp.linux-x86_64-3.6/byteps/server/server.o -std=c++11 -fPIC -O2 -Wall -fopenmp -mf16c -mavx -DBYTEPS_BUILDING_SERVER
    In file included from byteps/server/server.h:25:0,
                     from byteps/server/server.cc:16:
    byteps/server/../common/cpu_reducer.h: In member function ‘void byteps::common::CpuReducer::HalfBits2Float(short unsigned int*, float*)’:
    byteps/server/../common/cpu_reducer.h:112:46: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
         *res = *reinterpret_cast<float const*>(&f);
                                                  ^
    In file included from 3rdparty/ps-lite/include/ps/internal/utils.h:6:0,
                     from 3rdparty/ps-lite/include/ps/base.h:7,
                     from 3rdparty/ps-lite/include/ps/ps.h:9,
                     from byteps/server/server.h:24,
                     from byteps/server/server.cc:16:
    byteps/server/server.h: In function ‘size_t byteps::server::GetThreadID(uint64_t, size_t)’:
    3rdparty/ps-lite/include/dmlc/logging.h:79:34: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]
     #define CHECK_LT(x, y) CHECK((x) < (y))
                                  ~~~~^~~~~
    3rdparty/ps-lite/include/dmlc/logging.h:75:9: note: in definition of macro ‘CHECK’
       if (!(x))                                                           \
             ^
    byteps/server/server.h:166:3: note: in expansion of macro ‘CHECK_LT’
       CHECK_LT(min_index, engine_thread_num_);
       ^~~~~~~~
    x86_64-linux-gnu-gcc -pthread -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -DEIGEN_MPL2_ONLY=1 -I3rdparty/ps-lite/include -I/usr/local/nccl/include -I/usr/include/python3.6m -c byteps/common/cpu_reducer.cc -o build/temp.linux-x86_64-3.6/byteps/common/cpu_reducer.o -std=c++11 -fPIC -O2 -Wall -fopenmp -mf16c -mavx -DBYTEPS_BUILDING_SERVER
    In file included from byteps/common/cpu_reducer.cc:20:0:
    byteps/common/cpu_reducer.h: In member function ‘void byteps::common::CpuReducer::HalfBits2Float(short unsigned int*, float*)’:
    byteps/common/cpu_reducer.h:112:46: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
         *res = *reinterpret_cast<float const*>(&f);
                                                  ^
    x86_64-linux-gnu-gcc -pthread -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -DEIGEN_MPL2_ONLY=1 -I3rdparty/ps-lite/include -I/usr/local/nccl/include -I/usr/include/python3.6m -c byteps/common/logging.cc -o build/temp.linux-x86_64-3.6/byteps/common/logging.o -std=c++11 -fPIC -O2 -Wall -fopenmp -mf16c -mavx -DBYTEPS_BUILDING_SERVER
    x86_64-linux-gnu-g++ -pthread -shared -Wl,-O1 -Wl,-Bsymbolic-functions -Wl,-Bsymbolic-functions -Wl,-z,relro -Wl,-Bsymbolic-functions -Wl,-z,relro -g -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 build/temp.linux-x86_64-3.6/byteps/server/server.o build/temp.linux-x86_64-3.6/byteps/common/cpu_reducer.o build/temp.linux-x86_64-3.6/byteps/common/logging.o 3rdparty/ps-lite/build/libps.a 3rdparty/ps-lite/deps/lib/libzmq.a -L/usr/local/nccl/lib -L/usr/local/nccl/lib64 -o build/lib.linux-x86_64-3.6/byteps/server/c_lib.cpython-36m-x86_64-linux-gnu.so -Wl,--version-script=byteps.lds -fopenmp
    INFO: Unable to build TensorFlow plugin, will skip it.

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 316, in check_tf_version
        import tensorflow as tf
    ModuleNotFoundError: No module named 'tensorflow'

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 863, in build_extensions
        build_tf_extension(self, options)
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 427, in build_tf_extension
        check_tf_version()
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 323, in check_tf_version
        'import tensorflow failed, is it installed?\n\n%s' % traceback.format_exc())
    distutils.errors.DistutilsPlatformError: import tensorflow failed, is it installed?

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 316, in check_tf_version
        import tensorflow as tf
    ModuleNotFoundError: No module named 'tensorflow'

    import torch failed, is it installed?

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 701, in check_torch_version
        import torch
    ModuleNotFoundError: No module named 'torch'

    INFO: Unable to build PyTorch plugin, will skip it.

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 874, in build_extensions
        torch_version = check_torch_version()
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 711, in check_torch_version
        version = parse_version(torch.__version__)
    UnboundLocalError: local variable 'torch' referenced before assignment

    sh: 1: cd: can't cd to /usr/local/cuda/lib64/stubs
    INFO: Unable to build MXNet plugin, will skip it.

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 455, in check_mx_version
        import mxnet as mx
    ModuleNotFoundError: No module named 'mxnet'

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 892, in build_extensions
        build_mx_extension(self, options)
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 628, in build_mx_extension
        check_mx_version()
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 462, in check_mx_version
        'import mxnet failed, is it installed?\n\n%s' % traceback.format_exc())
    distutils.errors.DistutilsPlatformError: import mxnet failed, is it installed?

    Traceback (most recent call last):
      File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 455, in check_mx_version
        import mxnet as mx
    ModuleNotFoundError: No module named 'mxnet'

    error: None of TensorFlow, MXNet, PyTorch plugins were built. See errors above.

    ----------------------------------------
Command "/usr/bin/python3 -u -c "import setuptools, tokenize;__file__='/tmp/pip-build-p12hg9z0/byteps/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-lrsfn6ih-record/install-record.txt --single-version-externally-managed --compile" failed with error code 1 in /tmp/pip-build-p12hg9z0/byteps/

ymjiang commented 4 years ago

Do I have to install one of tensorflow/mxnet/pytorch on the server node?

I have set the environment variables for the c5n.xlarge, including:

export BYTEPS_WITHOUT_PYTORCH=1
export BYTEPS_WITHOUT_TENSORFLOW=1
export BYTEPS_WITHOUT_MXNET=1

Installing one of tensorflow/mxnet/pytorch on the server should fix it. Still, the problem looks strange to me: if those environment variables were set, these parts of the build should have been skipped. Your log shows the following, which indicates the variables did not take effect:

Traceback (most recent call last):
  File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 863, in build_extensions
    build_tf_extension(self, options)
  File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 427, in build_tf_extension
    check_tf_version()
  File "/tmp/pip-build-p12hg9z0/byteps/setup.py", line 323, in check_tf_version
    'import tensorflow failed, is it installed?\n\n%s' % traceback.format_exc())
distutils.errors.DistutilsPlatformError: import tensorflow failed, is it installed?

Could you please double check whether the envs have been set?
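
One way to double check is to print the three variables as the installing process would see them. This is only a minimal sketch that assumes the variable names quoted above; the helper name `byteps_skip_flags` is hypothetical and not part of BytePS.

```python
import os

# Variable names taken from the exports quoted in this thread.
BYTEPS_SKIP_VARS = (
    "BYTEPS_WITHOUT_PYTORCH",
    "BYTEPS_WITHOUT_TENSORFLOW",
    "BYTEPS_WITHOUT_MXNET",
)


def byteps_skip_flags(environ=None):
    """Return {name: value or None} for each skip variable (hypothetical helper)."""
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in BYTEPS_SKIP_VARS}


if __name__ == "__main__":
    # Print what the current process actually sees.
    for name, value in byteps_skip_flags().items():
        print(f"{name}={value if value is not None else '<unset>'}")
```

Run it with the same interpreter and in the same way as the install command (same shell, same user, same prefix) and compare the output with what was exported.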

jasperzhong commented 4 years ago

Oh, I used sudo at that time, so the user's environment variables were skipped. Thanks!
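
For anyone hitting the same thing: with sudo's default `env_reset` policy the command starts with a mostly fresh environment, so variables exported in the user's shell are not passed through unless sudo is asked to keep them (for example `sudo -E`, where the sudoers policy allows it). The sketch below does not call sudo; it only simulates the effect with `subprocess`, and the helper `child_sees` is hypothetical.

```python
import os
import subprocess
import sys


def child_sees(name, env):
    """Start a child Python process with the given environment and
    return the value it sees for `name` (empty string if unset)."""
    probe = "import os, sys; print(os.environ.get(sys.argv[1], ''))"
    result = subprocess.run(
        [sys.executable, "-c", probe, name],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Environment of the calling shell, with the flag exported.
user_env = dict(os.environ, BYTEPS_WITHOUT_MXNET="1")

# Stand-in for a reset environment: the same, minus the BYTEPS_* variables.
reset_env = {k: v for k, v in user_env.items() if not k.startswith("BYTEPS_")}

print("inherited:", repr(child_sees("BYTEPS_WITHOUT_MXNET", user_env)))
print("reset:    ", repr(child_sees("BYTEPS_WITHOUT_MXNET", reset_env)))
```

The child only honours the flag when it is actually present in the environment it was started with, which is why the build above went on to try all three plugins.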