STEllAR-GROUP / hpx

The C++ Standard Library for Parallelism and Concurrency
https://hpx.stellar-group.org
Boost Software License 1.0
2.51k stars 428 forks source link

setcap cap_sys_rawio=ep for power profiling causes an HPX application to abort #991

Closed sameershende closed 10 years ago

sameershende commented 10 years ago

For the same executable, I used: ariel01:~/scr> ./factorial factorial(10) == 3628800 elapsed time == 0.00140378 [s] ariel01:~/scr>ariel01:~/scr> sudo /sbin/setcap cap_sys_rawio=ep ./factorial ariel01:~/scr> ./factorial

{stack-trace}: 15 frames: 0x2aaaacd5ce1e : hpx::util::backtrace::backtrace(unsigned long) + 0x80 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacd5cf0c : hpx::util::trace_on_new_stack() + 0x1e in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacd5714e : hpx::detail::backtrace() + 0x18 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacd5ead6 : boost::exception_ptr hpx::detail::get_exceptionhpx::exception(hpx::exception const&, std::string const&, std::string const&, long) + 0x8c in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacd5f203 : void hpx::detail::throw_exceptionhpx::exception(hpx::exception const&, std::string const&, std::string const&, long) + 0x38 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad3afcc3 : hpx::util::get_executable_filename() + 0xc2 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad3afb19 : hpx::util::get_executable_prefix() + 0x19 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad488fc4 : hpx::util::runtime_configuration::pre_initialize_ini() + 0x4a in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad48a209 : hpx::util::runtime_configuration::reconfigure() + 0x19 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad489d51 : hpx::util::runtime_configuration::load_components(std::map<std::string, hpx::util::plugin::dll, std::less, std::allocator<std::pair<std::string const, hpx::util::plugin::dll> > >&) + 0x449 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad0d90a1 : hpx::components::server::runtime_support::load_components() + 0x37 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaad121090 : hpx::util::detail::vtable::type<hpx::util::detail::bound<hpx::actions::action<hpx::components::server::runtime_support, bool, hpx::util::tuple<void, void, void, void, void, void, void, void>, hpx::actions::result_action0<bool (hpx::components::server::runtime_support::)(), &hpx::components::server::runtime_support::load_components, hpx::actions::detail::this_type> >::continuation_thread_object_function_0, hpx::util::tupleboost::shared_ptr<hpx::actions::continuation, bool (hpx::components::server::runtimesupport::)(), hpx::components::server::runtimesupport, void, void, void, void, void> >, hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum), void, void>::invoke(void_, hpx::threads::thread_state_ex_enum&&) + 0x4c7 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacddb425 : hpx::util::coroutines::detail::coroutine_impl_wrapper<hpx::util::function_nonser<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum)>, hpx::util::coroutines::coroutine<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum), hpx::threads::detail::coroutine_allocator, hpx::util::coroutines::detail::lx::x86_linux_context_impl>, hpx::util::coroutines::detail::lx::x86_linux_context_impl, hpx::threads::detail::coroutine_allocator>::operator()() + 0x105 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 0x2aaaacdd78b9 : void hpx::util::coroutines::detail::lx::trampoline<hpx::util::coroutines::detail::coroutine_impl_wrapper<hpx::util::function_nonser<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum)>, hpx::util::coroutines::coroutine<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum), hpx::threads::detail::coroutine_allocator, hpx::util::coroutines::detail::lx::x86_linux_context_impl>, hpx::util::coroutines::detail::lx::x86_linux_context_impl, hpx::threads::detail::coroutine_allocator> >(hpx::util::coroutines::detail::coroutine_impl_wrapper<hpx::util::function_nonser<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum)>, hpx::util::coroutines::coroutine<hpx::threads::thread_state_enum (hpx::threads::thread_state_ex_enum), hpx::threads::detail::coroutine_allocator, hpx::util::coroutines::detail::lx::x86_linux_context_impl>, hpx::util::coroutines::detail::lx::x86_linux_context_impl, hpx::threads::detail::coroutineallocator>) + 0x18 in /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 {env}: 78 entries: APEX_ROOT=/opt/tau/apex-0.2.2/ BOOST_ROOT=/opt/boost/1.52.0-release/ CPATH=/opt/intel/composer_xe_2013.1.117/mkl/include:/opt/intel/composer_xe_2013.1.117/mkl/include:/opt/intel/composer_xe_2013.1.117/tbb/include:/opt/intel/composer_xe_2013.1.117/mkl/include:/opt/intel/composer_xe_2013.1.117/tbb/include DISPLAY=localhost:11.0 EDITOR=vim GLIBCPP_FORCE_NEW=1 GLIBCXX_FORCE_NEW=1 GROUP=sshende HDF5_ROOT=/opt/hdf5/1.8.7-threadsafe/ HOME=/home/sshende HOST=ariel01.hermione HOSTTYPE=x86_64-linux HPX_HAVE_ITTNOTIFY=1 HPX_USE_STACK_GUARD_PAGES=0 INCLUDE=/opt/intel/composer_xe_2013.1.117/mkl/include:/opt/intel/composer_xe_2013.1.117/mkl/include:/opt/intel/composer_xe_2013.1.117/mkl/include INTEL_LICENSE_FILE=/opt/intel/composer_xe_2013.1.117/licenses:/opt/intel/licenses:/home/sshende/intel/licenses:/opt/intel/composer_xe_2013.1.117/licenses:/opt/intel/licenses:/home/sshende/intel/licenses IPPROOT=/opt/intel/composer_xe_2013.1.117/ipp LANG=en_US.UTF-8 LIBRARY_PATH=/opt/intel/composer_xe_2013.1.117/compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/mkl/lib/intel64:/opt/intel/composer_xe_2013.1.117/compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/ipp/../compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/ipp/lib/intel64:/opt/intel/composer_xe_2013.1.117/compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/mkl/lib/intel64:/opt/intel/composer_xe_2013.1.117/tbb/lib/intel64:/opt/intel/composer_xe_2013.1.117/compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/ipp/../compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/ipp/lib/intel64:/opt/intel/composer_xe_2013.1.117/compiler/lib/intel64:/opt/intel/composer_xe_2013.1.117/mkl/lib/intel64:/opt/intel/composer_xe_2013.1.117/tbb/lib/intel64 LOGNAME=sshende LSCOLORS=rs=0:di=01;34:ln=01;36:mh=00:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=40;31;01:su=37;41:sg=30;43:ca=30;41:tw=30;42:ow=34;42:st=37;44:ex=01;32:.tar=01;31:.tgz=01;31:.arj=01;31:.taz=01;31:.lzh=01;31:.lzma=01;31:.tlz=01;31:.txz=01;31:.zip=01;31:.z=01;31:.Z=01;31:.dz=01;31:.gz=01;31:.lz=01;31:.xz=01;31:.bz2=01;31:.bz=01;31:.tbz=01;31:.tbz2=01;31:.tz=01;31:.deb=01;31:.rpm=01;31:.jar=01;31:.war=01;31:.ear=01;31:.sar=01;31:.rar=01;31:.ace=01;31:.zoo=01;31:.cpio=01;31:.7z=01;31:.rz=01;31:.jpg=01;35:.jpeg=01;35:.gif=01;35:.bmp=01;35:.pbm=01;35:.pgm=01;35:.ppm=01;35:.tga=01;35:.xbm=01;35:.xpm=01;35:.tif=01;35:.tiff=01;35:.png=01;35:.svg=01;35:.svgz=01;35:.mng=01;35:.pcx=01;35:.mov=01;35:.mpg=01;35:.mpeg=01;35:.m2v=01;35:.mkv=01;35:.webm=01;35:.ogm=01;35:.mp4=01;35:.m4v=01;35:.mp4v=01;35:.vob=01;35:.qt=01;35:.nuv=01;35:.wmv=01;35:.asf=01;35:.rm=01;35:.rmvb=01;35:.flc=01;35:.avi=01;35:.fli=01;35:.flv=01;35:.gl=01;35:.dl=01;35:.xcf=01;35:.xwd=01;35:.yuv=01;35:.cgm=01;35:.emf=01;35:.axv=01;35:.anx=01;35:.ogv=01;35:.ogx=01;35:.aac=00;36:.au=00;36:.flac=00;36:.mid=00;36:.midi=00;36:.mka=00;36:.mp3=00;36:.mpc=00;36:.ogg=00;36:.ra=00;36:.wav=00;36:.axa=00;36:.oga=00;36:.spx=00;36:_.xspf=00;36: MACHTYPE=x86_64 MAIL=/var/mail/sshende MANPATH=/opt/intel/composer_xe_2013.1.117/man/en_US:/opt/intel/composer_xe_2013.1.117/man/en_US:/opt/intel/composer_xe_2013.1.117/man/en_US:/opt/intel/composer_xe_2013.1.117/man/en_US:/opt/intel/composer_xe_2013.1.117/man/en_US:/usr/local/man:/usr/local/share/man:/usr/share/man:::/opt/intel/vtune_amplifier_xe_2013/man MIC_LD_LIBRARY_PATH=/opt/intel/composer_xe_2013.1.117/compiler/lib/mic:/opt/intel/composer_xe_2013.1.117/mkl/lib/mic:/opt/intel/composer_xe_2013.1.117/compiler/lib/mic:/opt/intel/mic/coi/device-linux-release/lib:/opt/intel/mic/myo/lib:/opt/intel/composer_xe_2013.1.117/compiler/lib/mic:/opt/intel/composer_xe_2013.1.117/mkl/lib/mic:/opt/intel/composer_xe_2013.1.117/tbb/lib/mic:/opt/intel/composer_xe_2013.1.117/compiler/lib/mic:/opt/intel/mic/coi/device-linux-release/lib:/opt/intel/mic/myo/lib:/opt/intel/composer_xe_2013.1.117/compiler/lib/mic:/opt/intel/composer_xe_2013.1.117/mkl/lib/mic:/opt/intel/composer_xe_2013.1.117/tbb/lib/mic MKLROOT=/opt/intel/composer_xe_2013.1.117/mkl OSTYPE=linux PATH=/opt/tau/tau2/x86_64/bin:/opt/tau/tau2/x86_64/bin:/opt/intel/vtune_amplifier_xe_2013/bin64:/opt/intel/composer_xe_2013.1.117/bin/intel64:/opt/intel/composer_xe_2013.1.117/bin/intel64:/opt/intel/composer_xe_2013.1.117/bin/intel64_mic:/opt/intel/composer_xe_2013.1.117/debugger/gui/intel64:/opt/intel/composer_xe_2013.1.117/bin/intel64:/opt/intel/composer_xe_2013.1.117/bin/intel64:/opt/intel/composer_xe_2013.1.117/bin/intel64_mic:/opt/intel/composer_xe_2013.1.117/debugger/gui/intel64:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin PWD=/home/sshende/scr SHELL=/bin/bash SHLVL=1 SLURMD_NODENAME=ariel01 SLURM_CHECKPOINT_IMAGE_DIR=/home/sshende SLURM_CPUS_ON_NODE=16 SLURM_DISTRIBUTION=cyclic SLURM_GTIDS=0 SLURM_JOBID=65777 SLURM_JOB_CPUS_PER_NODE=16 SLURM_JOB_ID=65777 SLURM_JOB_NAME=/bin/tcsh SLURM_LAUNCH_NODE_IPADDR=10.1.1.11 SLURM_LOCALID=0 SLURM_NNODES=1 SLURM_NODEID=0 SLURM_NODELIST=ariel01 SLURM_NPROCS=1 SLURM_NTASKS=1 SLURM_PRIO_PROCESS=0 SLURM_PROCID=0 SLURM_PTY_PORT=36402 SLURM_PTY_WIN_COL=80 SLURM_PTY_WIN_ROW=24 SLURM_SRUN_COMM_HOST=10.1.1.11 SLURM_SRUN_COMM_PORT=55495 SLURM_STEPID=0 SLURM_STEP_ID=0 SLURM_STEP_LAUNCHER_PORT=55495 SLURM_STEP_NODELIST=ariel01 SLURM_STEP_NUM_NODES=1 SLURM_STEP_NUM_TASKS=1 SLURM_STEP_TASKS_PER_NODE=1 SLURM_SUBMIT_DIR=/home/sshende SLURM_TASKS_PER_NODE=1 SLURM_TASK_PID=18647 SLURM_TOPOLOGY_ADDR=ariel01 SLURM_TOPOLOGY_ADDR_PATTERN=node SSH_AUTH_SOCK=/tmp/ssh-Nvv1o1ZeDi/agent.25172 SSH_CLIENT=128.223.202.199 57072 22 SSH_CONNECTION=128.223.202.199 57072 10.1.1.11 22 SSH_TTY=/dev/pts/14 TAU_ROOT=/opt/tau/tau2 TAU_SAMPLING=1 TBBROOT=/opt/intel/composer_xe_2013.1.117/tbb TERM=xterm-256color USER=sshende VENDOR=unknown VTUNE_AMPLIFIER_XE_2013_DIR=/opt/intel/vtune_amplifier_xe2013 =/usr/bin/srun {locality-id}: 0 {hostname}: 10.1.1.14:7910 {process-id}: 23389 {function}: get_executable_filename {file}: /opt/tau/build/hpx/src/util/find_prefix.cpp {line}: 91 {os-thread}: 8, worker-thread#2 {thread-id}: 00002aaab2e81ce0 {thread-description}: load_components_action {config}: HPX_HAVE_NATIVE_TLS=ON HPX_HAVE_STACKTRACES=ON HPX_HAVE_COMPRESSION_BZIP2=OFF HPX_HAVE_COMPRESSION_SNAPPY=OFF HPX_HAVE_COMPRESSION_ZLIB=OFF HPX_HAVE_PARCEL_COALESCING=ON HPX_HAVE_PARCELPORT_SHMEM=OFF HPX_HAVE_PARCELPORT_IBVERBS=OFF HPX_HAVE_VERIFY_LOCKS=ON HPX_HAVE_HWLOC=ON HPX_HAVE_ITTNOTIFY=ON HPX_LIMIT=4 HPX_ACTION_ARGUMENT_LIMIT=5 HPX_COMPONENT_CREATE_ARGUMENT_LIMIT=5 HPX_FUNCTION_ARGUMENT_LIMIT=8 HPX_LOCK_LIMIT=5 HPX_TUPLE_LIMIT=8 HPX_WAIT_ARGUMENT_LIMIT=5 HPX_PARCEL_MAX_CONNECTIONS=512 HPX_PARCEL_MAX_CONNECTIONS_PER_LOCALITY=4 HPX_INITIAL_AGAS_LOCAL_CACHE_SIZE=256 HPX_AGAS_LOCAL_CACHE_SIZE_PER_THREAD=32 HPX_PREFIX=/opt/tau/hpx.apex.tau.itt.debug {version}: V0.9.7-trunk (AGAS: V3.0), Git: 2bbd6a5f6ddd6803cd46cbd21780ca31591b9e66 {boost}: V1.52.0 {build-type}: debug {date}: Oct 28 2013 07:58:58 {platform}: linux {compiler}: GNU C++ version 4.6.4 {stdlib}: GNU libstdc++ version 20130412 {what}: unable to find executable filename, /proc may be unavailable: HPX(dynamic_link_failure)

Abort ariel01:~/scr> ldd ./factorial linux-vdso.so.1 (0x00007fffde2e8000) libhpxd.so.0 => /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpxd.so.0 (0x00002aaaaaccf000) libhpx_serializationd.so.0 => /opt/tau/hpx.apex.tau.itt.debug/lib/hpx/libhpx_serializationd.so.0 (0x00002aaaaaca1000) libboost_date_time.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_date_time.so.1.52.0 (0x00002aaaadc34000) libboost_filesystem.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_filesystem.so.1.52.0 (0x00002aaaade44000) libboost_program_options.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_program_options.so.1.52.0 (0x00002aaaae05a000) libboost_regex.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_regex.so.1.52.0 (0x00002aaaae2c6000) libboost_serialization.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_serialization.so.1.52.0 (0x00002aaaae5e6000) libboost_system.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_system.so.1.52.0 (0x00002aaaae861000) libboost_thread.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_thread.so.1.52.0 (0x00002aaaaea64000) libboost_chrono.so.1.52.0 => /opt/boost/1.52.0-release/stage/lib/libboost_chrono.so.1.52.0 (0x00002aaaaec87000) libprofiler.so.0 => /usr/lib/libprofiler.so.0 (0x00002aaaaee9a000) libtcmalloc.so.4 => /usr/lib/libtcmalloc.so.4 (0x00002aaaaf0ad000) libTAU.so => /opt/tau/tau2/x86_64/lib/libTAU.so (0x00002aaaaf31f000) libApex.so => /opt/tau/apex-0.2.2/lib/libApex.so (0x00002aaaaf548000) libhwloc.so.5 => /opt/tau/hwloc-1.7.2/lib/libhwloc.so.5 (0x00002aaaaf54c000) libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00002aaaaf579000) librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00002aaaaf77e000) libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00002aaaaf986000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00002aaaafc8e000) libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00002aaaaff8d000) libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00002aaab01a3000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00002aaab03bf000) /lib64/ld-linux-x86-64.so.2 (0x00002aaaaaaab000) libicuuc.so.44 => /usr/lib/libicuuc.so.44 (0x00002aaab076c000) libicui18n.so.44 => /usr/lib/libicui18n.so.44 (0x00002aaab0abb000) libicudata.so.44 => /usr/lib/libicudata.so.44 (0x00002aaab0e90000) libunwind.so.8 => /usr/lib/x86_64-linux-gnu/libunwind.so.8 (0x00002aaab1ecd000) libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00002aaab20e9000) libpapi.so.5.2.0.0 => /opt/tau/papi-5.2.0/lib/libpapi.so.5.2.0.0 (0x00002aaab2301000) libOpenCL.so.1 => /usr/lib/x86_64-linux-gnu/libOpenCL.so.1 (0x00002aaab235a000) libcudart.so.5.0 => /usr/lib/x86_64-linux-gnu/libcudart.so.5.0 (0x00002aaab2560000) libxml2.so.2 => /usr/lib/x86_64-linux-gnu/libxml2.so.2 (0x00002aaab27ba000) libpfm.so.4 => /opt/tau/papi-5.2.0/lib/libpfm.so.4 (0x00002aaab291d000) liblzma.so.5 => /lib/x86_64-linux-gnu/liblzma.so.5 (0x00002aaab2a42000) ariel01:~/scr>

Aborts. Thanks,

hkaiser commented 10 years ago

This has been resolved by 3ccdeaff2f0157c87e2e69edf30fec3a0437a759