rocm-arch / tensorflow-rocm

tensorflow-rocm AUR package

error making: rocfft #28

Closed TheCoolerPlayer closed 3 years ago

TheCoolerPlayer commented 3 years ago

Everything downloads and builds fine up to this stage. Below is the output for rocfft only, which is the part that fails; everything before it downloads and builds without problems:

==> Making package: rocfft 4.2.0-1 (Wednesday 30 June 2021 01:10:06 AM)
==> Checking runtime dependencies...
==> Checking buildtime dependencies...
==> WARNING: Using existing $srcdir/ tree
==> Removing existing $pkgdir/ directory...
==> Starting build()...
-- The CXX compiler identification is Clang 12.0.0
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /opt/rocm/bin/hipcc - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Building with ROCm tools
-- Looking for C++ include pthread.h
-- Looking for C++ include pthread.h - found
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE  
-- ROCclr at /opt/rocm/lib/cmake/rocclr
-- hip::amdhip64 is SHARED_LIBRARY
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success
-- ROCclr at /opt/rocm/lib/cmake/rocclr
-- hip::amdhip64 is SHARED_LIBRARY
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY - Success
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY - Success
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR - Success
-- Configuring done
-- Generating done
-- Build files have been written to: /home/aquaman/.cache/yay/rocfft/src
[  0%] Building CXX object library/src/device/generator/CMakeFiles/rocfft-kernel-generator.dir/generator.file.cpp.o
[  0%] Building CXX object library/src/device/generator/CMakeFiles/rocfft-kernel-generator.dir/generator.main.cpp.o
[  1%] Linking CXX executable rocfft-kernel-generator
[  1%] Built target rocfft-kernel-generator
[  1%] Generator producing device kernels for rocfft-device
type: pow2 pow3 pow5 pow7,11,13 large 2D
valid manual small size:
valid manual large size:
precision: single double
group_num: 150

[  1%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/bluestein.cpp.o
[  2%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/transpose.cpp.o
[  2%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/real2complex_embed.cpp.o
[  2%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/complex2real_embed.cpp.o
[  3%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/realcomplex_even.cpp.o
[  3%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/realcomplex_pair.cpp.o
[  3%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/realcomplex_even_transpose.cpp.o
[  4%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_0.cpp.o
[  4%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/function_pool.cpp.o
[  4%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_2.cpp.o
[  4%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_1.cpp.o
[  5%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_3.cpp.o
[  5%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_4.cpp.o
[  5%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_5.cpp.o
[  6%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_6.cpp.o
[  6%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_7.cpp.o
[  6%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_8.cpp.o
[  6%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_9.cpp.o
[  7%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_10.cpp.o
[  7%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_11.cpp.o
[  7%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_12.cpp.o
[  8%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_13.cpp.o
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace, preprocessed source, and associated run script.
Stack dump:
0.      Program arguments: /opt/rocm/llvm/bin/clang-12 -cc1 -mllvm --amdhsa-code-object-version=4 -triple amdgcn-amd-amdhsa -aux-triple x86_64-pc-linux-gnu -emit-obj --mrelax-relocations -disable-free -disable-llvm-verifier -discard-value-names -main-file-name kernel_launch_single_9.cpp -mrelocation-model pic -pic-level 2 -fhalf-no-semantic-interposition -mframe-pointer=none -fdenormal-fp-math-f32=preserve-sign,preserve-sign -fno-rounding-math -fno-plt -aux-target-cpu x86-64 -fcuda-is-device -mllvm -amdgpu-internalize-symbols -fcuda-allow-variadic-functions -fvisibility hidden -fapply-global-visibility-to-externs -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/hip.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/ocml.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/ockl.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_daz_opt_on.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_unsafe_math_off.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_finite_only_off.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_correctly_rounded_sqrt_on.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_wavefrontsize64_on.bc -mlink-builtin-bitcode /opt/rocm/amdgcn/bitcode/oclc_isa_version_803.bc -target-cpu gfx803 -fno-split-dwarf-inlining -debugger-tuning=gdb -resource-dir /opt/rocm/llvm/lib/clang/12.0.0 -dependency-file CMakeFiles/rocfft-device.dir/kernel_launch_single_9.cpp.o.d -MT library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_9.cpp.o -sys-header-deps -internal-isystem /opt/rocm/llvm/lib/clang/12.0.0/include/cuda_wrappers -internal-isystem /opt/rocm/include -include __clang_hip_runtime_wrapper.h -isystem /opt/rocm/llvm/lib/clang/12.0.0/include/.. 
-isystem /opt/rocm/hsa/include -isystem /opt/rocm/hip/include -D rocfft_device_EXPORTS -I /home/aquaman/.cache/yay/rocfft/src/rocFFT-rocm-4.2.0/library/src/device -I /home/aquaman/.cache/yay/rocfft/src/rocFFT-rocm-4.2.0/library/src/device/../include -I /home/aquaman/.cache/yay/rocfft/src/rocFFT-rocm-4.2.0/library/src/device/kernels -I /home/aquaman/.cache/yay/rocfft/src/rocFFT-rocm-4.2.0/library/src/device/generator -I /home/aquaman/.cache/yay/rocfft/src/library/src/device -I /home/aquaman/.cache/yay/rocfft/src/rocFFT-rocm-4.2.0/library/include -I /home/aquaman/.cache/yay/rocfft/src/include -D NDEBUG -D_FORTIFY_SOURCE=2 -D_GLIBCXX_ASSERTIONS -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0 -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0/x86_64-pc-linux-gnu -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0/backward -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0 -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0/x86_64-pc-linux-gnu -internal-isystem /usr/lib64/gcc/x86_64-pc-linux-gnu/11.1.0/../../../../include/c++/11.1.0/backward -internal-isystem /usr/local/include -internal-isystem /opt/rocm/llvm/lib/clang/12.0.0/include -internal-externc-isystem /include -internal-externc-isystem /usr/include -internal-isystem /usr/local/include -internal-isystem /opt/rocm/llvm/lib/clang/12.0.0/include -internal-externc-isystem /include -internal-externc-isystem /usr/include -O3 -Wformat -Werror=format-security -std=c++14 -fdeprecated-macro -fno-autolink -fdebug-compilation-dir /home/aquaman/.cache/yay/rocfft/src/library/src/device -ferror-limit 19 -fhip-new-launch-api -fcf-protection=none -fgnuc-version=4.2.1 -fcxx-exceptions -fexceptions -fcolor-diagnostics -vectorize-loops -vectorize-slp -mllvm -amdgpu-early-inline-all=true -mllvm -amdgpu-function-calls=false 
-fcuda-allow-variadic-functions -munsafe-fp-atomics -faddrsig -o /tmp/kernel_launch_single_9-gfx803-2e3efb.o -x hip /home/aquaman/.cache/yay/rocfft/src/library/src/device/kernel_launch_single_9.cpp
1.      <eof> parser at end of file
2.      Code generation
3.      Running pass 'CallGraph Pass Manager' on module '/home/aquaman/.cache/yay/rocfft/src/library/src/device/kernel_launch_single_9.cpp'.
4.      Running pass 'Greedy Register Allocator' on function '@_Z17fft_back_ip_len22I15HIP_vector_typeIfLj2EEL9StrideBin1EEvPKT_mPKmS7_mPN9real_typeIS3_E4typeESB_'
 #0 0x000055ef132513f3 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/opt/rocm/llvm/bin/clang-12+0x22103f3)
 #1 0x000055ef1324f054 llvm::sys::RunSignalHandlers() (/opt/rocm/llvm/bin/clang-12+0x220e054)
 #2 0x000055ef1324f1d6 (/opt/rocm/llvm/bin/clang-12+0x220e1d6)
 #3 0x00007fac3a24a870 __restore_rt (/usr/lib/libpthread.so.0+0x13870)
 #4 0x000055ef128f384a llvm::VirtRegAuxInfo::weightCalcHelper(llvm::LiveInterval&, llvm::SlotIndex*, llvm::SlotIndex*) (/opt/rocm/llvm/bin/clang-12+0x18b284a)
 #5 0x000055ef128f4fa2 llvm::VirtRegAuxInfo::calculateSpillWeightAndHint(llvm::LiveInterval&) (/opt/rocm/llvm/bin/clang-12+0x18b3fa2)
 #6 0x000055ef128f5023 llvm::VirtRegAuxInfo::calculateSpillWeightsAndHints() (/opt/rocm/llvm/bin/clang-12+0x18b4023)
 #7 0x000055ef12885f7f (/opt/rocm/llvm/bin/clang-12+0x1844f7f)
 #8 0x000055ef125f1d19 llvm::MachineFunctionPass::runOnFunction(llvm::Function&) (/opt/rocm/llvm/bin/clang-12+0x15b0d19)
 #9 0x000055ef12a25680 llvm::FPPassManager::runOnFunction(llvm::Function&) (/opt/rocm/llvm/bin/clang-12+0x19e4680)
#10 0x000055ef122df55b (/opt/rocm/llvm/bin/clang-12+0x129e55b)
#11 0x000055ef12a24f15 llvm::legacy::PassManagerImpl::run(llvm::Module&) (/opt/rocm/llvm/bin/clang-12+0x19e3f15)
#12 0x000055ef13513643 (/opt/rocm/llvm/bin/clang-12+0x24d2643)
#13 0x000055ef13515235 clang::EmitBackendOutput(clang::DiagnosticsEngine&, clang::HeaderSearchOptions const&, clang::CodeGenOptions const&, clang::TargetOptions const&, clang::LangOptions const&, llvm::DataLayout const&, llvm::Module*, clang::BackendAction, std::unique_ptr<llvm::raw_pwrite_stream, std::default_delete<llvm::raw_pwrite_stream> >) (/opt/rocm/llvm/bin/clang-12+0x24d4235)
#14 0x000055ef141efa2e (/opt/rocm/llvm/bin/clang-12+0x31aea2e)
#15 0x000055ef14be0239 clang::ParseAST(clang::Sema&, bool, bool) (/opt/rocm/llvm/bin/clang-12+0x3b9f239)
#16 0x000055ef13b7ce71 clang::FrontendAction::Execute() (/opt/rocm/llvm/bin/clang-12+0x2b3be71)
#17 0x000055ef13b1c3ae clang::CompilerInstance::ExecuteAction(clang::FrontendAction&) (/opt/rocm/llvm/bin/clang-12+0x2adb3ae)
#18 0x000055ef13c32653 clang::ExecuteCompilerInvocation(clang::CompilerInstance*) (/opt/rocm/llvm/bin/clang-12+0x2bf1653)
#19 0x000055ef11c080ef cc1_main(llvm::ArrayRef<char const*>, char const*, void*) (/opt/rocm/llvm/bin/clang-12+0xbc70ef)
#20 0x000055ef11c042cc (/opt/rocm/llvm/bin/clang-12+0xbc32cc)
#21 0x000055ef11b78655 main (/opt/rocm/llvm/bin/clang-12+0xb37655)
#22 0x00007fac39c8ab25 __libc_start_main (/usr/lib/libc.so.6+0x27b25)
#23 0x000055ef11c036ee _start (/opt/rocm/llvm/bin/clang-12+0xbc26ee)
[  8%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_14.cpp.o
clang-12: error: unable to execute command: Segmentation fault (core dumped)
clang-12: error: clang frontend command failed due to signal (use -v to see invocation)
clang version 12.0.0 (https://aur.archlinux.org/llvm-amdgpu.git 93955b68cd8e002a7538dc0a8cac6a31d156e964)
Target: x86_64-pc-linux-gnu
Thread model: posix
InstalledDir: /opt/rocm/llvm/bin
[  8%] Building CXX object library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_15.cpp.o
clang-12: note: diagnostic msg: Error generating preprocessed source(s).
make[2]: *** [library/src/device/CMakeFiles/rocfft-device.dir/build.make:2184: library/src/device/CMakeFiles/rocfft-device.dir/kernel_launch_single_9.cpp.o] Error 254
make[2]: *** Waiting for unfinished jobs....
make[1]: *** [CMakeFiles/Makefile2:173: library/src/device/CMakeFiles/rocfft-device.dir/all] Error 2
make: *** [Makefile:156: all] Error 2
==> ERROR: A failure occurred in build().
    Aborting...
error making: rocfft

I have a Ryzen 5 3600XT with a Vega 64 Liquid, running Manjaro KDE on kernel 5.4.124. I'm not sure whether this is a hardware or a software issue. I already had rocm-dev, rocm-utils and rocm-libs installed before attempting tensorflow-rocm; should I try installing tensorflow-rocm without the other ROCm packages?

Edit: I should also add that the system sometimes shuts down in the middle of this last build step.

TheCoolerPlayer commented 3 years ago

This turned out to be a hardware issue with the CPU. Manually underclocking the CPU to 3 GHz fixed it.
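
For anyone landing here with a similar compiler segfault: one rough way to tell a hardware problem from a software one is to re-run the exact failing command several times and see whether the outcome changes between runs. The sketch below is only a heuristic and not something from the rocfft or AUR tooling; `classify_runs`, `verdict` and `EXAMPLE_COMMAND` are hypothetical names made up for illustration. A command that sometimes passes and sometimes crashes suggests unstable hardware (CPU, RAM, power or cooling), while one that fails identically every time looks more like a genuine software bug. Neither result is proof.

```python
import subprocess
import sys
from collections import Counter

# Placeholder command that always succeeds; replace it with the real
# failing compile command copied from your own build log.
EXAMPLE_COMMAND = [sys.executable, "-c", "pass"]


def classify_runs(command, attempts=5):
    """Run `command` `attempts` times and tally the exit codes.

    On POSIX, a negative return code -N means the process was killed
    by signal N (for example -11 for a segmentation fault).
    """
    codes = Counter()
    for _ in range(attempts):
        result = subprocess.run(
            command,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        codes[result.returncode] += 1
    return codes


def verdict(codes):
    """Summarise the tally from classify_runs as a rough label."""
    if len(codes) > 1:
        # Different outcomes for identical input: possibly unstable hardware.
        return "flaky"
    (code,) = codes
    return "always passes" if code == 0 else "always fails"
```

Calling `verdict(classify_runs(EXAMPLE_COMMAND))` returns one of the three labels. In this thread the random shutdowns during the build were a second hint in the same direction as a "flaky" result would be.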