IntelLabs / HPAT.jl

High Performance Analytics Toolkit (HPAT) is a Julia-based framework for big data analytics on clusters.
BSD 2-Clause "Simplified" License
120 stars 16 forks source link

Example error #14

Open julienmarie opened 8 years ago

julienmarie commented 8 years ago

Hello, the example fails on my machine this way. Is that a dependency version problem or something?

mpirun -np 2 julia ~/.julia/v0.4/HPAT/examples/pi.jl --points=1000000
WARNING: could not import ParallelIR.toSynGemOrInt into DistributedPass
WARNING: could not import ParallelIR.toSynGemOrInt into DistributedPass
ERROR: LoadError: LoadError: LoadError: UndefVarError: SymAllGen not defined
 in include at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in include_from_node1 at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in include at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in include_from_node1 at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in require at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in include at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in include_from_node1 at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in process_options at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
 in _start at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib
while loading /Users/julienmarie/.julia/v0.4/HPAT/src/distributed-pass.jl, in expression starting on line 121
while loading /Users/julienmarie/.julia/v0.4/HPAT/src/HPAT.jl, in expression starting on line 57
while loading /Users/julienmarie/.julia/v0.4/HPAT/examples/pi.jl, in expression starting on line 26
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------

signal (15): Terminated: 15
_ZN4llvm13LiveVariables17HandlePhysRegKillEjPNS_12MachineInstrE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm13LiveVariables16HandlePhysRegDefEjPNS_12MachineInstrERNS_11SmallVectorIjLj4EEE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm13LiveVariables20runOnMachineFunctionERNS_15MachineFunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm19MachineFunctionPass13runOnFunctionERNS_8FunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm13FPPassManager13runOnFunctionERNS_8FunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm23FunctionPassManagerImpl3runERNS_8FunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm19FunctionPassManager3runERNS_8FunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
_ZN4llvm3JIT20getPointerToFunctionEPNS_8FunctionE at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/libjulia.dylib (unknown line)
jl_generate_fptr at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/codegen.cpp:765
jl_trampoline_compile_function at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/builtins.c:1020
jl_apply at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/./julia.h:1331
jl_apply at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/gf.c:1691
with_output_color at util.jl:316
jl_apply at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/gf.c:1691
display_error at client.jl:86
jl_apply at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/gf.c:1691
_start at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib (unknown line)
jlcall__start_18909 at /Applications/Julia-0.4.6.app/Contents/Resources/julia/lib/julia/sys.dylib (unknown line)
jl_apply at /Users/osx/buildbot/slave/package_osx10_9-x64/build/src/./julia.h:1331
true_main at /Applications/Julia-0.4.6.app/Contents/Resources/julia/bin//julia (unknown line)
main at /Applications/Julia-0.4.6.app/Contents/Resources/julia/bin//julia (unknown line)
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

  Process name: [[27098,1],0]
  Exit code:    1
--------------------------------------------------------------------------
Wajihulhassan commented 8 years ago

@julienmarie Can you try the pi example with Julia version 0.5 rather than 0.4. HPAT has recently been updated to support Julia 0.5.

julienmarie commented 8 years ago

Tried. Quite the same output. The code seems to try to import ParallelAccelerator.ParallelIR.toSynGemOrInt but it's not part of ParallelAccelerator?

/Applications/Julia-0.5.app/Contents/Resources/julia/bin/julia ~/.julia/v0.5/HPAT/examples/pi.jl --points=1000000
WARNING: could not import ParallelIR.toSynGemOrInt into DistributedPass
ERROR: LoadError: LoadError: LoadError: UndefVarError: SymAllGen not defined
 in include_from_node1(::String) at ./loading.jl:488
 in include_from_node1(::String) at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
 in include_from_node1(::String) at ./loading.jl:488
 in include_from_node1(::String) at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
 in eval(::Module, ::Any) at ./boot.jl:234
 in eval(::Module, ::Any) at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
 in require(::Symbol) at ./loading.jl:415
 in require(::Symbol) at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
 in include_from_node1(::String) at ./loading.jl:488
 in include_from_node1(::String) at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
 in process_options(::Base.JLOptions) at ./client.jl:262
 in _start() at ./client.jl:318
 in _start() at /Applications/Julia-0.5.app/Contents/Resources/julia/lib/julia/sys.dylib:?
while loading /Users/julienmarie/.julia/v0.5/HPAT/src/distributed-pass.jl, in expression starting on line 121
while loading /Users/julienmarie/.julia/v0.5/HPAT/src/HPAT.jl, in expression starting on line 57
while loading /Users/julienmarie/.julia/v0.5/HPAT/examples/pi.jl, in expression starting on line 26
ehsantn commented 8 years ago

I think you need to checkout the packages:

Pkg.checkout("CompilerTools")
Pkg.checkout("ParallelAcclerator")
Pkg.checkout("HPAT")
samuel100 commented 7 years ago

I am having the same issue (see error below). I am using Julia 0.5 on an Ubuntu box ("Ubuntu 16.04.1 LTS"). I attempted to checkout as per @ehsantn suggestion, however Julia replied that there were no packages to update.

samkemp@juliabox:~$ mpirun -np 2 julia ~/.julia/v0.5/HPAT/examples/pi.jl --points=1000000
--------------------------------------------------------------------------
[[6894,1],0]: A high-performance Open MPI point-to-point messaging module
was unable to find any relevant network interfaces:

Module: OpenFabrics (openib)
  Host: juliabox

Another transport will be used instead, although this may result in
lower performance.
--------------------------------------------------------------------------
WARNING: could not import ParallelIR.toSynGemOrInt into DistributedPass
ERROR: LoadError: LoadError: LoadError: UndefVarError: SymAllGen not defined

 in include_from_node1(::String) at ./loading.jl:488 (repeats 2 times)
 in eval(::Module, ::Any) at ./boot.jl:234
 in require(::Symbol) at ./loading.jl:415
 in include_from_node1(::String) at ./loading.jl:488
 in process_options(::Base.JLOptions) at ./client.jl:262
 in _start() at ./client.jl:318
while loading /home/samkemp/.julia/v0.5/HPAT/src/distributed-pass.jl, in expression starting on line 121
while loading /home/samkemp/.julia/v0.5/HPAT/src/HPAT.jl, in expression starting on line 57
while loading /home/samkemp/.julia/v0.5/HPAT/examples/pi.jl, in expression starting on line 26
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

  Process name: [[6894,1],0]
  Exit code:    1
--------------------------------------------------------------------------
samuel100 commented 7 years ago

I take that back. I did a Pkg.update() and now the example works.