Closed jamesETsmith closed 1 year ago
Does this only fail on hw when you're running multinode?
@mcordery no it crashes for single node
@jamesETsmith it seems to work for x86. How are you building for GC these days?
@mcordery the README.md is up-to-date with instructions (I hope, lmk if they aren't). Here's the condensed version:
# Build LucataGraphBLAS for LC
cmake -B build_lc <other cmake args> # configure
cmake --build build_lc --parallel 16 # build
cmake --build build_lc --target install # installs LGB in build/install
# Build LAGraph against LAGraph for LC
cmake -B build_lc -DGRAPHBLAS_ROOT=/path/to/LucataGraphBLAS/build_lc/install \
-DCMAKE_C_COMPILER=/tools/lucata/bin/emu-cc.sh \
-DCMAKE_CXX_COMPILER=/tools/lucata/bin/emu-cc.sh
cmake --build build_lc --parallel 16
Just a fyi, all the tests except for test_ConnectedComponents
work on x86.
That's what I figured but I was running other LC tests with emusim fine but TriangleCount kept giving me a 'could not fork' error
@mcordery are you using the latest version of our LAGraph? The could not fork
problem should be fixed (#7).
I did a pull on it. Guess I should just torch it and do a clean build.
If that doesn't fix it, just let me know and we can move this discussion to slack and start troubleshooting.
General
This is a follow-up to #4.
Details
Here's a complete error:
Here's the manual debugging for the TPC when the crash occurs: