nanopool / nanominer

Nanominer is a versatile tool for mining cryptocurrencies on GPUs and CPUs.
https://nanominer.org

Linux - suddenly exits nanominer after launching #428

Open 77wisher77 opened 2 months ago

77wisher77 commented 2 months ago

I'm trying to run this on Arch Linux and getting exit code 139 and a SIGSEGV (segmentation fault), reported as error 4 in libc.so.6.

What's weird is that I have working nanominers mining the same coins on EndeavourOS, an Arch-based distro, with the same parameters. I also have these parameters working on a Windows machine through cmd. So I feel like this should be solvable; I'm just unsure at this point what could be causing the issue.

I've attached the logs from journalctl and the console output of nanominer when it works: Arch_Server_NanoMiner_Journal.log, Arch_Server_NanoMiner.log

What Happens:

Launching from the command line, nanominer makes it to the "Detected 2 devices" line, then abruptly exits. Sometimes, if run immediately after rebooting and logging in, the program runs for a few seconds before exiting. The Arch_Server_NanoMiner.log file has the console output from such a run; just before crashing, it complains about an AMD platform initialization hang-up.

When launched as a service, it exits abruptly shortly after starting, and very little is logged. The attached ..._Journal.log file shows what little information the service yields, namely a core dump and a segfault error 4 mentioning libc.so.6.
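
For anyone reading the "error 4" part of the segfault message: on x86-64 Linux, that number is the hardware page-fault error code, a small bit field. The sketch below decodes it. The function name `decode_page_fault` and the `FLAGS` table are illustrative and not part of nanominer or any library; only the five lowest bits are handled, on the assumption that they are the ones relevant here.

```python
# Low bits of the x86 page-fault error code that the kernel prints as
# "error N" in segfault log lines. Each entry is:
# (bit mask, meaning when set, meaning when clear or None).
FLAGS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in page table", None),
    (0x10, "instruction fetch", None),
]


def decode_page_fault(code):
    """Return a list of human-readable descriptions for an error code."""
    described = []
    for mask, if_set, if_clear in FLAGS:
        if code & mask:
            described.append(if_set)
        elif if_clear is not None:
            described.append(if_clear)
    return described


# Error 4 from the report: only bit 2 (user mode) is set.
print(decode_page_fault(4))
```

Read this way, error 4 is a user-mode read from an address that was not mapped, which is consistent with the crash occurring inside `free()` on a bad pointer, though it does not by itself say where the bad pointer came from.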

77wisher77 commented 2 months ago

Here's the core dump from the systemd service. No core is dumped when running from the command line, due to some default system settings.

coredumpctl info:

           PID: 2319 (nanominer)
           UID: 0 (root)
           GID: 0 (root)
        Signal: 11 (SEGV)
     Timestamp: Sun 2024-04-07 00:41:04 AEST (1 day 14h ago)
  Command Line: /mining/nanominer -algo octopus -wallet <wallet> -coin cfx -rigName <rig_name> -email <email>
    Executable: /mining/nanominer
 Control Group: /system.slice/crypto-miner.service
          Unit: crypto-miner.service
         Slice: system.slice
       Boot ID: <BOOTID>
    Machine ID: <MACHINE_ID>
      Hostname: arch-server
       Storage: /var/lib/systemd/coredump/core.nanominer.0.29d9d815bed647c4a894633bbe4118d8.2319.1712414464000000.zst (present)
  Size on Disk: 48.9M
       Message: Process 2319 (nanominer) of user 0 dumped core.

                Stack trace of thread 2360:
                #0  0x00007450cc195e3e __libc_free (libc.so.6 + 0x9be3e)
                #1  0x00005a4b367541ea n/a (/mining/nanominer + 0x1541ea)
                ELF object binary architecture: AMD x86-64

coredumpctl debug:

           PID: 2319 (nanominer)
           UID: 0 (root)
           GID: 0 (root)
        Signal: 11 (SEGV)
     Timestamp: Sun 2024-04-07 00:41:04 AEST (1 day 14h ago)
  Command Line: /mining/nanominer -algo octopus -wallet <wallet> -coin cfx -rigName <RIG_NAME> -email <email> restarts_count 0
    Executable: /mining/nanominer
 Control Group: /system.slice/crypto-miner.service
          Unit: crypto-miner.service
         Slice: system.slice
       Boot ID: <BOOTID>
    Machine ID: <MACHINEID>
      Hostname: arch-server
       Storage: /var/lib/systemd/coredump/core.nanominer.0.29d9d815bed647c4a894633bbe4118d8.2319.1712414464000000.zst (present)
  Size on Disk: 48.9M
       Message: Process 2319 (nanominer) of user 0 dumped core.

                Stack trace of thread 2360:
                #0  0x00007450cc195e3e __libc_free (libc.so.6 + 0x9be3e)
                #1  0x00005a4b367541ea n/a (/mining/nanominer + 0x1541ea)
                ELF object binary architecture: AMD x86-64

GNU gdb (GDB) 14.2
Copyright (C) 2023 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<https://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
    <http://www.gnu.org/software/gdb/documentation/>.

For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from /mining/nanominer...
(No debugging symbols found in /mining/nanominer)

warning: Can't open file /dev/zero (deleted) during file-backed mapping note processing
[New LWP 2360]
[New LWP 2326]
[New LWP 2329]
[New LWP 2325]
[New LWP 2319]
[New LWP 2331]
[New LWP 2351]
[New LWP 2350]
[New LWP 2345]
[New LWP 2353]
[New LWP 2349]
[New LWP 2354]
[New LWP 2328]
[New LWP 2327]
[New LWP 2352]
[New LWP 2330]
[New LWP 2359]
[New LWP 2356]
[New LWP 2346]
[New LWP 2355]
[New LWP 2361]
[New LWP 2357]
[New LWP 2358]

warning: Could not load shared library symbols for ./libnvrtc.so.12.0.76.
Do you need "set solib-search-path" or "set sysroot"?
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/usr/lib/libthread_db.so.1".
Core was generated by `/mining/nanominer -algo octopus -wallet <wallet>'.
--Type <RET> for more, q to quit, c to continue without paging--c
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x00007450cc195e3e in free () from /usr/lib/libc.so.6
[Current thread is 1 (Thread 0x745068a006c0 (LWP 2360))]
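
Since the binary has no debugging symbols, the most useful values in the stack trace are probably the module-relative offsets in frames #0 and #1 rather than the raw addresses. As a small sanity check, the sketch below subtracts each offset from its runtime address to recover the module load base. The helper name `load_base` is hypothetical, and the 4096-byte page size is an assumption about this machine.

```python
PAGE_SIZE = 4096  # assumed page size for x86-64 Linux


def load_base(runtime_addr, module_offset):
    """Address at which the module was mapped: runtime address minus offset."""
    return runtime_addr - module_offset


# (runtime address, offset within module), copied from the stack trace above.
frames = {
    "libc.so.6": (0x00007450CC195E3E, 0x9BE3E),
    "/mining/nanominer": (0x00005A4B367541EA, 0x1541EA),
}

for name, (addr, offset) in frames.items():
    base = load_base(addr, offset)
    aligned = base % PAGE_SIZE == 0
    print(f"{name}: base={base:#x} page_aligned={aligned}")
```

Both bases come out page-aligned, which suggests the offsets are intact. The offset `0x1541ea` into `/mining/nanominer` is the value that stays the same across runs of the same build, so it is likely the one worth reporting alongside the nanominer version.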