Skip to content
This repository was archived by the owner on Jan 26, 2024. It is now read-only.
This repository was archived by the owner on Jan 26, 2024. It is now read-only.

ROCclr-rocm-5.4.3/os/os_posix.cpp:305: static void amd::Os::currentStackInfo(unsigned char**, size_t*): Assertion `Os::currentStackPtr() >= *base - *size && Os::currentStackPtr() < *base && "just checking"' failed #158

@darkbasic

Description

@darkbasic
niko@talos2 ~ $ clinfo
clinfo: /var/tmp/portage/dev-libs/rocm-opencl-runtime-5.4.3/work/ROCclr-rocm-5.4.3/os/os_posix.cpp:305: static void amd::Os::currentStackInfo(unsigned char**, size_t*): Assertion `Os::currentStackPtr() >= *base - *size && Os::currentStackPtr() < *base && "just checking"' failed.
Aborted (core dumped)

I'm on Gentoo Linux ppc64le (4K page size) using linux-6.1.12.
GPU is AMD RX 570 (mesa git master).
LLVM is 15.0.7.

rocm-opencl-runtime-5.4.3 compiles fine but as soon as I run clinfo it crashes.

The coredump looks completely useless:

Mon 2023-02-20 15:01:01 CET   212914 1000 1000 SIGABRT present  /usr/bin/clinfo                                                                              1.5M
talos2 ~ # coredumpctl gdb 212914
           PID: 212914 (clinfo)
           UID: 1000 (niko)
           GID: 1000 (niko)
        Signal: 6 (ABRT)
     Timestamp: Mon 2023-02-20 15:01:01 CET (46s ago)
  Command Line: clinfo
    Executable: /usr/bin/clinfo
 Control Group: /user.slice/user-1000.slice/user@1000.service/session.slice/vte-spawn-4515b441-519b-4357-9405-43fc61d5f6db.scope
          Unit: user@1000.service
     User Unit: vte-spawn-4515b441-519b-4357-9405-43fc61d5f6db.scope
         Slice: user-1000.slice
     Owner UID: 1000 (niko)
       Boot ID: 0dca6c1f75ea46d7b02761482c0ec1d6
    Machine ID: b3e834569b8ff461391f5ac061feb773
      Hostname: talos2
       Storage: /var/lib/systemd/coredump/core.clinfo.1000.0dca6c1f75ea46d7b02761482c0ec1d6.212914.1676901661000000.zst (present)
  Size on Disk: 1.5M
       Message: Process 212914 (clinfo) of user 1000 dumped core.

GNU gdb (Gentoo 12.1 vanilla) 12.1
Copyright (C) 2022 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "powerpc64le-unknown-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<https://bugs.gentoo.org/>.
Find the GDB manual and other documentation resources online at:
    <http://www.gnu.org/software/gdb/documentation/>.

For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from /usr/bin/clinfo...
(No debugging symbols found in /usr/bin/clinfo)
[New LWP 212914]
[New LWP 212918]
[New LWP 212915]
[New LWP 212916]
[New LWP 212917]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/usr/lib64/libthread_db.so.1".
Core was generated by `clinfo '.
Program terminated with signal SIGABRT, Aborted.
#0  0x00003fff8c7e603c in ?? () from /usr/lib64/libc.so.6
[Current thread is 1 (Thread 0x3fff8ca58020 (LWP 212914))]
(gdb) info threads
  Id   Target Id                          Frame 
* 1    Thread 0x3fff8ca58020 (LWP 212914) 0x00003fff8c7e603c in ?? () from /usr/lib64/libc.so.6
  2    Thread 0x3fff7abab120 (LWP 212918) 0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
  3    Thread 0x3fff7c4ef120 (LWP 212915) 0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
  4    Thread 0x3fff7bbad120 (LWP 212916) 0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
  5    Thread 0x3fff7b3ac120 (LWP 212917) 0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
(gdb) thread 1
[Switching to thread 1 (Thread 0x3fff8ca58020 (LWP 212914))]
#0  0x00003fff8c7e603c in ?? () from /usr/lib64/libc.so.6
(gdb) thread 2
[Switching to thread 2 (Thread 0x3fff7abab120 (LWP 212918))]
#0  0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
(gdb) thread 3
[Switching to thread 3 (Thread 0x3fff7c4ef120 (LWP 212915))]
#0  0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
(gdb) thread 4
[Switching to thread 4 (Thread 0x3fff7bbad120 (LWP 212916))]
#0  0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
(gdb) thread 5
[Switching to thread 5 (Thread 0x3fff7b3ac120 (LWP 212917))]
#0  0x00003fff8c7ddf84 in ?? () from /usr/lib64/libc.so.6
(gdb) quit

I'm using dev-libs/rocm-opencl-runtime-5.4.3 and dev-libs/rocr-runtime, dev-libs/rocm-comgr, dev-libs/rocm-device-libs, dev-util/rocm-cmake and dev-libs/roct-thunk-interface 5.4.3 as well.

I've compiled dev-libs/rocm-opencl-runtime and dev-libs/rocr-runtime with debug symbols:

FEATURES="${FEATURES} nostrip"
CFLAGS="${CFLAGS} -ggdb3 -Wall"
CXXFLAGS="${CFLAGS}"

and I've enabled the debug use flag to enable assertions and other debug code paths as well.

I've tried ROCm-OpenCL-Runtime from git master but it still gives me the very same error at runtime.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions