ewlu / riscv-gnu-toolchain

GNU toolchain for RISC-V, including GCC
Other
0 stars 0 forks source link

RISC-V GNU Compiler Toolchain

This is the RISC-V C and C++ cross-compiler. It supports two build modes: a generic ELF/Newlib toolchain and a more sophisticated Linux-ELF/glibc toolchain.

Getting the sources

This repository uses submodules, but submodules will fetch automatically on demand, so --recursive or git submodule update --init --recursive is not needed.

$ git clone https://github.com/riscv/riscv-gnu-toolchain

Warning: git clone takes around 6.65 GB of disk and download size

Prerequisites

Several standard packages are needed to build the toolchain.

On Ubuntu, executing the following command should suffice:

$ sudo apt-get install autoconf automake autotools-dev curl python3 python3-pip libmpc-dev libmpfr-dev libgmp-dev gawk build-essential bison flex texinfo gperf libtool patchutils bc zlib1g-dev libexpat-dev ninja-build git cmake libglib2.0-dev

On Fedora/CentOS/RHEL OS, executing the following command should suffice:

$ sudo yum install autoconf automake python3 libmpc-devel mpfr-devel gmp-devel gawk  bison flex texinfo patchutils gcc gcc-c++ zlib-devel expat-devel

On Arch Linux, executing the following command should suffice:

$ sudo pacman -Syyu autoconf automake curl python3 libmpc mpfr gmp gawk base-devel bison flex texinfo gperf libtool patchutils bc zlib expat

Also available for Arch users on the AUR: https://aur.archlinux.org/packages/riscv-gnu-toolchain-bin

On OS X, you can use Homebrew to install the dependencies:

$ brew install python3 gawk gnu-sed gmp mpfr libmpc isl zlib expat texinfo flock

To build the glibc (Linux) on OS X, you will need to build within a case-sensitive file system. The simplest approach is to create and mount a new disk image with a case sensitive format. Make sure that the mount point does not contain spaces. This is not necessary to build newlib or gcc itself on OS X.

This process will start by downloading about 200 MiB of upstream sources, then will patch, build, and install the toolchain. If a local cache of the upstream sources exists in $(DISTDIR), it will be used; the default location is /var/cache/distfiles. Your computer will need about 8 GiB of disk space to complete the process.

Installation (Newlib)

To build the Newlib cross-compiler, pick an install path (that is writeable). If you choose, say, /opt/riscv, then add /opt/riscv/bin to your PATH. Then, simply run the following command:

./configure --prefix=/opt/riscv
make

You should now be able to use riscv64-unknown-elf-gcc and its cousins.

Installation (Linux)

To build the Linux cross-compiler, pick an install path (that is writeable). If you choose, say, /opt/riscv, then add /opt/riscv/bin to your PATH. Then, simply run the following command:

./configure --prefix=/opt/riscv
make linux

The build defaults to targeting RV64GC (64-bit) with glibc, even on a 32-bit build environment. To build the 32-bit RV32GC toolchain, use:

./configure --prefix=/opt/riscv --with-arch=rv32gc --with-abi=ilp32d
make linux

In case you prefer musl libc over glibc, configure just like above and opt for make musl instead of make linux.

Supported architectures are rv32i or rv64i plus standard extensions (a)tomics, (m)ultiplication and division, (f)loat, (d)ouble, or (g)eneral for MAFD.

Supported ABIs are ilp32 (32-bit soft-float), ilp32d (32-bit hard-float), ilp32f (32-bit with single-precision in registers and double in memory, niche use only), lp64 lp64f lp64d (same but with 64-bit long and pointers).

Installation (Newlib/Linux multilib)

To build either cross-compiler with support for both 32-bit and 64-bit, run the following command:

./configure --prefix=/opt/riscv --enable-multilib

And then either make, make linux or make musl for the Newlib, Linux glibc-based or Linux musl libc-based cross-compiler, respectively.

The multilib compiler will have the prefix riscv64-unknown-elf- or riscv64-unknown-linux-gnu- but will be able to target both 32-bit and 64-bit systems. It will support the most common -march/-mabi options, which can be seen by using the --print-multi-lib flag on either cross-compiler.

The musl compiler (riscv64-unknown-linux-musl-) will only be able to target 64-bit systems due to limitations in the upstream musl architecture support. The --enable-multilib flag therefore does not actually enable multilib support for musl libc.

Troubleshooting Build Problems

Builds work best if installing into an empty directory. If you build a hard-float toolchain and then try to build a soft-float toolchain with the same --prefix directory, then the build scripts may get confused and exit with a linker error complaining that hard float code can't be linked with soft float code. Removing the existing toolchain first, or using a different prefix for the second build, avoids the problem. It is OK to build one newlib and one linux toolchain with the same prefix. But you should avoid building two newlib or two linux toolchains with the same prefix.

If building a linux toolchain on a MacOS system, or on a Windows system using the Linux subsystem or cygwin, you must ensure that the filesystem is case-sensitive. A build on a case-insensitive filesystem will fail when building glibc because *.os and *.oS files will clobber each other during the build eventually resulting in confusing link errors.

Centos (and RHEL) provide old GNU tools versions that may be too old to build a RISC-V toolchain. There is an alternate toolset provided that includes current versions of the GNU tools. This is the devtoolset provided as part of the Software Collection service. For more info, see the devtoolset-7 URL. There are various versions of the devtoolset that are available, so you can also try other versions of it, but we have at least one report that devtoolset-7 works.

Advanced Options

There are a number of additional options that may be passed to configure. See './configure --help' for more details.

Also you can define extra flags to pass to specific projects: BINUTILS_NATIVE_FLAGS_EXTRA, BINUTILS_TARGET_FLAGS_EXTRA, GCC_EXTRA_CONFIGURE_FLAGS, GDB_NATIVE_FLAGS_EXTRA, GDB_TARGET_FLAGS_EXTRA, GLIBC_NATIVE_FLAGS_EXTRA, GLIBC_TARGET_FLAGS_EXTRA. Example: GCC_EXTRA_CONFIGURE_FLAGS=--with-gmp=/opt/gmp make linux

Set default ISA spec version

--with-isa-spec= can specify the default version of the RISC-V Unprivileged (formerly User-Level) ISA specification.

Possible options are: 2.2, 20190608 and 20191213.

The default version is 20191213.

More details about this option you can refer this post RISC-V GNU toolchain bumping default ISA spec to 20191213.

Build with customized multi-lib configure.

--with-multilib-generator= can specify what multilibs to build. The argument is a semicolon separated list of values, possibly consisting of a single value. Currently only supported for riscv--elf*. The accepted values and meanings are given below.

Every config is constructed with four components: architecture string, ABI, reuse rule with architecture string and reuse rule with sub-extension.

Re-use part support expansion operator (*) to simplify the combination of different sub-extensions, example 4 demonstrate how it uses and works.

Example 1: Add multi-lib support for rv32i with ilp32.

./configure --with-multilib-generator="rv32i-ilp32--"

Example 2: Add multi-lib support for rv32i with ilp32 and rv32imafd with ilp32.

./configure --with-multilib-generator="rv32i-ilp32--;rv32imafd-ilp32--"

Example 3: Add multi-lib support for rv32i with ilp32; rv32im with ilp32 and rv32ic with ilp32 will reuse this multi-lib set.

./configure --with-multilib-generator="rv32i-ilp32-rv32im-c"

Example 4: Add multi-lib support for rv64ima with lp64; rv64imaf with lp64, rv64imac with lp64 and rv64imafc with lp64 will reuse this multi-lib set.

./configure --with-multilib-generator="rv64ima-lp64--f*c"

Test Suite

The Dejagnu test suite has been ported to RISC-V. This can be run with a simulator for the elf and linux toolchains. The simulator can be selected by the SIM variable in the Makefile, e.g. SIM=qemu, SIM=gdb, or SIM=spike (experimental).In addition, the simulator can also be selected with the configure time option --with-sim=.However, the testsuite allowlist is only mintained for qemu.Other simulators might get extra failures. To test GCC, run the following commands:

./configure --prefix=$RISCV --disable-linux --with-arch=rv64ima # or --with-arch=rv32ima
make newlib
make report-newlib SIM=gdb # Run with gdb simulator

./configure --prefix=$RISCV
make linux
make report-linux SIM=qemu # Run with qemu

./configure --prefix=$RISCV --with-sim=spike
make linux
make report               # Run with spike

Note:

Selecting the tests to run in GCC's regression test suite

By default GCC will execute all tests of its regression test suite. While running them in parallel (e.g. make -j$(nproc) report) will significanlty speed up the execution time on multi-processor systems, the required time for executing all tests is usually too high for typical development cycles. Therefore GCC allows to select the tests that are being executed using the environment variable RUNTESTFLAGS.

To restrict a test run to only RISC-V specific tests the following command can be used:

RUNTESTFLAGS="riscv.exp" make report

To to restrict a test run to only RISC-V specific tests with match the pattern "zb.c" and "sm.c" the following command can be used:

RUNTESTFLAGS="riscv.exp=zb.c\ sm.c" make report

Testing GCC, Binutils, and glibc of a Linux toolchain

The default Makefile target to run toolchain tests is report. This will run all tests of the GCC regression test suite. Alternatively, the following command can be used to do the same:

make check-gcc

The following command can be used to run the Binutils tests:

make check-binutils

The command below can be used to run the glibc tests:

make check-glibc-linux

LLVM / clang

LLVM can be used in combination with the RISC-V GNU Compiler Toolchain to build RISC-V applications. To build LLVM with C and C++ support the configure flag --enable-llvm can be used.

E.g. to build LLVM on top of a RV64 Linux toolchain the following commands can be used:

./configure --prefix=$RISCV --enable-llvm --enable-linux make

Note, that a combination of --enable-llvm and multilib configuration flags is not supported.

Below are examples how to build a rv64gc Linux/newlib toolchain with LLVM support, how to use it to build a C and a C++ application using clang, and how to execute the generated binaries using QEMU.

Build Linux toolchain and run examples:

# Build rv64gc toolchain with LLVM
./configure --prefix=$RISCV --enable-llvm --enable-linux --with-arch=rv64gc --with-abi=lp64d
make -j$(nproc) all build-sim SIM=qemu
# Build C application with clang
$RISCV/bin/clang -march=rv64imafdc -o hello_world hello_world.c
$RISCV/bin/qemu-riscv64 -L $RISCV/sysroot ./hello_world
# Build C++ application with clang
$RISCV/bin/clang++ -march=rv64imafdc -stdlib=libc++ -o hello_world_cpp hello_world_cpp.cxx
$RISCV/bin/qemu-riscv64 -L $RISCV/sysroot ./hello_world_cpp

Build newlib toolchain and run examples (don't work with --with-multilib-generator=):

# Build rv64gc bare-metal toolchain with LLVM
./configure --prefix=$RISCV --enable-llvm --disable-linux --with-arch=rv64gc --with-abi=lp64d
make -j$(nproc) all build-sim SIM=qemu
# Build C application with clang
$RISCV/bin/clang -march=rv64imafdc -o hello_world hello_world.c
$RISCV/bin/qemu-riscv64 -L $RISCV/sysroot ./hello_world
# Build C++ application with clang using static link
$RISCV/bin/clang++ -march=rv64imafdc -static -o hello_world_cpp hello_world_cpp.cxx
$RISCV/bin/qemu-riscv64 -L $RISCV/sysroot ./hello_world_cpp

Development

This section is only for developer or advanced user, or you want to build toolchain with your own source tree.

Update Source Tree

riscv-gnu-toolchain contain stable but not latest source for each submodule, in case you want to using latest develoment tree, you can use following command to upgrade all submodule.

git submodule update --remote

Or you can upgrade specific submodule only.

git submodule update --remote <component>

For example, upgrade gcc only, you can using following command:

git submodule update --remote gcc

How to Check Which Branch are Used for Specific submodule

The branch info has recorded in .gitmodules file, which can set or update via git submodule add -b or git submodule set-branch.

However the only way to check which branch are using is to check .gitmodules file, here is the example for gcc, it's using releases/gcc-12 branch, so it will has a section named gcc and has a field branch is releases/gcc-12.

[submodule "gcc"]
        path = gcc
        url = ../gcc.git
        branch = releases/gcc-12

Use Source Tree Other Than riscv-gnu-toolchain

riscv-gnu-toolchain also support using out-of-tree source to build toolchain, there is couple configure option to specify the source tree of each submodule/component.

For example you have a gcc in $HOME/gcc, use --with-gcc-src can specify that:

./configure --with-gcc-src=$HOME/gcc

Here is the list of configure option for specify source tree:

--with-binutils-src
--with-gcc-src
--with-gdb-src
--with-glibc-src
--with-linux-headers-src
--with-llvm-src
--with-musl-src
--with-newlib-src
--with-pk-src
--with-qemu-src
--with-spike-src