rust-lang / rust-project-goals

Rust Project Goals tracker
https://rust-lang.github.io/rust-project-goals/
MIT License
46 stars 44 forks source link

Explore sandboxed build scripts #108

Open nikomatsakis opened 3 months ago

nikomatsakis commented 3 months ago
Metadata
Owner(s) @weihanglo
Team(s) cargo, compiler
Goal document 2024h2/sandboxed-build-script

Summary

Explore different strategies for sandboxing build script executions in Cargo.

Tasks and status

nikomatsakis commented 3 months ago

This issue is intended for status updates only.

For general questions or comments, please contact the owner(s) directly.

weihanglo commented 3 months ago

Key developments:

Have been looking into different sandbox runtime choices. Here is a simple version of the comparison of three potential choices:

There are prior research on cross-over between each of these options. I've been busy these two weeks. Will update a more detail post for prior arts afterward.

The biggest challenge I am seeing now is spawning external processes. Most build script usages invoke some external binaries, like pkg-config for building *-sys crates, or protoc for generating protobuf bindings. If process spawning is that common, we need to find a way to provide a fine-grained permission granting scheme. I don't want it to see an β€œall-or-nothing” mechanism when process spawning is needed.

The other huge headache is setting library search paths. We cannot know every possible path of system libraries ahead of time, but we need to grant access to the runtime.

Blockers:

None.

Help wanted:

None.

weihanglo commented 2 months ago

Having a family urgency. I will be back after RustConf.

weihanglo commented 1 month ago

Key developments:

Building a workable version of wasm-based build script (not yet done). There are some technical difficulties. Not blocking but need to be addressed.

In order to make the development independent of Cargo, RUSTC_WRAPPER was the first approach I tried, though it still lacks some extension point in Cargo:

Some compatibility issues came up when integrating with Cargo:

Blockers:

None.

Help wanted:

None.

weihanglo commented 2 weeks ago

Have a working-in-process pull request in https://github.com/weihanglo/cargo/pull/66.

Let me copy some texts from there :)

What did we achieve in this experiment?

As you can see, we can easily swap to any sandbox runner with a custom target.

We use wasm32-wasip1 above as an example, as it is the one with smaller footprint, very cross-platform, and pretty popular in the Rust community. However, it turns out that `wasm32-wasip1 doesn't support POSIX process spawning. Use cases of process spawning in build scripts are essential, such as

According to the design axioms,

Restrcting process spawning is the top one axiom. We made that with wasm32-wasip1, but have no way to opt-out. This contradicts to the "ensuring -sys crates be built" axiom.

As a result, it is unlikely to use wasm32-wasip1 as a default sandbox environment with this experiment.

Other possibilities

Have talked to some other folks, there are some potential route we could take if we chose wasi as a default sandbox environment.

The offical build-rs crate to the rescue

The Cargo team recently adopted the build-rs crate. It is going to be the official crate providing API for writing build scripts. We could take the advantage of it, telling everyone instead of using std::process::Command, use something like build_rs::Command. So that Cargo could have a full control over how a build script spawning processes.

While it sounds ideal, this doesn't help the current situation because

A Cargo-flavored wasi standard library

There was a discussion in the GSoC "sandboxed proc-macro with wasm" about shipping a custom verion of the standard library for sandboxed wasm. For Cargo's build script, we could potentially ship a wasm32-wasi-cargo target. The std in this custom target could intercept any exec call or process spawning. Then it calls back to the host process (which is Cargo in our case) to determine how to handle process execution. The host process could either reject, or run the external program and post back the result.

This idea sounds pretty hacky and need more investigations of the communication mechanism between the host process and the wasm runtime. Perhaps via sockets, The WebAssembly Component Model, other host function call mechanism. There is also WASIX project which supprots fork/exec though it is currently not a WASI standard not even a proposal.

Continue with other more mature sandbox runtime choices

Since one of major design space is the user interafce of sandbox configuration, we could leave off sandbox runtimes and explore more on the configuration side.

We could, for example, use docker or eBPF as a temporary default runtime, and explore how the configuration should look like. We may want to take a look at the configuration of Cackle-rs as a starting point. By doing so, we wouldn't block on waiting for wasm runtime to being more mature.