Handle panics in Oak Functions loader

One of my takeaways from the Rust-for-Linux RFC discussion on LKML is that trading liveness for safety is not ideal for a server or kernel. That is, panicking when something bad happens (e.g. an array bounds check fails) stops that particular bad thing, but it may also cause loss of data, loss of logging (e.g. an attacker might quickly perform an attack and then crash the kernel before the log info can be saved), denial of service, and similar, which may be just as bad as the original problem.

So, yes, it is really important to try to eliminate panics from the entire Rust codebase. A combination of fuzzing and our formal verification tools is the answer.
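To make the "eliminate panics" point concrete, here is a minimal sketch of a bounds-checked read that reports an error to the caller instead of panicking. The names `LoadError` and `read_u32_le` are made up for illustration and are not taken from the Oak codebase.

```rust
/// Hypothetical error type for a loader (illustrative, not from Oak).
#[derive(Debug, PartialEq)]
pub enum LoadError {
    OutOfBounds { offset: usize, len: usize },
}

/// Reads a little-endian u32 at `offset`.
/// Returns `Err` on a bad offset instead of panicking like `bytes[offset]` would.
pub fn read_u32_le(bytes: &[u8], offset: usize) -> Result<u32, LoadError> {
    let out_of_bounds = LoadError::OutOfBounds { offset, len: bytes.len() };
    // checked_add: a huge offset must not overflow (which panics in debug builds).
    let end = match offset.checked_add(4) {
        Some(end) => end,
        None => return Err(out_of_bounds),
    };
    // `get` returns None when the range does not fit, rather than panicking.
    match bytes.get(offset..end) {
        // The slice is exactly 4 bytes long here, so these indexes cannot fail.
        Some(s) => Ok(u32::from_le_bytes([s[0], s[1], s[2], s[3]])),
        None => Err(out_of_bounds),
    }
}
```

The caller then decides what to do with the error (log it, reject the module, keep serving other requests), which is the liveness that a panic would throw away.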

wrt bytecode... The thing not to do is generate random files of bytes and call it good. You will find lots of ways to create malformed bytecode, but you will only be getting coverage of the bytecode loader. You need multiple layers of fuzzing, with each layer generating inputs that get past the checks the previous layers have already exercised:

1. Random byte files, to check for errors in the bytecode loader.
2. Correctly structured wasm files with type errors, overly large stack offsets as immediate fields, illegal opcodes, etc.
3. Well-formed wasm that probes memory boundaries, etc.
4. Random sequences of commands. E.g. if the Oak API provides functions F(_, _), G(_) and H(), then you generate random sequences like [("F", [42, 87]), ("H", []), ("F", [234231, -5]), ("H", [])] and supply that to a Wasm program that loops over the list, applying the named function to the list of arguments. (Variations on this approach exist, but that's the basic idea.)
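
A rough sketch of the generator side of the last item in the list above, in Rust. Everything here is an assumption for illustration: the `API` table reuses the placeholder names F, G and H with arities 2, 1 and 0, arguments are assumed to be `i32`, and the small xorshift PRNG is only there to keep the example dependency-free and replayable from a seed. The Wasm program that interprets the list is not shown.

```rust
/// Placeholder API: function name and number of arguments (assumed, not Oak's real API).
pub const API: [(&str, usize); 3] = [("F", 2), ("G", 1), ("H", 0)];

/// Tiny deterministic PRNG (xorshift64), so a failing sequence can be replayed from its seed.
pub struct XorShift64(u64);

impl XorShift64 {
    pub fn new(seed: u64) -> Self {
        // xorshift must not start from a zero state.
        XorShift64(if seed == 0 { 0x9E37_79B9_7F4A_7C15 } else { seed })
    }

    pub fn next_u64(&mut self) -> u64 {
        let mut x = self.0;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.0 = x;
        x
    }
}

/// One call: the function name plus its argument list.
pub type Command = (&'static str, Vec<i32>);

/// Generates `len` random calls, each with the right number of arguments.
pub fn random_commands(rng: &mut XorShift64, len: usize) -> Vec<Command> {
    let mut commands = Vec::with_capacity(len);
    for _ in 0..len {
        // Pick one of the API functions uniformly at random.
        let (name, arity) = API[(rng.next_u64() % API.len() as u64) as usize];
        let mut args = Vec::with_capacity(arity);
        for _ in 0..arity {
            // Truncating to i32 yields both positive and negative arguments.
            args.push(rng.next_u64() as i32);
        }
        commands.push((name, args));
    }
    commands
}
```

The resulting list would then be serialized and handed to the Wasm driver program. Because the sequence depends only on the seed, any crash can be reproduced by logging the seed and length.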

(Did a variation of (4) on a microcontroller OS before - found some issues.)

project-oak / oak

Handle panics in Oak Functions loader #1992