Enable (optional) link-time optimizations for the libCHIP library.
Add early exits in couple functions.
Avoid redundant kernel argument copies. Along the way, fix CHIPGraphNodeKernel instances didn't copy kernel arguments fully (they only copied pointers to arguments but not their values).
Eliminate map lookups in SPVFuncInfo::visit*Args().
... for reducing kernel launch host overhead.