Model evaluation for complex Booleans takes too much time -> accessing Boolean assignment directly

dbueno commented 5 years ago

Problem

I'm writing a model checker. On very large formulas, my bottleneck is eval()ing formulas in Z3. Evaluating Boolean formulas takes too much time doing rewriting, even with cached rewrites. I'd like to access the Boolean assignment from the solver directly.

The time eval() takes is critical because I query the model for every Boolean subformula of the input formula. I don't need to simplify the formula or to complete the model; I just want to know, is the assignment to a subformula true, false, or irrelevant.

Context

I use Z3 4.8.5 as a backend solver. I'm using the C++ API. I configure each solver with auto-config=false, using either theory QF_UF or QF_AUFBV. I'm using push() and pop() as well as solving under assumptions.

Attempted solution

I tried modifying the formulas sent to Z3 by adding Tseitin-like variables. I introduced, for a given subformula f, a fresh variable t and the constraint t = f. The goal of this was to call eval(t) whenever eval(f) was demanded, obviating Z3's need to rewrite anything, since t is just a variable.

Unfortunately, this solution perturbed the models found during the many internal checking queries; in all cases I tested, the model checker's performance was significantly worse. There was no obvious reason for the models to be different.

Solution

Is it possible to expose the Boolean assignment to the end user?

The smt_context has a method, get_assignment. If I were to try, I'd expose this method through smt::kernel, smt_solver, solver, all the way through z3's public c++ interface. Presumable, then, I'd be able to call get_assignment on Boolean z3::expr nodes and access their assignments quickly.

My questions are: is this a good proposed solution? Is it going to work even after all the pre-processing that Z3 does on the input formula? Will it work for theories other than QF_UF and QF_AUFBV? Is there another way I should consider?

Thank you for reading.

NikolajBjorner commented 5 years ago

possibly doable for UF and AUFBV like formulas: The model object may be fed a cache comprising of the current literals. Z3 uses "model transformers" to deal with preprocessing transformations and hiding internal variables. For some transformations the cache would become invalid, but not too likely in the EUF case. It still requires some evaluation to understand whether this approach is going to work.

The alternative is for the consumer of models to build a custom eval that understands how to do something better than the linear scan of function tables that happens for n-ary functions and predicates.

A significant limitation of just caching solver state is that bit-vectors and arithmetic terms are rewritten during pre-processing so the atomic formulas maintained by the solver are unrelated to the input.

NikolajBjorner commented 5 years ago

To narrow in on what is required, could you upload a representative example in smtlib2 format using eval/evaluate?

dbueno commented 5 years ago

Thanks for following up. I admit I'm not sure what action to take based on your first comment. I created an example C++ program and an SMT2 file that shows the behavior I'm trying to avoid in eval(). In the gist there are two files: z3eval_slow.cc and z3eval_slow.smt2 https://gist.github.com/dbueno/70d13750d25a1fbb71b687c65fedb5eb. After compiling, just run it on the smt2 file.

While I was working on this example, I figured out a temporary workaround (guarded by #ifndef FAST in the file). However, even with that I don't know how to explain why it's so much faster, since it only avoids about 1/3 of the evals but the runtime is 7s vs originally 2m30s on my machine.

Let me know if there's anything else you need or I can explain what my test is doing.

NikolajBjorner commented 5 years ago

The smt2 file is truncated.
the cc file is not easy to compile on my end
seems you are walking (a lot of) subexpressions and evaluating these, contrast with my preliminary understanding that you just wanted to evaluate atomic predicates.
your atomic predicates are either propositional variables or equalities between bit-vectors. I didn't find any functions or predicates taking arguments. In this case you can directly enumerate the interpretations for these atomic predicates on the model object. Then you can bottom-up evaluate formulas using truth-tables. It is easier for you to manage this process than rely on the model evaluator that has to balance internal cache eviction.

dbueno commented 5 years ago

Apologies, I've changed it so it shouldn't be truncated
Did I make a mistake that I can fix in the cc file, or should I try to make a .smt2 file that emulates the cc file?
That's right, I'm starting at the top of the formula and evaluating subexpressions on the way down
There should now be occurrences of BADD_32_32 and SGT_32_32; the first is a function and the second is a predicate, both uninterpreted. I don't want to enumerate atomic predicates directly, as you suggest, because I want to prune which predicates are evaluated based on my top-down evaluation of the formula. For instance, if P1 and P2 are given a valuation in the model, and (or P1 P2) is true, I will just evaluate either P1 or P2 but not both. Let me know if this doesn't make sense.

NikolajBjorner commented 5 years ago

A bottom-up labeling of the formula should be faster than your top-down evaluation. The built-in evaluator uses very few shortcuts and it has a bottleneck on SGT_32_32 and similar functions. I don't think I will have bandwidth to get over the compilation issues so I suggest some approaches:

you can add calls to (eval ..) in the smt2 file to emulate what the cc program does.
you can run a profiler on your code and determine how much time Z3 really spends in function definition lookups. Caching the function table lookups as hash-tables is one possible feature to add to the evaluator, but I would only want to add such a feature if it is a realistic bottleneck.
A bottom-up evaluator can build the very same hash-tables for the functions: the func_interp class contains a vector of func_interp entries. Each entry maps a tuple of expressions (that are values) to a value. Combinations that don't exist in the vector map to the "else_value".

dbueno commented 5 years ago

I've updated the files for this issue: https://gist.github.com/dbueno/70d13750d25a1fbb71b687c65fedb5eb

My delay was due to finding an interesting workaround for the time being and doing further investigation. I updated the example so it should compile under the exact same compiler flags that z3 uses (I was using C++17 features last time and now I'm only using C++11).

I tried to print (eval ...) calls to an smt2 file but it did not terminate after hours
I ran a profile showing that 90% of time is spent in bool_rewriter:mk_flat_and_core(unsigned, expr *, obj_ref<expr, ast_manager>&). You can look the profile screenshot here (https://gist.github.com/dbueno/70d13750d25a1fbb71b687c65fedb5eb#file-profile-png). I believe this method is flattening many 2-arg ANDs that are nested into a single AND with >10k args; in the process, it's creating many unnecessary, intermediate ANDs. I haven't tested this hypothesis, but I ran into a similar problem when I wrote my own AND-flattener.

To see the behavior I saw, run the cc file on the smt2 file. To see the behavior go away, add a second argument "fast" to the cc file. This changes one line in my depth-first traversal of the formula and then the evaluation will terminate after a few seconds. The line does one thing: when it is traversing the args of a true AND (resp. false OR), it avoids calls eval() on the children of these nodes, since it’s known that all the children of a true AND must be true (resp. children of a false OR must be false).

I would still like to optimize this process so let me know what else I can provide.

NikolajBjorner commented 5 years ago

Your analysis seems right: the evaluator asks the simplifier to flatten AC operators (and, or, +, *, bvadd, bvmul). This seems not well-motivated for how it is used from the API. Changing this behavior could break something. An experiment is to change https://github.com/Z3Prover/z3/blob/master/src/model/model_evaluator.cpp#L79 to assign flat to false instead of true.

The more systematic solution is to write custom bottom-up evaluator that does a linear pass on the expressions to label them with values instead of rewriting them to normal form.

dbueno commented 5 years ago

I changed flat to false and the runtime is shorter. It takes a little over 2 minutes even though solving is practically instantaneous. Here is the new profile: https://gist.github.com/dbueno/70d13750d25a1fbb71b687c65fedb5eb#file-profile2-png

All the time is in rewriter_tpl::process_app.

Does this mean the bottom up evaluator is the way to go?

NikolajBjorner commented 5 years ago

yes

Z3Prover / z3