VeriFIT / z3-noodler

The Z3-Noodler String Solver
Other
9 stars 5 forks source link

Reenable rewriter rule `(str.in A (str.to_re B))` -> `(A = B)` #137

Closed jurajsic closed 6 months ago

jurajsic commented 6 months ago

This PR reenables the rewriter rule (str.in A (str.to_re B)) -> (A = B) which was turned off in #39 (probably because of slent) and partly reenabled in #126.

jurajsic commented 6 months ago

After reenabling the rule a in (str.to_re a') -> (a = a'):

# of formulae: 103318
--------------------------------------------------
tool                                                 ✅     ❌       time    avg    med    std    sat    unsat    unknown     TO    MO+ERR    other
-----------------------------------------------  ------  -----  ---------  -----  -----  -----  -----  -------  ---------  -----  --------  -------
z3-noodler-e58cb97-d95fe13                       102120   1198   13149.57   0.13   0.01   1.55  62768    39352        636    509        53        0
z3-noodler-605ccd5-d95fe13                       101976   1342   15990.37   0.16   0.01   1.75  62637    39339        638    658        46        0
cvc5-1.1.2                                        99699   3619   76478.24   0.77   0.00   6.77  60798    38901          2   3608         9        0
z3-4.13.0                                         97687   5631  132036.99   1.35   0.01   8.11  58874    38813        210   4960       461        0
ostrich-5dd2e10ca                                 86344  16986  782639.00   9.06   1.75  19.20  47556    38788          0  16962        12       12
z3-4.13.0+cvc5-1.1.2                             100756   2562   60604.78   0.60   0.00   6.00  61099    39657       2562      0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2            103176    142    9532.31   0.09   0.00   1.43  63261    39915        142      0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0             102785    533   10145.54   0.10   0.01   1.10  62956    39829        533      0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2  103190    128    8963.63   0.09   0.00   1.46  63270    39920        128      0         0        0
--------------------------------------------------

Interesting benchmarks:

Benchmark automatark
# of formulae: 15995
--------------------------------------------------
tool                                                ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
-----------------------------------------------  -----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-e58cb97-d95fe13                       15970    25    904.53   0.06   0.01   0.67  10466     5504          0    25         0        0
z3-noodler-605ccd5-d95fe13                       15963    32   1014.05   0.06   0.01   0.67  10463     5500          0    32         0        0
cvc5-1.1.2                                       15902    93   2216.97   0.14   0.00   2.83  10460     5442          0    85         8        0
z3-4.13.0                                        15870   125   6425.06   0.40   0.02   3.90  10469     5401          0   124         1        0
ostrich-5dd2e10ca                                15947    48  24332.97   1.53   1.41   1.89  10440     5507          0    48         0        0
z3-4.13.0+cvc5-1.1.2                             15969    26    543.82   0.03   0.00   1.22  10481     5488         26     0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2            15991     4    177.80   0.01   0.00   0.27  10478     5513          4     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0             15995     0    389.91   0.02   0.01   0.36  10481     5514          0     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2  15995     0    100.21   0.01   0.00   0.16  10481     5514          0     0         0        0
--------------------------------------------------

Benchmark slent
# of formulae: 1128
--------------------------------------------------
tool                                               ✅    ❌     time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
-----------------------------------------------  ----  ----  -------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-e58cb97-d95fe13                       1124     4   236.71   0.21   0.01   3.16    626      498          0     4         0        0
z3-noodler-605ccd5-d95fe13                       1125     3    25.59   0.02   0.01   0.11    629      496          1     2         0        0
cvc5-1.1.2                                       1104    24  1079.50   0.98   0.00   6.51    618      486          0    24         0        0
z3-4.13.0                                        1055    73   385.56   0.37   0.01   4.01    571      484          0    73         0        0
ostrich-5dd2e10ca                                1002   126  5154.54   5.14   2.00  13.05    592      410          0   126         0        0
z3-4.13.0+cvc5-1.1.2                             1112    16  1044.79   0.94   0.00   6.35    620      492         16     0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2            1128     0    10.29   0.01   0.00   0.08    630      498          0     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0             1128     0    28.62   0.03   0.01   0.39    630      498          0     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2  1128     0     7.56   0.01   0.00   0.05    630      498          0     0         0        0
--------------------------------------------------

Benchmark webapp
# of formulae: 681
--------------------------------------------------
tool                                               ✅    ❌     time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
-----------------------------------------------  ----  ----  -------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-e58cb97-d95fe13                        361   320    14.47   0.04   0.02   0.09    133      228        316     2         2        0
z3-noodler-605ccd5-d95fe13                        355   326    66.63   0.19   0.02   1.99    130      225        316     6         4        0
cvc5-1.1.2                                        588    93   864.69   1.47   0.01   9.96    165      423          0    93         0        0
z3-4.13.0                                         450   231   312.63   0.69   0.02   6.78     36      414        120   111         0        0
ostrich-5dd2e10ca                                 539   142  6434.26  11.94   4.14  24.06    125      414          0   142         0        0
z3-4.13.0+cvc5-1.1.2                              590    91   861.65   1.46   0.01   9.95    167      423         91     0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2             633    48   444.93   0.70   0.01   6.96    210      423         48     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0              551   130    70.95   0.13   0.02   2.30    134      417        130     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2   633    48   443.27   0.70   0.01   6.96    210      423         48     0         0        0
--------------------------------------------------

Benchmark kaluza
# of formulae: 19432
--------------------------------------------------
tool                                                ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
-----------------------------------------------  -----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-e58cb97-d95fe13                       19209   223    940.47   0.05   0.01   1.58  16111     3098          0   209        14        0
z3-noodler-605ccd5-d95fe13                       19106   326   1063.84   0.06   0.01   1.38  15999     3107          0   320         6        0
cvc5-1.1.2                                       19431     1   1061.97   0.05   0.00   1.87  16302     3129          0     1         0        0
z3-4.13.0                                        19148   284   2019.12   0.11   0.01   2.02  16032     3116          0   284         0        0
ostrich-5dd2e10ca                                19144   290  37052.30   1.94   1.39   3.45  16015     3129          0   286         2        2
z3-4.13.0+cvc5-1.1.2                             19431     1    911.09   0.05   0.00   1.82  16302     3129          1     0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2            19431     1   1045.84   0.05   0.00   1.87  16302     3129          1     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0             19266   166    484.52   0.03   0.01   0.47  16144     3122        166     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2  19431     1    899.28   0.05   0.00   1.82  16302     3129          1     0         0        0
--------------------------------------------------

Benchmark redos
# of formulae: 3287
--------------------------------------------------
tool                                               ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
-----------------------------------------------  ----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-e58cb97-d95fe13                       3284     3   2490.22   0.76   0.03   3.61   2444      840          0     3         0        0
z3-noodler-605ccd5-d95fe13                       3256    31   5271.29   1.62   0.91   6.17   2428      828          0    31         0        0
cvc5-1.1.2                                       1070  2217  27894.06  26.07   0.03  38.26    574      496          0  2217         0        0
z3-4.13.0                                         888  2399    326.46   0.37   0.01   2.85     48      840          0  2099       300        0
ostrich-5dd2e10ca                                1715  1572  55284.56  32.24  33.17  36.98   1053      662          0  1572         0        0
z3-4.13.0+cvc5-1.1.2                             1414  1873  27009.57  19.10   0.02  34.70    574      840       1873     0         0        0
z3-noodler-e58cb97-d95fe13+cvc5-1.1.2            3284     3   2349.02   0.72   0.02   3.60   2444      840          3     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0             3284     3   2221.45   0.68   0.02   3.60   2444      840          3     0         0        0
z3-noodler-e58cb97-d95fe13+z3-4.13.0+cvc5-1.1.2  3284     3   2217.38   0.68   0.02   3.60   2444      840          3     0         0        0
--------------------------------------------------

We improved on automatark, redos, kaluza, and webapp (but the last one is probably some randomness), while there is slight degradation on few (4 or so) formulae from slent (which might be the reason this rule was turned off in #39.