VeriFIT / z3-noodler

The Z3-Noodler String Solver
Other
6 stars 5 forks source link

Model of singleton automata #180

Open jurajsic opened 1 day ago

jurajsic commented 1 day ago

If the language of an automaton for a variable contains only one word, we return it immediately instead of looking trough incusions. This can help for cyclic inclusions, we can now generate model for two of the three mentioned formulas in #179 (it cannot generate model for QF_SLIA/20230329-woorpje-lu/track05/05_track_169.smt2).

jurajsic commented 3 hours ago

I am not sure which version is previous, I compare with #176 (0122b33-17ffaf6) and #174 (193f136-17ffaf6):

# of formulae: 103405
--------------------------------------------------
tool                                  ✅    ❌       time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ------  ----  ---------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  102444   961   14328.80   0.14   0.02   1.90  62953    39491        530   359         0       72
z3-noodler-cd34a4a-17ffaf6        102442   963   12208.15   0.12   0.02   1.65  62951    39491        528   374        61        0
z3-noodler-193f136-17ffaf6        102442   963   12345.08   0.12   0.02   1.72  62953    39489        530   372        61        0
z3-noodler-model-193f136-17ffaf6  102439   966   14252.71   0.14   0.02   1.95  62950    39489        534   364         0       68
z3-noodler-0122b33-17ffaf6        102443   962   12100.47   0.12   0.02   1.66  62952    39491        528   372        62        0
cvc5-1.2.0                         99841  3564   73861.19   0.74   0.01   6.42  60931    38910          2  3553         9        0
z3-4.13.0                          97745  5660  128712.47   1.32   0.01   7.95  58927    38818        211  4914       465       70
--------------------------------------------------

Benchmark sygus_qgen
# of formulae: 343
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6   343     0    5.83   0.02   0.02   0.01    343        0          0     0         0        0
z3-noodler-cd34a4a-17ffaf6         343     0    4.45   0.01   0.01   0.01    343        0          0     0         0        0
z3-noodler-193f136-17ffaf6         343     0    4.65   0.01   0.01   0.01    343        0          0     0         0        0
z3-noodler-model-193f136-17ffaf6   343     0    6.39   0.02   0.02   0.01    343        0          0     0         0        0
z3-noodler-0122b33-17ffaf6         343     0    4.46   0.01   0.01   0.01    343        0          0     0         0        0
cvc5-1.2.0                         343     0   80.99   0.24   0.04   0.93    343        0          0     0         0        0
z3-4.13.0                          343     0   10.35   0.03   0.02   0.02    343        0          0     0         0        0
--------------------------------------------------

Benchmark denghang
# of formulae: 999
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6   999     0   49.08   0.05   0.02   0.59    186      813          0     0         0        0
z3-noodler-cd34a4a-17ffaf6         999     0   41.88   0.04   0.02   0.59    186      813          0     0         0        0
z3-noodler-193f136-17ffaf6         999     0   43.22   0.04   0.02   0.61    186      813          0     0         0        0
z3-noodler-model-193f136-17ffaf6   999     0   48.75   0.05   0.02   0.59    186      813          0     0         0        0
z3-noodler-0122b33-17ffaf6         999     0   42.03   0.04   0.02   0.58    186      813          0     0         0        0
cvc5-1.2.0                         982    17  371.55   0.38   0.00   4.64    170      812          0    17         0        0
z3-4.13.0                          883   116  326.28   0.37   0.08   2.10    186      697          0   105        11        0
--------------------------------------------------

Benchmark automatark
# of formulae: 15995
--------------------------------------------------
tool                                 ✅    ❌     time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  -----  ----  -------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  15990     5   892.63   0.06   0.02   0.54  10477     5513          0     5         0        0
z3-noodler-cd34a4a-17ffaf6        15990     5   733.44   0.05   0.01   0.51  10477     5513          0     5         0        0
z3-noodler-193f136-17ffaf6        15990     5   737.85   0.05   0.01   0.51  10477     5513          0     5         0        0
z3-noodler-model-193f136-17ffaf6  15990     5   894.39   0.06   0.02   0.55  10477     5513          0     5         0        0
z3-noodler-0122b33-17ffaf6        15990     5   741.71   0.05   0.01   0.52  10477     5513          0     5         0        0
cvc5-1.2.0                        15903    92  2196.84   0.14   0.00   2.77  10460     5443          0    84         8        0
z3-4.13.0                         15871   124  6107.03   0.38   0.02   3.72  10469     5402          0   123         1        0
--------------------------------------------------

Benchmark stringfuzz
# of formulae: 11618
--------------------------------------------------
tool                                 ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  -----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  11616     2    505.60   0.04   0.02   0.96   6354     5262          1     1         0        0
z3-noodler-cd34a4a-17ffaf6        11616     2    312.34   0.03   0.01   0.17   6354     5262          1     1         0        0
z3-noodler-193f136-17ffaf6        11616     2    323.48   0.03   0.01   0.18   6354     5262          1     1         0        0
z3-noodler-model-193f136-17ffaf6  11616     2    497.35   0.04   0.02   0.91   6354     5262          1     1         0        0
z3-noodler-0122b33-17ffaf6        11616     2    321.25   0.03   0.01   0.18   6354     5262          1     1         0        0
cvc5-1.2.0                        10946   672  30049.07   2.75   0.00  12.34   6184     4762          0   671         1        0
z3-4.13.0                         11090   528  44638.05   4.03   0.01  14.89   6004     5086          0   403       125        0
--------------------------------------------------

Benchmark redos
# of formulae: 3287
--------------------------------------------------
tool                                ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  3283     4   2673.46   0.81   0.05   2.85   2443      840          0     4         0        0
z3-noodler-cd34a4a-17ffaf6        3283     4   2548.21   0.78   0.03   2.82   2443      840          0     4         0        0
z3-noodler-193f136-17ffaf6        3285     2   2631.41   0.80   0.03   3.80   2445      840          0     2         0        0
z3-noodler-model-193f136-17ffaf6  3285     2   2709.66   0.82   0.04   3.83   2445      840          0     2         0        0
z3-noodler-0122b33-17ffaf6        3285     2   2620.25   0.80   0.03   3.77   2445      840          0     2         0        0
cvc5-1.2.0                        1116  2171  27007.04  24.20   0.14  34.28    620      496          0  2171         0        0
z3-4.13.0                          888  2399    297.55   0.34   0.01   2.60     48      840          0  2099       300        0
--------------------------------------------------

Benchmark norn
# of formulae: 1027
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  1027     0   20.49   0.02   0.02   0.01    712      315          0     0         0        0
z3-noodler-cd34a4a-17ffaf6        1027     0   14.70   0.01   0.01   0.01    712      315          0     0         0        0
z3-noodler-193f136-17ffaf6        1027     0   14.31   0.01   0.01   0.01    712      315          0     0         0        0
z3-noodler-model-193f136-17ffaf6  1027     0   20.04   0.02   0.02   0.01    712      315          0     0         0        0
z3-noodler-0122b33-17ffaf6        1027     0   14.60   0.01   0.01   0.01    712      315          0     0         0        0
cvc5-1.2.0                         943    84  247.71   0.26   0.01   2.41    705      238          0    84         0        0
z3-4.13.0                          904   123  278.96   0.31   0.02   5.21    712      192          0   123         0        0
--------------------------------------------------

Benchmark slog
# of formulae: 1976
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  1976     0   37.79   0.02   0.02   0.02    808     1168          0     0         0        0
z3-noodler-cd34a4a-17ffaf6        1976     0   28.59   0.01   0.01   0.02    808     1168          0     0         0        0
z3-noodler-193f136-17ffaf6        1976     0   27.91   0.01   0.01   0.02    808     1168          0     0         0        0
z3-noodler-model-193f136-17ffaf6  1976     0   37.81   0.02   0.02   0.02    808     1168          0     0         0        0
z3-noodler-0122b33-17ffaf6        1976     0   27.83   0.01   0.01   0.02    808     1168          0     0         0        0
cvc5-1.2.0                        1976     0    1.85   0.00   0.00   0.00    808     1168          0     0         0        0
z3-4.13.0                         1956    20  539.58   0.28   0.01   3.81    788     1168          0    20         0        0
--------------------------------------------------

Benchmark slent
# of formulae: 1128
--------------------------------------------------
tool                                ✅    ❌     time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  -------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  1124     4   220.31   0.20   0.02   3.06    626      498          0     4         0        0
z3-noodler-cd34a4a-17ffaf6        1124     4   243.12   0.22   0.01   2.99    626      498          0     4         0        0
z3-noodler-193f136-17ffaf6        1125     3   242.43   0.22   0.01   2.98    627      498          0     3         0        0
z3-noodler-model-193f136-17ffaf6  1125     3   220.51   0.20   0.02   3.07    627      498          0     3         0        0
z3-noodler-0122b33-17ffaf6        1124     4   239.77   0.21   0.01   2.94    626      498          0     4         0        0
cvc5-1.2.0                        1104    24  1002.26   0.91   0.00   6.12    618      486          0    24         0        0
z3-4.13.0                         1052    76   345.30   0.33   0.01   4.17    571      481          0    76         0        0
--------------------------------------------------

Benchmark omark
# of formulae: 17
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6     9     8    0.19   0.02   0.01   0.02      2        7          0     6         0        2
z3-noodler-cd34a4a-17ffaf6           9     8    0.12   0.01   0.01   0.01      2        7          0     6         2        0
z3-noodler-193f136-17ffaf6           9     8    0.13   0.01   0.01   0.01      2        7          0     6         2        0
z3-noodler-model-193f136-17ffaf6     8     9    0.20   0.03   0.02   0.03      1        7          1     6         0        2
z3-noodler-0122b33-17ffaf6           9     8    0.11   0.01   0.01   0.01      2        7          0     6         2        0
cvc5-1.2.0                           5    12    1.03   0.21   0.00   0.46      2        3          0    12         0        0
z3-4.13.0                            7    10    8.06   1.15   0.01   3.01      2        5          0     8         2        0
--------------------------------------------------

Benchmark kepler
# of formulae: 587
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6   584     3   61.33   0.11   0.02   0.21    284      300          0     3         0        0
z3-noodler-cd34a4a-17ffaf6         584     3   16.21   0.03   0.01   0.07    284      300          0     3         0        0
z3-noodler-193f136-17ffaf6         584     3   16.39   0.03   0.01   0.07    284      300          0     3         0        0
z3-noodler-model-193f136-17ffaf6   584     3   62.36   0.11   0.02   0.21    284      300          0     3         0        0
z3-noodler-0122b33-17ffaf6         584     3   15.45   0.03   0.01   0.07    284      300          0     3         0        0
cvc5-1.2.0                         347   240    0.86   0.00   0.00   0.00     50      297          0   240         0        0
z3-4.13.0                          278   309   25.95   0.09   0.02   0.90    135      143          0   283        26        0
--------------------------------------------------

Benchmark woorpje
# of formulae: 809
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6   769    40  115.50   0.15   0.02   1.50    607      162          1    16         0       23
z3-noodler-cd34a4a-17ffaf6         777    32  415.11   0.53   0.01   6.28    615      162          0    13        19        0
z3-noodler-193f136-17ffaf6         776    33  298.61   0.38   0.01   4.63    614      162          0    14        19        0
z3-noodler-model-193f136-17ffaf6   768    41  114.76   0.15   0.02   1.50    606      162          2    15         0       24
z3-noodler-0122b33-17ffaf6         776    33  295.27   0.38   0.01   4.62    614      162          0    14        19        0
cvc5-1.2.0                         755    54   47.07   0.06   0.01   0.25    591      164          0    54         0        0
z3-4.13.0                          782    27  295.18   0.38   0.02   4.74    618      164          0    27         0        0
--------------------------------------------------

Benchmark webapp
# of formulae: 681
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6   364   317  244.65   0.67   0.03   5.73    136      228        316     0         0        1
z3-noodler-cd34a4a-17ffaf6         365   316  197.66   0.54   0.02   4.81    137      228        315     0         1        0
z3-noodler-193f136-17ffaf6         365   316  200.77   0.55   0.02   4.79    137      228        315     0         1        0
z3-noodler-model-193f136-17ffaf6   364   317  249.13   0.68   0.03   5.58    136      228        316     0         0        1
z3-noodler-0122b33-17ffaf6         365   316  193.41   0.53   0.02   4.71    137      228        315     0         1        0
cvc5-1.2.0                         588    93  767.89   1.31   0.02   8.73    165      423          0    93         0        0
z3-4.13.0                          451   230  317.75   0.70   0.02   6.71     37      414        121   109         0        0
--------------------------------------------------

Benchmark kaluza
# of formulae: 19432
--------------------------------------------------
tool                                 ✅    ❌     time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  -----  ----  -------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  19254   178  2395.76   0.12   0.02   3.15  16157     3097          0   163         0       15
z3-noodler-cd34a4a-17ffaf6        19245   187  1526.94   0.08   0.01   2.35  16148     3097          0   176        11        0
z3-noodler-193f136-17ffaf6        19245   187  1542.89   0.08   0.01   2.38  16148     3097          0   176        11        0
z3-noodler-model-193f136-17ffaf6  19254   178  2425.80   0.13   0.02   3.21  16157     3097          0   163         0       15
z3-noodler-0122b33-17ffaf6        19245   187  1518.68   0.08   0.01   2.33  16148     3097          0   176        11        0
cvc5-1.2.0                        19426     6   459.94   0.02   0.00   0.42  16297     3129          0     6         0        0
z3-4.13.0                         19139   293  1428.86   0.07   0.01   1.71  16022     3117          0   293         0        0
--------------------------------------------------

Benchmark transducer_plus
# of formulae: 91
--------------------------------------------------
tool                                ✅    ❌    time     avg     med     std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  ------  ------  ------  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6     0    91    0.00  nan     nan     nan         0        0         91     0         0        0
z3-noodler-cd34a4a-17ffaf6           0    91    0.00  nan     nan     nan         0        0         91     0         0        0
z3-noodler-193f136-17ffaf6           0    91    0.00  nan     nan     nan         0        0         91     0         0        0
z3-noodler-model-193f136-17ffaf6     0    91    0.00  nan     nan     nan         0        0         91     0         0        0
z3-noodler-0122b33-17ffaf6           0    91    0.00  nan     nan     nan         0        0         91     0         0        0
cvc5-1.2.0                          42    49  482.14   11.48    3.11   22.06     41        1          0    49         0        0
z3-4.13.0                            1    90    0.01    0.01    0.01  nan         0        1         90     0         0        0
--------------------------------------------------

Benchmark leetcode
# of formulae: 2652
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  2649     3   70.84   0.03   0.02   0.06    865     1784          0     0         0        3
z3-noodler-cd34a4a-17ffaf6        2650     2   53.12   0.02   0.01   0.05    866     1784          0     0         2        0
z3-noodler-193f136-17ffaf6        2649     3   52.13   0.02   0.01   0.06    865     1784          0     0         3        0
z3-noodler-model-193f136-17ffaf6  2650     2   90.61   0.03   0.02   0.38    866     1784          0     0         0        2
z3-noodler-0122b33-17ffaf6        2650     2   52.10   0.02   0.01   0.05    866     1784          0     0         2        0
cvc5-1.2.0                        2652     0   36.00   0.01   0.00   0.03    867     1785          0     0         0        0
z3-4.13.0                         2652     0   41.89   0.02   0.01   0.11    867     1785          0     0         0        0
--------------------------------------------------

Benchmark str_small_rw
# of formulae: 1880
--------------------------------------------------
tool                                ✅    ❌    time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  1815    65  365.25   0.20   0.02   2.97      3     1812          4    36         0       25
z3-noodler-cd34a4a-17ffaf6        1815    65  321.88   0.18   0.01   2.57      3     1812          4    37        24        0
z3-noodler-193f136-17ffaf6        1814    66  402.83   0.22   0.01   3.20      4     1810          6    35        25        0
z3-noodler-model-193f136-17ffaf6  1814    66  412.04   0.23   0.02   3.10      4     1810          6    37         0       23
z3-noodler-0122b33-17ffaf6        1815    65  330.24   0.18   0.01   2.67      3     1812          4    36        25        0
cvc5-1.2.0                        1860    20   34.43   0.02   0.00   0.62     19     1841          2    18         0        0
z3-4.13.0                         1821    59  202.53   0.11   0.01   2.66     25     1796          0    59         0        0
--------------------------------------------------

Benchmark pyex
# of formulae: 23845
--------------------------------------------------
tool                                 ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  -----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  23757    88   5108.15   0.22   0.12   1.55  20386     3371          0    86         0        2
z3-noodler-cd34a4a-17ffaf6        23752    93   4341.90   0.18   0.10   1.36  20381     3371          0    91         2        0
z3-noodler-193f136-17ffaf6        23753    92   4432.06   0.19   0.10   1.39  20382     3371          0    92         0        0
z3-noodler-model-193f136-17ffaf6  23750    95   4934.37   0.21   0.12   1.40  20379     3371          0    95         0        0
z3-noodler-0122b33-17ffaf6        23752    93   4288.73   0.18   0.10   1.35  20381     3371          0    91         2        0
cvc5-1.2.0                        23820    25   6002.12   0.25   0.07   2.51  20400     3420          0    25         0        0
z3-4.13.0                         22895   950  55228.54   2.41   0.12  10.38  19624     3271          0   950         0        0
--------------------------------------------------

Benchmark full_str_int
# of formulae: 16968
--------------------------------------------------
tool                                 ✅    ❌      time    avg    med    std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  -----  ----  --------  -----  -----  -----  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6  16815   153   1560.69   0.09   0.02   1.48   2494    14321        117    35         0        1
z3-noodler-cd34a4a-17ffaf6        16817   151   1407.62   0.08   0.01   1.46   2496    14321        117    34         0        0
z3-noodler-193f136-17ffaf6        16816   152   1373.22   0.08   0.01   1.53   2495    14321        117    35         0        0
z3-noodler-model-193f136-17ffaf6  16816   152   1527.30   0.09   0.02   1.52   2495    14321        117    34         0        1
z3-noodler-0122b33-17ffaf6        16817   151   1393.70   0.08   0.01   1.44   2496    14321        117    34         0        0
cvc5-1.2.0                        16963     5   5072.37   0.30   0.01   1.39   2521    14442          0     5         0        0
z3-4.13.0                         16732   236  18620.60   1.11   0.01   6.38   2476    14256          0   236         0        0
--------------------------------------------------

Benchmark snia
# of formulae: 70
--------------------------------------------------
tool                                ✅    ❌    time     avg     med     std    sat    unsat    unknown    TO    MO+ERR    other
--------------------------------  ----  ----  ------  ------  ------  ------  -----  -------  ---------  ----  --------  -------
z3-noodler-model-cd34a4a-17ffaf6    70     0    1.25    0.02    0.02    0.01     70        0          0     0         0        0
z3-noodler-cd34a4a-17ffaf6          70     0    0.86    0.01    0.01    0.01     70        0          0     0         0        0
z3-noodler-193f136-17ffaf6          70     0    0.79    0.01    0.01    0.00     70        0          0     0         0        0
z3-noodler-model-193f136-17ffaf6    70     0    1.24    0.02    0.02    0.01     70        0          0     0         0        0
z3-noodler-0122b33-17ffaf6          70     0    0.88    0.01    0.01    0.01     70        0          0     0         0        0
cvc5-1.2.0                          70     0    0.03    0.00    0.00    0.00     70        0          0     0         0        0
z3-4.13.0                            0    70    0.00  nan     nan     nan         0        0          0     0         0       70
--------------------------------------------------