lbehnke / h2database

Automatically exported from code.google.com/p/h2database

Inefficient join when complex where clause is supplied #146

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1. Compile the attached Java program, with H2 on the classpath.
2. Run it.
3. Uncomment part of the WHERE clause and run it again.

What is the expected output? What do you see instead?

There are two tables, "entity" and "relation"; "entity" has an FK on
"relation"'s PK. The test tries to select the entities which are connected
to other entities with a distinct set of values.

The performance of H2 drops significantly with the complex WHERE clause
turned on. EXPLAIN shows that H2 performs the "entity"{x}"entity" join first
and only then filters the results, thus ending up O(N^2) in the "entity" row
count.
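
The attached test program is not included in the export, but judging from the schema and the workaround queries quoted later in the thread, the query in question has roughly this shape (a sketch; the literals follow the later examples, not necessarily the attachment):

```sql
-- Sketch of the slow query: e1 and e2 are joined through their shared
-- relation, and the predicate on e1.value tempts H2 into driving the
-- entity-to-entity join first, which is O(N^2) in the entity count.
SELECT e1.value
FROM entity AS e1, relation, entity AS e2
WHERE (e1.value = 'COMMONVALUE')
  AND ((e2.value = 'NOTEXISTVALUE1') OR (e2.value = 'NOTEXISTVALUE2'))
  AND (e2.rel_id = relation.id)
  AND (e1.rel_id = relation.id);
```

Commenting out the value predicates corresponds to the fast timings below; with them enabled, every iteration takes roughly 700 ms.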

* WHERE clause commented
Iteration 0: 26 ms
Iteration 1: 1 ms
Iteration 2: 2 ms
Iteration 3: 2 ms
Iteration 4: 1 ms
Iteration 5: 2 ms
Iteration 6: 1 ms
Iteration 7: 2 ms
Iteration 8: 1 ms
Iteration 9: 1 ms

* WHERE clause uncommented
Iteration 0: 1132 ms
Iteration 1: 720 ms
Iteration 2: 713 ms
Iteration 3: 733 ms
Iteration 4: 733 ms
Iteration 5: 722 ms
Iteration 6: 726 ms
Iteration 7: 726 ms
Iteration 8: 699 ms
Iteration 9: 720 ms

What version of the product are you using? On what operating system, file
system, and virtual machine?

The latest, 1.2.123. The issue was reproduced on both Linux and Solaris,
with Sun JDK 1.6.0_14/1.6.0_16.

Do you know a workaround?

No.

How important/urgent is the problem for you?

This test mimics a real performance problem with H2 in a production
environment. In the real situation there are 10K+ entities present, and the
query in question easily takes 200+ seconds (!) to run. Apache Derby,
PostgreSQL, and Oracle do not seem to have this problem; you can change the
JDBC connection string in the test to confirm.

In your view, is this a defect or a feature request?

This is a performance defect (surprisingly, the only one we have faced in a
complex JPA-driven system; good job!).

Please provide any additional information below.

I'm going to donate $10 via PayPal as a token of gratitude if this issue is
fixed and a patched binary is provided before 8 p.m. PST, December 6.

Original issue reported on code.google.com by aleksey....@gmail.com on 5 Dec 2009 at 1:56

Attachments:

GoogleCodeExporter commented 9 years ago
Hi,

I'm sorry, this will require some more time. Merge joins
are not yet implemented in H2. There is a feature request for them,
and I will increase its priority.

There is a workaround: give H2 a 'hint' not
to use the index on e1.value. See the last query below (which is fast):

DROP TABLE entity;

DROP TABLE relation;

CREATE TABLE entity (id INT NOT NULL, rel_id INT, 
value VARCHAR(200), PRIMARY KEY(id));

CREATE INDEX rel_index ON entity(rel_id);

CREATE INDEX value_index ON entity(value);

CREATE TABLE relation (id INT NOT NULL, PRIMARY KEY(id));

INSERT INTO relation(id) select x from system_range(1, 10);

INSERT INTO entity(id, rel_id, value) 
select x, casewhen(rand()<0.1, mod(x, 10), null), 
'COMMONVALUE' from system_range(1, 1000);

SELECT e1.value FROM entity AS e1, relation, entity AS e2 
WHERE ((e1.value = 'COMMONVALUE') 
AND ((e2.value = 'NOTEXISTVALUE1') OR (e2.value = 'NOTEXISTVALUE2'))) 
AND ((e2.rel_id = relation.id) AND (e1.rel_id = relation.id));

SELECT e1.value FROM entity AS e1, relation, entity AS e2 
WHERE (('x' || e1.value = 'x' || 'COMMONVALUE') 
AND (e2.value in ('NOTEXISTVALUE1', 'NOTEXISTVALUE2')))
AND ((e2.rel_id = relation.id) AND (e1.rel_id = relation.id));

Original comment by thomas.t...@gmail.com on 5 Dec 2009 at 2:54

GoogleCodeExporter commented 9 years ago
Got it, thanks!

Unfortunately, we can't hack the SQL SELECT query because it is generated by
the JPA provider. Is there some workaround that can be executed as a separate
query during DB initialization?

Here's what I got from optimizer instrumentation:

Computing cost for plan: 
[E1, PUBLIC.RELATION, E2]
 cost before adjusting = 40.0
 cost after adjusting = 38.8
org.h2.table.PlanItem@12940b3 = 38.8
 cost before adjusting = 30.0
 cost after adjusting = 29.7
org.h2.table.PlanItem@156b6b9 = 29.7
 cost before adjusting = 40.0
 cost after adjusting = 39.6
org.h2.table.PlanItem@1f66cff = 39.6
[E1, PUBLIC.RELATION, E2] = 49607.515999999996

Computing cost for plan: 
[E1, E2, PUBLIC.RELATION]
 cost before adjusting = 40.0
 cost after adjusting = 38.8
org.h2.table.PlanItem@16de49c = 38.8
 cost before adjusting = 40.0
 cost after adjusting = 39.4
org.h2.table.PlanItem@1bbf1ca = 39.4
 cost before adjusting = 30.0
 cost after adjusting = 29.8
org.h2.table.PlanItem@1ff0dde = 29.8
[E1, E2, PUBLIC.RELATION] = 49523.935999999994

The first plan is very fast and the second is very slow (as described in the
original issue). The cost difference is very small, though. What can be done
to bias the cost in favor of the first plan?

Original comment by aleksey....@gmail.com on 5 Dec 2009 at 3:05

GoogleCodeExporter commented 9 years ago
> Is there some workaround 
No, I'm afraid not. You can't use a different selectivity for the column,
because it's actually the same column...

Original comment by thomas.t...@gmail.com on 6 Dec 2009 at 6:51

GoogleCodeExporter commented 9 years ago
So the optimizer can't get the information that rel_id is mostly null? If it
did, then plan [E1, PUBLIC.RELATION, E2] would cost a lot less than
[E1, E2, PUBLIC.RELATION].

Original comment by aleksey....@gmail.com on 6 Dec 2009 at 7:29

GoogleCodeExporter commented 9 years ago
Hi,

> So the optimizer can't get the information that rel_id is mostly null?

Yes, the optimizer does know that (the selectivity is 1).

> If it did, then plan [E1, PUBLIC.RELATION, E2] would cost a lot less than
> [E1, E2, PUBLIC.RELATION].

Actually, I think the problem is not the query plan, but that null should be
ignored for the index lookup... Example:

drop table test;
create table test(id int) as select x from system_range(1, 10);
insert into test select null from system_range(1, 2000);
create index idx_id on test(id);
analyze;
select * from test t1, test t2 where t1.id = t2.id;

The query takes 400 ms. It is using an index lookup, which is OK. But it
should ignore the rows where t1.id is null, and it doesn't currently do that:
it looks like it is matching all t1.id=null rows against all rows where
t2.id=null, and filtering the rows later (in SQL, NULL=x is NULL). It should
filter those rows immediately.
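
A minimal sketch of the intended behavior at the SQL level: with the NULL rows excluded explicitly, the lookup only has to match the ten real ids. Whether adding the predicates by hand actually helps on 1.2.123 is an assumption, not something verified in this thread:

```sql
-- Same tables as the example above; the IS NOT NULL predicates spell out
-- the filtering the engine should apply implicitly during the index lookup
-- (NULL = NULL is never true, so NULL rows can never produce a match).
SELECT * FROM test t1, test t2
WHERE t1.id = t2.id
  AND t1.id IS NOT NULL
  AND t2.id IS NOT NULL;
```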

I will have a look.

Regards,
Thomas

Original comment by thomas.t...@gmail.com on 8 Dec 2009 at 8:36

GoogleCodeExporter commented 9 years ago
This problem should be fixed in version 1.2.126

Original comment by thomas.t...@gmail.com on 18 Dec 2009 at 9:18

GoogleCodeExporter commented 9 years ago
Verified in production.
Degradation is gone, thanks!

Original comment by aleksey....@gmail.com on 18 Dec 2009 at 11:19

GoogleCodeExporter commented 9 years ago
I ran the above test and it again took half a second. I tried it with
20000 NULLs and it took nearly one minute. It looks like a regression.

Original comment by Maaarti...@gmail.com on 5 Apr 2011 at 10:14