Buffers: Prevent bug-prone iteration, speed up accidentally-quadratic iteration

BenWiederhake commented 11 months ago

This PR:

Raises a new exception when the caller tries to iterate over a nested buffer. This causes the caller to only see the last part of a larger dataset, and is usually a bug. When I first ran into this issue, I first thought this was a bug with libosmium, since there was data missing. Let's just disable these kinds of accesses; none of the examples or code attempt this kind of access anyway.
Speeds up Buffer::get_last_nested(). This method used to be accidentally-quadratic: Each invocation takes a linear amount of time in the depth, and needs to be invoked a linear amount of times in order to iterate all buffers. This means an overall quadratic running time, even though one would expect a linear running time. I also added a test that is very sensitive to this (4s versus <0.01s). When iterating over the entire planet, I observe a speedup from 163.183 s to 160.942 s. (And an observed stddev of 0.836 s, so this result has 2.68 sigma. Physics scientists would laugh at that, but it's good enough proof for me, in this case.)

joto commented 11 months ago

Where did you encounter those nested buffers? The Reader class should only ever return unnested buffers from read(), so the user should never see them?

BenWiederhake commented 11 months ago

Nested buffers occur internally, so this affects the running time even if the user's code doesn't have direct access to them.

Checking for errors makes even more sense in this case. The user shouldn't be able to trigger this issue even if they wanted to, and the checks make sure that no data is lost internally (e.g. by new code).

I personally encountered these because osmium::io::detail::PBFDataBlobDecoder::operator() returns deeply nested buffers, and that's what the random access code uses / will use.

joto commented 11 months ago

There are two issues here which we should not mix up:

Some functions can now throw which couldn't before. And it is somewhat unexpected that a simple function like begin() can throw. So I am not too happy about that. And because this is internal to libosmium anyway, an assert could be a better solution. But this is something I have to look into.
The performance issue. I can not remember why I implemented it the way I did. Probably because it was the simplest way of doing things. As this code is only run on I/O which does a lot more stuff which will swamp any efficiencies gained here. The 1-2% improvement you measured doesn't seem to me to be a big deal. On the other hand the changs are not huge. I am wondering myself now why I coulnd't implement the linked list the other way around so that the next buffer is always at the beginning of the list and not at the end.

In any case these changes are orthogonal to all the other work you are doing, right?

BenWiederhake commented 11 months ago

I take that as a request to change the exceptions into asserts. Sure, I'll do that.
Yes, these changes are orthogonal. I expected the performance impact to be significantly higher when I started. The TEST_CASE("Can quickly handle deeply nested buffer") in test_buffer_nested.cpp proves that this change does speed things up, so I believe it's worth to improve the code in this way.

BenWiederhake commented 11 months ago

Changes since last push:

Changed the exceptions into asserts, as requested.

BenWiederhake commented 11 months ago

Changes since last push summary:

Relaxed a test from buffer.written() == 1360 to buffer.written() >= 1360 && buffer.written() <= 1440, since apparently Windows writes 8 bytes more per node. This used to break windows-minimal-2019 and windows-minimal-2022.
Remove a few forgotten printf statements. Whoops!

BenWiederhake commented 11 months ago

CI failure seems to be a flake: It fails long before the code of this PR becomes relevant, specifically during package installation.

BenWiederhake commented 11 months ago

Closing because you don't seem to accept any PRs at this time.

osmcode / libosmium

Buffers: Prevent bug-prone iteration, speed up accidentally-quadratic iteration #369