convolution: Passing non-contiguous 'out' array results in corrupt data

stuarteberg commented 8 years ago

Is it permitted to call vector-valued convolutions (e.g. hessianOfGaussianEigenvalues) with a non-contiguous out parameter? For example:

# output_zyxc[..., 2:5] is non-contiguous
vigra.filters.hessianOfGaussianEigenvalues(input_zyx, 1.0, out=output_zyxc[..., 2:5])

That doesn't throw an exception, but it doesn't produce the correct results. Instead, you get an image that has obvious artifacts from incorrect pointer math:

output-slice

Furthermore, the other slices of the output array (which were supposed to remain untouched) get corrupted:

corrupted-slice

Here's a complete test case. This example uses hessianOfGaussianEigenvalues(), but I can also reproduce the problem with structureTensorEigenValues(). The problem is NOT reproducible with single-channel outputs, such as laplacianOfGaussian().

import vigra
import numpy as np

input_zyx = np.random.random((100,100,100)).astype(np.float32)
input_zyx = vigra.taggedView( input_zyx, 'zyx' )

expected_zyxc = vigra.filters.hessianOfGaussianEigenvalues(input_zyx, 1.0)
assert expected_zyxc.shape == (100,100,100,3)

# contiguous output
output_zyxc = np.zeros( (100,100,100,3), dtype=np.float32 )
output_zyxc = vigra.taggedView(output_zyxc, 'zyxc')
vigra.filters.hessianOfGaussianEigenvalues(input_zyx, 1.0, out=output_zyxc[..., 0:3])
assert (output_zyxc[..., 0:3] == expected_zyxc).all()

# non-contiguous output (sliced view)
output_zyxc = np.zeros( (100,100,100,10), dtype=np.float32 )
output_zyxc = vigra.taggedView(output_zyxc, 'zyxc')
vigra.filters.hessianOfGaussianEigenvalues(input_zyx, 1.0, out=output_zyxc[..., 0:3])

## Sample the output for viewing
#vigra.impex.writeImage(output_zyxc[0, ..., 0], '/tmp/output-slice.png')
#vigra.impex.writeImage(output_zyxc[0, ..., 9], '/tmp/corrupted-slice.png')

# Both of these assertions will fail.
assert (output_zyxc[..., 0:3] == expected_zyxc).all(), "Output is wrong."
assert (output_zyxc[..., 3:] == 0).all(), "Data beyond the output view was corrupted."

ukoethe commented 8 years ago

Thanks for reporting. This code

output_zyxc = np.zeros( (100,100,100,10), dtype=np.float32 )
output_zyxc = vigra.taggedView(output_zyxc, 'zyxc')
vigra.filters.hessianOfGaussianEigenvalues(input_zyx, 1.0, out=output_zyxc[..., 0:3])

should throw an exception. I meant to check for this case in the converters, but apparently forgot a condition.

The reason it doesn't work is this: when one maps a Python array with explicit channel axis onto a vigra array with value type TinyVector<float, 3>, the x-stride must be divided by 3. This is only legal if the original stride was a multiple of 3, which is not true here because the original length of the channel axis (and therefore the x-stride) is 10.

BTW, the first two lines of your code could be written compactly as

output_zyxc = vigra.Volume((100,100,100,10), order='C')

ukoethe commented 8 years ago

I uploaded PR #380 which applies two changes:

add the missing check, so that incompatible strides cause an error message
add a work-around in vigranumpy/__init__.py that transparantly uses a temporary array if outhas incompatible strides.

ukoethe commented 8 years ago

This bug raises an interesting general question: vigra measures strides in units of sizeof(T). Alternatively, one can measure strides in bytes (which numpy does). In this case, your attempted slicing would have worked, but there are some downsides as well:

Byte-based strides may be suprising for users who don't read documentation.
Subtle mistakes can happen, e.g. unaligned addresses for floats and doubles (most CPUs don't like them) or arrays with overlapping elements.

What do you think?

hmeine commented 8 years ago

Just another data point: Qt (QImage) also measures strides in bytes.

ukoethe commented 8 years ago

And what is your own opinion, @hmeine?

hmeine commented 8 years ago

I don't have a strong opinion either way (which is also the TL;DR summary), but I experienced difficulties (probably in the context of qimage2ndarray) when working with different systems at the same time that require conversion between byte and pixel strides. In particular, it may lead to runtime errors when trying to convert byte (Qt, numpy) to pixel strides (vigra) if the former are not compatible (divisible). That's probably an unlikely case, but requires extra code for checking and handling this condition anyhow. (Very much like what's discussed here.) I don't know if there's any performance difference. (Probably not anymore nowadays, and/or not relevant.) In the absence of evidence, I would vote against premature optimization and for byte strides. On the other hand, we don't implement vigra from scratch, so we're talking about a behavior change. That's probably the strongest point, which suggests not to change it.

stuarteberg commented 8 years ago

For my needs, the error-check you've already proposed is good enough. (Thanks, BTW.)

I don't quite follow the explanation of why switching to byte-based strides is more powerful than sizeof(T)-based strides. But in any case, it sounds like such a change is likely to introduce subtle bugs. The vigra test suite is pretty good, but it's hard to cover all the potential problems such a change could introduce (as this issue demonstrates).

ukoethe commented 8 years ago

I don't quite follow the explanation of why switching to byte-based strides is more powerful than sizeof(T)-based strides.

Using, sizeof(T)-based strides the best I can do is issue an error message and/or transparently use an auxiliary array with matching strides. Using byte-based strides, your use case would Just Work (the x-axis stride would be 40, regardless of the channels of your view output[...,:3] to be interpreted as a three floats or a TinyVector<float, 3>).

ukoethe commented 8 years ago

The question if the strides should change refers mainly to vigra 2, should this ever materialize.

bluescarni commented 8 years ago

It seems to me that sizeof strides are safer on the C++ side from issues of undefined behaviour (e.g., unaligned memory access). On the other hand, it sounds like bytes-based strides might make interoperability with NumPy easier. Maybe some type of checked bytes strides could be a good compromise?

ukoethe / vigra

convolution: Passing non-contiguous 'out' array results in corrupt data #379