image-rs / image

Encoding and decoding images in Rust
Apache License 2.0
5.01k stars 619 forks source link

Tracking issue: Remove libwebp dependency #1984

Closed fintelia closed 8 months ago

fintelia commented 1 year ago

Now that a pure-Rust encoder implementation is being added in #1978, we're planning on removing usage of libwebp which is written in C.

Our encoder is very fast, but lacks support for writing lossy files and the output files it produces generally aren't quite as small. If you are relying on one of those features, please comment on this issue to explain your use case!

Note: regardless of changes to the image crate, the webp crate will continue to provide Rust bindings to libwebp. Switching should only be a couple lines of code:

fn encode_lossy_webp(img: DynamicImage, quality: f32) -> Vec<u8> {
    let img = img.to_rgba8();
    let (width, height) = img.dimensions();
    webp::Encoder::new(&*img, webp::PixelLayout::Rgba, width, height)
        .encode(quality)
        .to_vec()
}
djc commented 10 months ago

We (as the original contributors of WebP encoding support) would be disappointed to see a removal of lossy WebP encoding support. We're using lossy WebP support exactly because we feel it provides a good trade-off between image size and quality, whereas lossless WebP images do not seem to do as well on that particular trade-off. Maybe the C-based WebP encoding support could be retained for lossy encoding only?

randomairborne commented 10 months ago

I was also using lossy webp encoding, for the smaller encoding size compared to JPEG.

fintelia commented 10 months ago

Just added a code sample to the original post showing how to use the do lossy encoding by calling into the webp crate. I'd also be curious to hear whether anyone has experimented with AVIF and/or mozjpeg? Sources online claim that those options achieve as good or better size and quality to WebP

djc commented 10 months ago

We were looking at AVIF earlier, but it looks like browser support might not be quite where we'd want it to be.

https://caniuse.com/avif

fintelia commented 10 months ago

Oh, thanks for sharing that! Looks like the last holdout is Edge, which is getting support in version 121 and should be released in about two weeks (then it's just however long it takes for people to upgrade from older versions)

omid commented 10 months ago

We also use lossy webp. Same reason as others said.

bakhtiyarneyman commented 10 months ago

We use lossy webp because it allows pages to load much faster.

timleslie commented 9 months ago

We use lossy webp as a carefully considered tradeoff between image size and image quality (in a context where this trade-off is of significant financial importance).

schmitch commented 9 months ago

we also use the lossy encoding since we create preview pictures from pdf files and we do not need high image quality and more like "good enough" quality with a small size

fintelia commented 9 months ago

Thanks for the responses! It would be helpful to know whether the lossy encoding must be part of the image crate? If we go ahead with the planned removal, is there a reason the code snippet above would not be sufficient?

randomairborne commented 9 months ago

I think it would confuse people to have a partial implementation in image-rs. Not sure what others might think, though.

Shnatsel commented 9 months ago

I see two downsides to having the C libwebp around:

  1. The risk of memory safety vulnerabilities due to inherently unsafe C code
  2. More complicated build process (incl. cross-compilation)

I am not particularly concerned about CVEs in the encoding step. Historically it has proven much easier to get right than decoding. The hassle of dealing with C dependencies, however, is very real, and bites you before you even touch WebP - it blocks access to installing the image crate in general.

I think hiding libwebp dependency behind a non-default feature flag is sufficient. I figure exposing the lossy encoding interface through image might make it a little more convenient, but then there's the question of how to expose two different lossless encoders in a way that isn't a footgun.

randomairborne commented 9 months ago

Unfortunately, as long as rust libwebp is incomplete, keeping libwebp around is kind of unavoidable. It at the very least needs to be documented that the rust encode function is always lossless.

neckaros commented 9 months ago

Just added a code sample to the original post showing how to use the do lossy encoding by calling into the webp crate. I'd also be curious to hear whether anyone has experimented with AVIF and/or mozjpeg? Sources online claim that those options achieve as good or better size and quality to WebP

Doesn't AVIF also depends on C library libdav1d?

fintelia commented 9 months ago

Doesn't AVIF also depends on C library libdav1d?

AVIF encoding uses the Rust-based ravif library. Decoding does however require libdav1d.

fintelia commented 8 months ago

I'm going to go ahead and close this now that 0.25.0 is released.

There is a place in the Rust ecosystem for a crate that simply provides safe wrappers around the C/C++ reference implementations of all the image formats. However, that has never been the goal of this crate or the broader image-rs org.

From the discussion, it sounds like a pure-Rust lossy WebP implementation wouldn't actually be a suitable replacement in current use cases unless it were able to achieve compression ratios on-par or better than libwebp. But perfect feature parity isn't the criteria we use for other codecs, and I don't think it can be the standard we use to decide when to drop a native dependency either

randomairborne commented 8 months ago

I would argue that we aren't really looking for "perfect feature parity", just something that allows the format to work as advertised- which is with high compression ratios.

fintelia commented 3 months ago

Also for the record, this sort of engagement is not OK:

[your crate is only a] partial implementation

rust libwebp is incomplete

[your crate crate doesn't] work as advertised

Technical disagreements are fine. But repeatedly putting down the work of org members to bully us into reverting a change crosses a line.

schmitch commented 3 months ago

@fintelia well to be fair:

[your crate is only a] partial implementation

is actually the correct term form the webp handling, even if its a little bit harsh. Actually I know that removing code is always a cool thing, since it removes an enormous maintance burden.

But in case of the webp encoder at least provide one of the two things:

btw. why you get so many complaints is because of that: your library is one of the best (if not the best) image libraries in the wild and is loved by many. you will probably more often find yourself in a tight spot, but we love this library regardless

Shnatsel commented 3 months ago

provide documentation how to read the webp via libweb

The WebP decoder is feature-complete and the decoding result is bit-identical to libwebp. I don't see why we wouldn't have people just use it directly.

I agree the limitations of the encoder should be documented. The doc comments right now don't mention any of the limitations, and that can easily be improved. We could even link to the example code showing how to encode WebP with libwebp.

Shnatsel commented 3 months ago

I've opened #2314 to document the trade-offs involved more clearly, and link to example code for encoding with libwebp.

fintelia commented 3 months ago

is actually the correct term form the webp handling, even if its a little bit harsh.

To be clear, that's how this sort of bullying works. Start with "partial implementation" which is harsh but factual. Then escalate to "incomplete", which isn't really true (all planned functionality had been implemented), but isn't worth arguing about. Also get the library name wrong in the same comment. Then when neither of those provoked the desired response, escalate again with the point about just wanting the encoder to "work as advertised". This sort of thing is extremely common in open source, which makes it really easy to go unnoticed. But it doesn't have to be something that's tolerated