Closed deus0ww closed 1 year ago
edit 1: I cleanup the code a bit. On an M1 Mac, this version speeds up the last shader pass by about 20-25%. Also works well with MimeBilateral. edit 2: swtiched CHROMA* to HOOKED*
Can gather or compute be used for the downscaling passes also?
Currently I'm doing this for speed: https://pastebin.com/raw/si8RdrED . Basically only calculating every 2nd pixel but scaling to Luma. For whatever reason scaling to Luma on the first pass seems to make the biggest difference, even for width (which is currently set to just Chroma in master). NOTE: I'm not actually using the upscaling code at all since there's still too many issues with reds and it's still too heavy for my use that I need to lower luma scaling. For whatever reason this causes the inbuilt chroma scalers to perform better but I'm still unsure why.
EDIT: Okay nvm, looks like interpolation wouldn't really help in this situation, it was just co-incidence... Lol okay... Even doing something like this is way better than not doing any: https://pastebin.com/raw/gSSBnkaJ
Possibly something like this: