Artoriuz / ArtCNN

Super-Resolution Convolutional Neural Networks as GLSL shaders for mpv
MIT License
109 stars 2 forks source link

_DS_CMP shaders aren't working? #21

Closed Trinity3e closed 2 months ago

Trinity3e commented 2 months ago

Here's what mpv says:

[vo/gpu-next/libplacebo] shader compile log (status=0): 0:67(12): warning: extension `GL_EXT_shader_explicit_arithmetic_types_float16' unsupported in compute shader [vo/gpu-next/libplacebo] 0:67(1): error: #extension directive is not allowed in the middle of a shader

Thanks in advance

Artoriuz commented 2 months ago

All three of them compile just fine here, maybe this is unsupported by your driver/GPU?

Jules-A commented 2 months ago

I haven't checked the logs yet but the CMP versions were still failing for me under DX11.

Trinity3e commented 2 months ago

The normal _CMP ones work fine, I will try again and play around with some configs.

LE: Huh, found the reason, it needs gpu-api=vulkan. It's very weird. Probably a bug turned feature somewhere which allows plain _CMP to work without vulkan but not _CMP_DS. I guess i'll close this but please mention this in the readme. I am on amdgpu.

Artoriuz commented 2 months ago

The shader gen code is the same so they should have the same issues... I'm currently generating the compute shaders again with Kasper's new changes so feel free to give it another go soon.

Jules-A commented 2 months ago

The shader gen code is the same so they should have the same issues... I'm currently generating the compute shaders again with Kasper's new changes so feel free to give it another go soon.

Compiles under DX11 now and speed is about the same as gather for C4F32 under DX11, under Vulkan it's slightly slower than previous version though.

EDIT: I checked why it failed on previous version in log:

[  16.309][e][vo/gpu-next/libplacebo] D3DCompile failed: Unspecified error (E_FAIL, 0x80004005)
[  16.309][e][vo/gpu-next/libplacebo] error X4586: The total amount of group shared memory (41472 bytes) exceeds the cs_5_0 limit of 32768 bytes

Not the same error as OP.