Closed benwoo1110 closed 1 month ago
Can also confirm that this is an issue on the web version.
interesting! @mertalev I assume the fix for this is to make all search queries as lower cases?
No, this should be handled in the tokenizer used in machine learning. Different models want different inputs.
I guess Immich can do some form of input sanitisation/validation based on the ml model used. E.g. If we know this mode only support lowercased characters we can just lowercase it before passing to the tokeniser. If it's a completely unsupported character by the model e.g. emoji or chinese, Immich should throw an error instead of letting the search return useless results.
This should be possible if the models used have a documentation of the supported characters as their input.
The bug
I switched from the default
ViT-B-32__openai
model to the newViT-B-16-SigLIP__webli
and re-ran all the smart search. I noticed the results when search string has uppercase letters will return completely wrong results, as can be seen in the screenshot below:The OS that Immich Server is running on
Raspberry Pi OS Lite (Bookworm)
Version of Immich Server
v1.112.1
Version of Immich Mobile App
v1.112.1
Platform with the issue
Your docker-compose.yml content
Your .env content
Reproduction steps
Relevant log output
No response
Additional information
No response