stashapp / stash

An organizer for your porn, written in Go. Documentation: https://docs.stashapp.cc
https://stashapp.cc/
GNU Affero General Public License v3.0

[RFC] Info Request | Scene matching: understanding how stash decides if a scene matches or not #4259

Open Dark-Obsidian opened 10 months ago

Dark-Obsidian commented 10 months ago

Scope

I am finding that a number of my scenes don't match when scraping in Scene Tagger, but do show up when searching. However, when I review the match found by search, it shows that multiple phashes and oshashes matched the scene. So why did this not match when scraping?

Examples

Here is a recent example from Scene Tagger... image

... given these multiple match points, I am curious why Stash didn't match the scene when scraping and only matched it when searching.

Request / Discussion topic

Reference Reading

None at this time.

Dark-Obsidian commented 10 months ago

Search can also be misleading / incorrect...

On an adjacent topic, I am also a bit perplexed by "Search" sometimes...

Leading to wasted time...

Ultimately...

When I search for Dangerous Games...


Appendix

Search results -- pasted here to avoid messing up flow with large image in middle... Dangerous Games

Flashy78 commented 10 months ago

I believe StashDB matches phashes within a distance of 8, so your scene should have been found. If you Scrape by Fragment (not Scrape All), does it work?
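To make "distance of 8" concrete, here is a rough sketch of how a phash comparison of that kind usually works: the two 64-bit hashes are XORed and the differing bits are counted (the Hamming distance). This is only an illustration, not StashDB's actual code. The names `parsePhash`, `hammingDistance`, `phashMatches` and `maxDistance` are made up for the example, the threshold of 8 is just the value I believe is used, and the hash strings are invented.

```go
package main

import (
	"fmt"
	"math/bits"
	"strconv"
)

// maxDistance is the assumed threshold from the comment above; it is not
// taken from the StashDB source.
const maxDistance = 8

// parsePhash converts a 16-character hex string into a 64-bit hash value.
func parsePhash(hex string) (uint64, error) {
	return strconv.ParseUint(hex, 16, 64)
}

// hammingDistance counts the bits that differ between two 64-bit hashes.
func hammingDistance(a, b uint64) int {
	return bits.OnesCount64(a ^ b)
}

// phashMatches reports whether two hashes are within the assumed threshold.
func phashMatches(a, b uint64) bool {
	return hammingDistance(a, b) <= maxDistance
}

func main() {
	// Both hash strings are invented for illustration.
	local, err := parsePhash("e1c3c78f0f1e3c78")
	if err != nil {
		panic(err)
	}
	remote, err := parsePhash("e1c3c78f0f1e3c7b")
	if err != nil {
		panic(err)
	}
	fmt.Println(hammingDistance(local, remote), phashMatches(local, remote))
}
```

With a check like this, two hashes that differ in only a few bits still count as the same scene, which is why a near-identical phash should normally be found when scraping.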

Text search on StashDB (which is a different GitHub repo over at Stashbox) is not perfect. There's an existing PR to allow you to use quotation marks to search multiple words as a phrase instead of as individual words, which could help in your case. But there's also a 10-result limit to keep things performant, and those two words are very common.

I'd suggest jumping on Discord where folks can help you in real time to debug your issue.