lucagobbi commented 2 weeks ago

Description

Currently the cat.classify() method in StrayCat is looping over labels and checking if every label is a substring of the response (was this logic intentional?) which is the string classified by the LLM. This could lead to bugs if one defines his/her labels as substrings of one another. For instance, if you define your labels as: ['positive', 'not_so_positive', 'negative'], even if the LLM classifies correctly the sentence as not_so_positive the method will return positive, since the label is a substring of the response.

I've also optimized type hinting and type checking for better readability.

Type of change

[X] Bug fix (non-breaking change which fixes an issue)
[ ] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
[ ] This change requires a documentation update

Checklist:

[X] My code follows the style guidelines of this project
[X] I have performed a self-review of my own code
[X] I have commented my code, particularly in hard-to-understand areas

pieroit commented 2 weeks ago

Agree on the correction, but now it could happen that the LLM writes more chars then needed (i.e. adding quotes or punctuation) and it will not work :/

What about using utils.levhenstein_distance?

lucagobbi commented 2 weeks ago

Sure, I agree on a fuzzy system like the one you proposed. Plus, we should encourage the use of non similar labels to avoid mistakes like these (via documentation). I find these cat methods really useful, they deserve more space in the docs. Will update the PR. Thanks Piero!!!

lucagobbi commented 2 weeks ago

Ushh in the previous version we were not forcing not classified responses, since the method was returning None if no label was matched. With levhenstein_distance we are forcing the nearest label to be returned even if the LLM answer with an outlier like:

labels: ['positive', 'not_so_positive', 'negative'] response: "none" result: positive

It's a strong stance to force it. What do you think?

pieroit commented 1 week ago

@lucagobbi maybe nearest label with a threshold? Let's do some experiment and take a direction, thanks for this!

lucagobbi commented 1 week ago

@lucagobbi maybe nearest label with a threshold? Let's do some experiment and take a direction, thanks for this!

I havent added the levhenstein method yet, need to reopen this PR if we want to include it

cheshire-cat-ai / core

Fix classify method in stray #859

Description

Type of change

Checklist: