refuel-ai / autolabel

Label, clean and enrich text datasets with LLMs.
https://docs.refuel.ai/
MIT License

Filter multilabel confidence keys for llm labels #883

Closed Abhinav-Naikawadi closed 1 month ago

Abhinav-Naikawadi commented 1 month ago

Pull Review Summary

Description

Filter multilabel confidence keys for LLM labels. Currently, multilabel confidence can be computed for tokens that were filtered out while parsing the LLM response. This change excludes confidence entries for any label that the parsing logic removed.
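A minimal sketch of the kind of filtering described above, for illustration only. It does not reproduce the code in this PR: the function name `filter_confidence_keys`, the dict-of-scores shape, and the sample labels are all assumptions.

```python
from typing import Dict, List


def filter_confidence_keys(
    confidence: Dict[str, float], kept_labels: List[str]
) -> Dict[str, float]:
    """Keep confidence scores only for labels that survived parsing.

    Hypothetical helper: the name and data shapes are assumed for this sketch.
    """
    kept = set(kept_labels)
    return {label: score for label, score in confidence.items() if label in kept}


# Assumed example: the LLM emitted three labels, but parsing dropped
# "unknown" because it is not part of the task's label set.
raw_confidence = {"sports": 0.91, "politics": 0.64, "unknown": 0.12}
parsed_labels = ["sports", "politics"]

filtered = filter_confidence_keys(raw_confidence, parsed_labels)
print(filtered)
```

Filtering by the parsed label set, rather than by the raw tokens, keeps the confidence keys aligned with the labels that are actually returned.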

Type of change

Tests

Tested confidence key filtering logic