huggingface / text-generation-inference

Large Language Model Text Generation Inference
http://hf.co/docs/text-generation-inference
Apache License 2.0
8.36k stars 948 forks source link

Inference Explainability/Suppression (AtMan) #1665

Closed stefanobranco closed 2 months ago

stefanobranco commented 3 months ago

Feature request

Flags for inference enrich the output with explainability information or suppress specific input token/embedding spaces, as described here.

Motivation

I'm not sure if this is out of scope of TGI, but I figured I might bring it up at least.

Explainability and truthfulness is a very important topic for us and our users, and after watchin GTC and reading this paper (github), I believe there's potential here to significant work here that could be done.

The explainability and suppression methods proposed in that paper would be extremely useful for us, though I can understand if that is too niche.

Your contribution

I'd be willing to put in groundwork if that helps, though I also have to admit that I'm out of my depth at that level of manipulating attention, so I would definitely need some guidance if I were to contribute.

github-actions[bot] commented 2 months ago

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.