panuthept / IRIS

Improving Robustness of LLMs on Input Variations by Mitigating Spurious Intermediate States
Apache License 2.0
8 stars 3 forks source link

72 debiasing add counterfactual inference with oracle bias model #89

Closed panuthept closed 1 month ago