I am a beginner in interpretability and would like some quantitative metrics assessing the "interpretability." I found the "localization metric" in your paper really interesting but unfortunately could find the definition for it.
I am wondering if you could point me to some repositories/papers, or suggest some other promising metrics.
Thank you for your nice work!
I am a beginner in interpretability and would like some quantitative metrics assessing the "interpretability." I found the "localization metric" in your paper really interesting but unfortunately could find the definition for it.
I am wondering if you could point me to some repositories/papers, or suggest some other promising metrics.
Thank you so much!