skywalker023 fantom issues - Githubissues

skywalker023 / fantom

👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"

https://aclanthology.org/2023.emnlp-main.890/

MIT License

50 stars 3 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Dataset size does not match what is reported in the paper

#4 dniku closed 6 months ago
1
Some factQA questions have identical correct_answer and wrong_answer

#3 dniku opened 6 months ago
7
Did you try few-shot prompting GPT-4?

#2 lukasberglund opened 8 months ago
1
Dataset/code release?

#1 Jiayi-Pan closed 10 months ago
1