issues
search
skywalker023
/
fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
https://aclanthology.org/2023.emnlp-main.890/
MIT License
50
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Dataset size does not match what is reported in the paper
#4
dniku
closed
6 months ago
1
Some factQA questions have identical correct_answer and wrong_answer
#3
dniku
opened
6 months ago
7
Did you try few-shot prompting GPT-4?
#2
lukasberglund
opened
8 months ago
1
Dataset/code release?
#1
Jiayi-Pan
closed
10 months ago
1