HEmile / iswc-challenge

MIT License
10 stars 1 forks source link

Test fact checking #12

Open HEmile opened 2 years ago

HEmile commented 2 years ago

Fact checking basically just converts all predictions into facts and asks the language model whether this fact is true.

Some interesting failure cases

Detection of errors:

image image image

To fix:

HEmile commented 2 years ago
image

This is not in the dev set as an answer!

image

Neither is this

HEmile commented 2 years ago
image

It doesn't work well for the chemical compounds. It'd filter out correct results. But this could maybe be improved with a better prompt.

HEmile commented 2 years ago

PersonLanguage:

image image image image

Correct

HEmile commented 2 years ago

Occupation:

image image image

Incorrect

image image image image image

correct

HEmile commented 2 years ago

PersonInstrument Some good results, but

image

is incorrect

HEmile commented 2 years ago

Employer:

image image image image image

BUT ALSO

image

which is incorrect

correct

HEmile commented 2 years ago

Easiest way to implement this is as a separate post processing script on the generated results. Makes it easier to compare

thiviyanT commented 2 years ago

Very interesting indeed! I think we should include this in the paper.