issues
search
asaparov
/
prontoqa
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
Apache License 2.0
119
stars
13
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to evaluate the LLM's Answer with the Gold Answer?
#7
Aiolus-X
opened
3 weeks ago
2
Please can you provide a PRONTOQA dataset after the bugfix?
#6
alybel
closed
3 weeks ago
2
Which file contains label data for QA.
#5
librahu
opened
2 months ago
1
analyze_results.analyze_log producing incorrect statistics
#4
horenbergerb
closed
1 year ago
7
Raising the difficulty even more?
#3
alexconstant9108
closed
11 months ago
23
infinite loop when generating false ontologies with hop greater or equal to 4
#2
shenglih
closed
1 year ago
1
why are true ontologies generated without distractors?
#1
shenglih
closed
1 year ago
1