Figure 1: Illustration of the GSM-Symbolic template creation process. This dataset serves as a tool to investigate the presumed reasoning capabilities of LLMs, enabling the design of controllable mathematical reasoning evaluations with more reliable metrics. Our results reveal that all state-of-th
Description
There's overlapping text here, and boxes don't appear in the right places. There's copius amounts of whitespace too.
(Optional:) Please add any files, screenshots, or other information here.
https://arxiv.org/html/2410.05229v1 figure 1
(Required) What is this issue most closely related to? Select one.
Figures
Internal issue ID
a822ee36-776d-4093-86e1-af17529af5d0
Paper URL
https://arxiv.org/html/2410.05229v1
Browser
Firefox/131.0
Device Type
Desktop