Snippet:
Structured generation, the process of producing content in standardized formats like JSON and XML, is widely utilized in real-world applications to extract key output information from large language models (LLMs). This study investigates whether such constraints on generation space impact LLMs' abilities, including reasoning and domain knowledge comprehension. Specifically, we evaluate LLMs' performance when restricted to adhere to structured formats versus generating free-form responses across various common tasks. Surprisingly, we observe a significant decline in LLMs' reasoning abilities under format restrictions. Furthermore, we find that stricter format constraints generally lead to greater performance degradation in reasoning tasks.
Comments: 18 pages
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2408.02442 [cs.CL] (or arXiv:2408.02442v1 [cs.CL] for this version)
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
https://doi.org/10.48550/arXiv.2408.02442