ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.14542
28
60

Evaluating Large Language Models on Controlled Generation Tasks

23 October 2023
Jiao Sun
Yufei Tian
Wangchunshu Zhou
Nan Xu
Qian Hu
Rahul Gupta
John Wieting
Nanyun Peng
Xuezhe Ma
    LRM
    ELM
ArXivPDFHTML
Abstract

While recent studies have looked into the abilities of large language models in various benchmark tasks, including question generation, reading comprehension, multilingual and etc, there have been few studies looking into the controllability of large language models on generation tasks. We present an extensive analysis of various benchmarks including a sentence planning benchmark with different granularities. After comparing large language models against state-of-the-start finetuned smaller models, we present a spectrum showing large language models falling behind, are comparable, or exceed the ability of smaller models. We conclude that **large language models struggle at meeting fine-grained hard constraints**.

View on arXiv
Comments on this paper