1909.05858
CTRL: A Conditional Transformer Language Model for Controllable Generation
11 September 2019
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
Papers citing
"CTRL: A Conditional Transformer Language Model for Controllable Generation"
50 / 761 papers shown
Title
LZ Penalty: An information-theoretic repetition penalty for autoregressive language models
Antonio A. Ginart
Naveen Kodali
J. Lee
Caiming Xiong
S.
John Emmons
24
0
0
28 Apr 2025
When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Rei Higuchi
Ryotaro Kawata
Naoki Nishikawa
Kazusato Oko
Shoichiro Yamaguchi
Sosuke Kobayashi
Seiya Tokui
K. Hayashi
Daisuke Okanohara
Taiji Suzuki
AI4CE
35
0
0
24 Apr 2025
Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
Mingwei Liu
Juntao Li
Ying Wang
Xueying Du
Zuoyu Ou
...
Zhao Wei
Y. Xu
Fangming Zou
Xin Peng
Yiling Lou
38
0
0
17 Apr 2025
Teaching Large Language Models to Reason through Learning and Forgetting
Tianwei Ni
Allen Nie
Sapana Chaudhary
Yao Liu
Huzefa Rangwala
Rasool Fakoor
ReLM
CLL
LRM
95
0
0
15 Apr 2025
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
57
1
0
15 Apr 2025
SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning
Prabhat Pandey
R. Swaminathan
K V Vijay Girish
Arunasish Sen
Jian Xie
Grant P. Strimel
Andreas Schwarz
90
0
0
12 Apr 2025
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
Aden Haussmann
LMTD
52
0
0
29 Mar 2025
GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization
Zhouhong Gu
Xingzhou Chen
Xiaoran Shi
Tao Wang
Suhang Zheng
Tianyu Li
Hongwei Feng
Yanghua Xiao
67
0
0
26 Mar 2025
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa
Sarah Bentley
Jon M. Kleinberg
S. Mullainathan
38
0
0
21 Mar 2025
LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates
Ying Shen
Lifu Huang
47
1
0
20 Mar 2025
Benchmarking Large Language Models for Handwritten Text Recognition
Giorgia Crosilla
Lukas Klic
Giovanni Colavizza
38
0
0
19 Mar 2025
Palette of Language Models: A Solver for Controlled Text Generation
Zhe Yang
Yi Huang
Yaqin Chen
Xiaoting Wu
Junlan Feng
Chao Deng
44
0
0
14 Mar 2025
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning
Yuan Jiang
Yujian Zhang
Liang Lu
Christoph Treude
Xiaohong Su
Shan Huang
Tiantian Wang
ALM
61
0
0
12 Mar 2025
Beyond One-Size-Fits-All Summarization: Customizing Summaries for Diverse Users
Mehmet Samet Duran
Tevfik Aytekin
50
0
0
10 Mar 2025
Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models
Anar Yeginbergen
Maite Oronoz
Rodrigo Agerri
45
0
0
07 Mar 2025
Learning from Noisy Labels with Contrastive Co-Transformer
Yan Han
S. Roy
Mehrtash Harandi
L. Petersson
NoLa
61
0
0
04 Mar 2025
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
Tong Wu
Junzhe Shen
Zixia Jia
Y. Wang
Zilong Zheng
80
0
0
26 Feb 2025
Synthetic Text Generation for Training Large Language Models via Gradient Matching
Dang Nguyen
Zeman Li
M. Bateni
Vahab Mirrokni
Meisam Razaviyayn
Baharan Mirzasoleiman
42
0
0
24 Feb 2025
Selective Prompt Anchoring for Code Generation
Yuan Tian
Tianyi Zhang
86
3
0
24 Feb 2025
Pastiche Novel Generation Creating: Fan Fiction You Love in Your Favorite Author's Style
Xueran Han
Yuhan Liu
Mingzhe Li
W. Liu
Sen Hu
Rui Yan
Zhiqiang Xu
Xiuying Chen
62
0
0
24 Feb 2025
Evolving Form and Function: Dual-Objective Optimization in Neural Symbolic Regression Networks
Amanda Bertschinger
James P. Bagrow
Joshua Bongard
76
1
0
24 Feb 2025
Towards Conditioning Clinical Text Generation for User Control
Osman Alperen Koras
Rabi Bahnan
Jens Kleesiek
Amin Dada
31
0
0
24 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
73
6
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
40
4
0
21 Feb 2025
Slamming: Training a Speech Language Model on One GPU in a Day
Gallil Maimon
Avishai Elmakies
Yossi Adi
38
3
0
19 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
129
0
0
10 Feb 2025
Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents
Ilia Karmanov
A. Deshmukh
Lukas Voegtle
Philipp Fischer
Kateryna Chumachenko
...
Jarno Seppänen
Jupinder Parmar
Joseph Jennings
Andrew Tao
Karan Sapra
68
0
0
06 Feb 2025
High-Fidelity Simultaneous Speech-To-Speech Translation
Tom Labiausse
Laurent Mazaré
Edouard Grave
P. Pérez
Alexandre Défossez
Neil Zeghidour
130
0
0
05 Feb 2025
Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives
Xinyao Ma
Rui Zhu
Zihao Wang
Jingwei Xiong
Qingyu Chen
Haixu Tang
L. Jean Camp
Lucila Ohno-Machado
LM&MA
44
0
0
12 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
52
3
0
08 Jan 2025
TARDiS: Text Augmentation for Refining Diversity and Separability
Kyungmin Kim
Sanghun Im
Gibaeg Kim
Heung-Seon Oh
VLM
26
0
0
06 Jan 2025
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
Huy-Hien Vu
Huy Anh Nguyen
Adithya V Ganesan
Swanie Juhng
O. Kjell
...
Margaret L. Kern
Ryan L. Boyd
L. Ungar
H. A. Schwartz
J. Eichstaedt
72
0
0
03 Jan 2025
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
Chenglei Shen
Xiao Zhang
Teng Shi
Changshuo Zhang
Guofu Xie
Jun Xu
63
5
0
03 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
68
1
0
03 Jan 2025
KITE-DDI: A Knowledge graph Integrated Transformer Model for accurately predicting Drug-Drug Interaction Events from Drug SMILES and Biomedical Knowledge Graph
Azwad Tamir
Jiann-Shiun Yuan
62
0
0
08 Dec 2024
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
Yen-Ju Lu
Jing Liu
Thomas Thebaud
Laureano Moro Velázquez
Ariya Rastrow
Najim Dehak
Jesus Villalba
74
1
0
05 Dec 2024
VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Yueqian Wang
Xiaojun Meng
Y. Wang
Jianxin Liang
Jiansheng Wei
Huishuai Zhang
Dongyan Zhao
VGen
83
8
0
27 Nov 2024
Bias in Large Language Models: Origin, Evaluation, and Mitigation
Yufei Guo
Muzhe Guo
Juntao Su
Zhou Yang
Mengqiu Zhu
Hongfei Li
Mengyang Qiu
Shuo Shuo Liu
AILaw
25
9
0
16 Nov 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
40
4
0
28 Oct 2024
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework
Yifan Wang
Vera Demberg
24
0
0
24 Oct 2024
Cross-model Control: Improving Multiple Large Language Models in One-time Training
Jiayi Wu
Hao-Lun Sun
Hengyi Cai
Lixin Su
S. Wang
Dawei Yin
Xiang Li
Ming Gao
MU
34
0
0
23 Oct 2024
Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback
Benjamin Towle
Ke Zhou
21
0
0
14 Oct 2024
Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
Krithika Ramesh
Nupoor Gandhi
Pulkit Madaan
Lisa Bauer
Charith Peris
Anjalie Field
SyDa
30
1
0
10 Oct 2024
Generating Synthetic Datasets for Few-shot Prompt Tuning
Xu Guo
Zilin Du
Boyang Li
Chunyan Miao
31
1
0
08 Oct 2024
Non-Halting Queries: Exploiting Fixed Points in LLMs
Ghaith Hammouri
Kemal Derya
B. Sunar
28
0
0
08 Oct 2024
Control Large Language Models via Divide and Conquer
Bingxuan Li
Yiwei Wang
Tao Meng
Kai-Wei Chang
Nanyun Peng
24
0
0
06 Oct 2024
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech
Dominika Woszczyk
Soteris Demetriou
25
0
0
05 Oct 2024
Human-aligned Chess with a Bit of Search
Yiming Zhang
Athul Paul Jacob
Vivian Lai
Daniel Fried
Daphne Ippolito
21
1
0
04 Oct 2024
Large Language Models can be Strong Self-Detoxifiers
Ching-Yun Ko
Pin-Yu Chen
Payel Das
Youssef Mroueh
Soham Dan
Georgios Kollias
Subhajit Chaudhury
Tejaswini Pedapati
Luca Daniel
26
2
0
04 Oct 2024
Conditional Enzyme Generation Using Protein Language Models with Adapters
Jason Yang
Aadyot Bhatnagar
Jeffrey A. Ruffolo
Ali Madani
26
4
0
04 Oct 2024