DIRECTOR: Generator-Classifiers For Supervised Language Modeling

15 June 2022

Jason Weston

Papers citing "DIRECTOR: Generator-Classifiers For Supervised Language Modeling"

RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework Yifan Wang Vera Demberg 24 0 0 24 Oct 2024
Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback Benjamin Towle Ke Zhou 21 0 0 14 Oct 2024
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast Chufan Shi Cheng Yang Xinyu Zhu Jiahao Wang Taiqiang Wu Siheng Li Deng Cai Yujiu Yang Yu Meng MoE 45 9 0 23 May 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs Shreyas Chaudhari Pranjal Aggarwal Vishvak Murahari Tanmay Rajpurohit A. Kalyan Karthik Narasimhan A. Deshpande Bruno Castro da Silva 21 33 0 12 Apr 2024
Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs Xun Liang Hanyu Wang Shichao Song Mengting Hu Xunzhi Wang Zhiyu Li Feiyu Xiong Bo Tang 15 9 0 17 Feb 2024
LiFi: Lightweight Controlled Text Generation with Fine-Grained Control Codes Chufan Shi Deng Cai Yujiu Yang 17 3 0 10 Feb 2024
Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss Jing Xu Andrew Lee Sainbayar Sukhbaatar Jason Weston 10 86 0 27 Dec 2023
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor Sangwon Yu Changmin Lee Hojin Lee Sungroh Yoon 22 0 0 13 Nov 2023
Learning to love diligent trolls: Accounting for rater effects in the dialogue safety task M. Ilagan 17 0 0 30 Oct 2023
Controlled Decoding from Language Models Sidharth Mudgal Jong Lee H. Ganapathy Yaguang Li Tao Wang ... Michael Collins Trevor Strohman Jilin Chen Alex Beutel Ahmad Beirami 32 69 0 25 Oct 2023
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection Sehyun Choi Tianqing Fang Zhaowei Wang Yangqiu Song 25 32 0 13 Oct 2023
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values Hannah Rose Kirk Andrew M. Bean Bertie Vidgen Paul Röttger Scott A. Hale ALM 14 40 0 11 Oct 2023
LaDA: Latent Dialogue Action For Zero-shot Cross-lingual Neural Network Language Modeling Zhanyu Ma Jian Ye Shuang Cheng 18 1 0 05 Aug 2023
System-Level Natural Language Feedback Weizhe Yuan Kyunghyun Cho Jason Weston 20 5 0 23 Jun 2023
Improving Open Language Models by Learning from Organic Interactions Jing Xu Da Ju Joshua Lane M. Komeili Eric Michael Smith ... Rashel Moritz Sainbayar Sukhbaatar Y-Lan Boureau Jason Weston Kurt Shuster 17 8 0 07 Jun 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning Chujie Zheng Pei Ke Zheng Zhang Minlie Huang BDL 18 30 0 06 Jun 2023
Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond Haw-Shiuan Chang Zonghai Yao Alolika Gon Hong-ye Yu Andrew McCallum 28 10 0 20 May 2023
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation Zhiling Zhang Mengyue Wu Ke Zhu AI4CE 15 1 0 04 May 2023
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation Patrick Fernandes Aman Madaan Emmy Liu António Farinhas Pedro Henrique Martins ... José G. C. de Souza Shuyan Zhou Tongshuang Wu Graham Neubig André F. T. Martins ALM 113 56 0 01 May 2023
Learn What NOT to Learn: Towards Generative Safety in Chatbots Leila Khalatbari Yejin Bang Dan Su Willy Chung Saeedeh Ghadimi Hossein Sameti Pascale Fung 20 7 0 21 Apr 2023
Using In-Context Learning to Improve Dialogue Safety Nicholas Meade Spandana Gella Devamanyu Hazarika Prakhar Gupta Di Jin Siva Reddy Yang Liu Dilek Z. Hakkani-Tür 25 37 0 02 Feb 2023
Critic-Guided Decoding for Controlled Text Generation Minbeom Kim Hwanhee Lee Kang Min Yoo Joonsuk Park Hwaran Lee Kyomin Jung 26 35 0 21 Dec 2022
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines Prakhar Gupta Yang Liu Di Jin Behnam Hedayatnia Spandana Gella Sijia Liu P. Lange Julia Hirschberg Dilek Z. Hakkani-Tür 18 5 0 20 Dec 2022
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning Xiao Yu Qingyang Wu Kun Qian Zhou Yu OffRL 18 10 0 30 Nov 2022
The CRINGE Loss: Learning what language not to model Leonard Adolphs Tianyu Gao Jing Xu Kurt Shuster Sainbayar Sukhbaatar Jason Weston MU 15 34 0 10 Nov 2022
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels Weiyan Shi Emily Dinan Kurt Shuster Jason Weston Jing Xu 44 19 0 28 Oct 2022
DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation Hanqing Zhang Dawei Song 19 36 0 18 Oct 2022
Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback Jing Xu Megan Ung M. Komeili Kushal Arora Y-Lan Boureau Jason Weston 9 37 0 05 Aug 2022
BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage Kurt Shuster Jing Xu M. Komeili Da Ju Eric Michael Smith ... Naman Goyal Arthur Szlam Y-Lan Boureau Melanie Kambadur Jason Weston LM&Ro KELM 20 229 0 05 Aug 2022
R2D2: Robust Data-to-Text with Replacement Detection Linyong Nan Lorenzo Jaime Yu Flores Yilun Zhao Yixin Liu Luke Benson Weijin Zou Dragomir R. Radev 31 17 0 25 May 2022
Reward Reports for Reinforcement Learning T. Gilbert Nathan Lambert Sarah Dean Tom Zick Aaron J. Snoswell 19 33 0 22 Apr 2022
Challenges in Detoxifying Language Models Johannes Welbl Amelia Glaese J. Uesato Sumanth Dathathri John F. J. Mellor Lisa Anne Hendricks Kirsty Anderson Pushmeet Kohli Ben Coppin Po-Sen Huang LM&MA 242 191 0 15 Sep 2021
Uni-Encoder: A Fast and Accurate Response Selection Paradigm for Generation-Based Dialogue Systems Chiyu Song Hongliang He Haofei Yu Pengfei Fang Leyang Cui Zhenzhong Lan 8 6 0 02 Jun 2021