Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

24 October 2022

Papers citing "Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook"

18 / 18 papers shown

Title
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs Philips George John Arnab Bhattacharyya Silviu Maniu Dimitrios Myrisiotis Zhenan Wu OffRL 21 0 0 16 Nov 2024
Towards Evaluating Large Language Models for Graph Query Generation Siraj Munir Alessandro Aldini ELM 23 0 0 13 Nov 2024
Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models Laura Fernández-Becerra Miguel Ángel González Santamarta Ángel Manuel Guerrero Higueras Francisco J. Rodríguez-Lera Vicente Matellán Olivera 26 0 0 14 Mar 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape Timothy R. McIntosh Teo Susnjak Tong Liu Paul Watters Malka N. Halgamuge 79 46 0 18 Dec 2023
Reinforcement Learning for Generative AI: A Survey Yuanjiang Cao Quan.Z Sheng Julian McAuley Lina Yao SyDa 36 10 0 28 Aug 2023
Towards Healthy AI: Large Language Models Need Therapists Too Baihan Lin Djallel Bouneffouf Guillermo Cecchi Kush R. Varshney AI4MH 22 16 0 02 Apr 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics Baihan Lin Guillermo Cecchi Djallel Bouneffouf OffRL AI4TS AI4MH 14 9 0 16 Mar 2023
A Reinforcement Learning Framework for Online Speaker Diarization Baihan Lin Xinxin Zhang OffRL 18 2 0 21 Feb 2023
Working Alliance Transformer for Psychotherapy Dialogue Classification Baihan Lin Guillermo Cecchi Djallel Bouneffouf 14 13 0 27 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning Baihan Lin Guillermo Cecchi Djallel Bouneffouf OffRL 6 12 0 27 Aug 2022
Knowledge Management System with NLP-Assisted Annotations: A Brief Survey and Outlook Baihan Lin 20 11 0 15 Jun 2022
Neural Topic Modeling of Psychotherapy Sessions Baihan Lin Djallel Bouneffouf Guillermo Cecchi Ravi Tejwani BDL 22 15 0 13 Apr 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 301 11,730 0 04 Mar 2022
Advances and Challenges in Conversational Recommender Systems: A Survey Chongming Gao Wenqiang Lei Xiangnan He Maarten de Rijke Tat-Seng Chua 128 270 0 23 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems Sergey Levine Aviral Kumar George Tucker Justin Fu OffRL GP 321 1,662 0 04 May 2020
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning Nan Jiang Sheng Jin Z. Duan Changshui Zhang OffRL 24 49 0 08 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits Jack Parker-Holder Vu Nguyen Stephen J. Roberts OffRL 62 82 0 06 Feb 2020
Deep Reinforcement Learning for Dialogue Generation Jiwei Li Will Monroe Alan Ritter Michel Galley Jianfeng Gao Dan Jurafsky 192 1,325 0 05 Jun 2016