ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.13623
  4. Cited By
Reinforcement Learning and Bandits for Speech and Language Processing:
  Tutorial, Review and Outlook

Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

24 October 2022
Baihan Lin
    OffRL
    AI4TS
ArXivPDFHTML

Papers citing "Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook"

18 / 18 papers shown
Title
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
Philips George John
Arnab Bhattacharyya
Silviu Maniu
Dimitrios Myrisiotis
Zhenan Wu
OffRL
21
0
0
16 Nov 2024
Towards Evaluating Large Language Models for Graph Query Generation
Towards Evaluating Large Language Models for Graph Query Generation
Siraj Munir
Alessandro Aldini
ELM
23
0
0
13 Nov 2024
Enhancing Trust in Autonomous Agents: An Architecture for Accountability
  and Explainability through Blockchain and Large Language Models
Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models
Laura Fernández-Becerra
Miguel Ángel González Santamarta
Ángel Manuel Guerrero Higueras
Francisco J. Rodríguez-Lera
Vicente Matellán Olivera
26
0
0
14 Mar 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the
  Generative Artificial Intelligence (AI) Research Landscape
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
79
46
0
18 Dec 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
36
10
0
28 Aug 2023
Towards Healthy AI: Large Language Models Need Therapists Too
Towards Healthy AI: Large Language Models Need Therapists Too
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Kush R. Varshney
AI4MH
22
16
0
02 Apr 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations
  and Interpretable Policy Dynamics
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
AI4TS
AI4MH
14
9
0
16 Mar 2023
A Reinforcement Learning Framework for Online Speaker Diarization
A Reinforcement Learning Framework for Online Speaker Diarization
Baihan Lin
Xinxin Zhang
OffRL
18
2
0
21 Feb 2023
Working Alliance Transformer for Psychotherapy Dialogue Classification
Working Alliance Transformer for Psychotherapy Dialogue Classification
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
14
13
0
27 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy
  Treatment Strategies with Deep Reinforcement Learning
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
6
12
0
27 Aug 2022
Knowledge Management System with NLP-Assisted Annotations: A Brief
  Survey and Outlook
Knowledge Management System with NLP-Assisted Annotations: A Brief Survey and Outlook
Baihan Lin
20
11
0
15 Jun 2022
Neural Topic Modeling of Psychotherapy Sessions
Neural Topic Modeling of Psychotherapy Sessions
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Ravi Tejwani
BDL
22
15
0
13 Apr 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Advances and Challenges in Conversational Recommender Systems: A Survey
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
128
270
0
23 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement
  Learning
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning
Nan Jiang
Sheng Jin
Z. Duan
Changshui Zhang
OffRL
24
49
0
08 Feb 2020
Provably Efficient Online Hyperparameter Optimization with
  Population-Based Bandits
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
62
82
0
06 Feb 2020
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
192
1,325
0
05 Jun 2016
1