Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.01824
Cited By
Human in the loop approaches in multi-modal conversational task guidance system development
3 November 2022
R. Manuvinakurike
Sovan Biswas
G. Raffa
R. Beckwith
A. Rhodes
Meng Shi
Gesem Gudino Mejia
Saurav Sahay
L. Nachman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Human in the loop approaches in multi-modal conversational task guidance system development"
9 / 9 papers shown
Title
ACE, Action and Control via Explanations: A Proposal for LLMs to Provide Human-Centered Explainability for Multimodal AI Assistants
E. A. Watkins
Emanuel Moss
R. Manuvinakurike
Meng Shi
R. Beckwith
G. Raffa
LLMAG
36
2
0
27 Feb 2025
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
Linghao Yang
Taein Kwon
Mahdi Rad
Bowen Pan
Ishani Chakraborty
...
Ashley Feniello
Rui Tian
Felipe Vieira Frujeri
Neel Joshi
Marc Pollefeys
EgoV
18
44
0
29 Sep 2023
A Dataset for Medical Instructional Video Classification and Question Answering
D. Gupta
Kush Attal
Dina Demner-Fushman
24
30
0
30 Jan 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
218
682
0
13 Oct 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
245
554
0
28 Sep 2021
MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents
Song Feng
S. Patel
H. Wan
Sachindra Joshi
49
66
0
26 Sep 2021
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
144
261
0
17 Sep 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
62
28
0
13 Jul 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
1