ManyModalQA: Modality Disambiguation and QA over Diverse Inputs

AAAI Conference on Artificial Intelligence (AAAI), 2020

22 January 2020

ArXiv (abs)PDF HTML Github (17★)

Papers citing "ManyModalQA: Modality Disambiguation and QA over Diverse Inputs"

37 / 37 papers shown

Memory-QA: Answering Recall Questions Based on Multimodal Memories

...

198

22 Sep 2025

Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective

234

27 May 2025

VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering

Qi Zhi Lim

C. Lee

K. Lim

Kalaiarasi Sonai Muthu Anbananthen

299

11 Apr 2025

FCMR: Robust Evaluation of Financial Cross-Modal Multi-Hop ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

525

17 Dec 2024

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and ChartACM Multimedia (MM), 2024

251

28 Oct 2024

RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-trainingIEEE transactions on multimedia (IEEE TMM), 2024

Muhe Ding

Liqiang Nie

289

18 Oct 2024

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal ModelsInternational Conference on Learning Representations (ICLR), 2024

Pan Lu

Kai-Wei Chang

Nanyun Peng

VLM

395

10 Oct 2024

Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language ModelsIEEE Access (IEEE Access), 2024

Akchay Srivastava

Atif Memon

ELM

251

19 Jun 2024

cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers

Anirudh S. Sundar

Jin Xu

William Gay

Christopher Richardson

Larry Heck

321

12 Jun 2024

MileBench: Benchmarking MLLMs in Long Context

Xiang Wan

414

29 Apr 2024

iTBLS: A Dataset of Interactive Conversations Over Tabular Information

Anirudh S. Sundar

Christopher Richardson

William Gay

Larry Heck

LMTD

406

19 Apr 2024

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLMConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Andrea Madotto

Babak Damavandi

269

07 Mar 2024

Exploring Hybrid Question Answering via Program-based Prompting

224

16 Feb 2024

DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text

Philip S. Yu

Yingbo Zhou

278

31 Oct 2023

EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray ImagesNeural Information Processing Systems (NeurIPS), 2023

...

Jungwoo Oh

Lei Ji

E. Chang

Tackeun Kim

Edward Choi

356

28 Oct 2023

MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model

259

20 Oct 2023

Progressive Evidence Refinement for Open-domain Multimodal Retrieval Question Answering

Xingjiao Wu

246

15 Oct 2023

MMHQA-ICL: Multimodal In-context Learning for Hybrid Question Answering over Text, Tables and Images

Jun Zhao

216

09 Sep 2023

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative InstructionsInternational Conference on Learning Representations (ICLR), 2023

Wei Ji

401

08 Aug 2023

Unified Language Representation for Question Answering over Text, Tables, and ImagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Fei Huang

281

29 Jun 2023

Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question AnsweringInternational Conference on the Theory of Information Retrieval (ICTIR), 2023

Alireza Salemi

Mahta Rafiee

Hamed Zamani

232

28 Jun 2023

A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question AnsweringAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023

Alireza Salemi

Juan Altmayer Pizzorno

Hamed Zamani

166

26 Apr 2023

MPMQA: Multimodal Question Answering on Product ManualsAAAI Conference on Artificial Intelligence (AAAI), 2023

Liangfu Zhang

Anwen Hu

Jing Zhang

Shuo Hu

Qin Jin

230

19 Apr 2023

cTBLS: Augmenting Large Language Models with Conversational Tables

Anirudh S. Sundar

Larry Heck

LMTD

409

21 Mar 2023

MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

425

259

06 Oct 2022

OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking ExperienceACM Transactions on the Web (TWEB), 2022

298

24 Jun 2022

Multimodal Conversational AI: A Survey of Datasets and Approaches

Anirudh S. Sundar

Larry Heck

176

13 May 2022

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related QueriesInternational Conference on Language Resources and Evaluation (LREC), 2022

147

03 May 2022

Conversational Question Answering on Heterogeneous SourcesAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022

Philipp Christmann

Rishiraj Saha Roy

Gerhard Weikum

365

25 Apr 2022

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and GroundingAAAI Conference on Artificial Intelligence (AAAI), 2021

Revanth Reddy Gangi Reddy

...

Heng Ji

302

20 Dec 2021

Echo-Reconstruction: Audio-Augmented 3D Scene Reconstruction

176

05 Oct 2021

WebQA: Multihop and Multimodal QAComputer Vision and Pattern Recognition (CVPR), 2021

427

130

01 Sep 2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

...

Peter Wu

Michelle A. Lee

Yuke Zhu

Ruslan Salakhutdinov

Louis-Philippe Morency

VLM

337

238

15 Jul 2021

Question Decomposition with Dependency GraphsConference on Automated Knowledge Base Construction (AKBC), 2021

Matan Hasson

Jonathan Berant

GNN

204

17 Apr 2021

Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Taichi Iki

Akiko Aizawa

VLM

271

16 Apr 2021

MultiModalQA: Complex Question Answering over Text, Tables and ImagesInternational Conference on Learning Representations (ICLR), 2021

336

220

13 Apr 2021

Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval

Akari Asai

Eunsol Choi

RALM

373

22 Oct 2020