ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.06039
  4. Cited By
MultiModalQA: Complex Question Answering over Text, Tables and Images

MultiModalQA: Complex Question Answering over Text, Tables and Images

International Conference on Learning Representations (ICLR), 2021
13 April 2021
Alon Talmor
Ori Yoran
Amnon Catav
Dan Lahav
Yizhong Wang
Akari Asai
Gabriel Ilharco
Hannaneh Hajishirzi
Jonathan Berant
    LMTD
ArXiv (abs)PDFHTML

Papers citing "MultiModalQA: Complex Question Answering over Text, Tables and Images"

39 / 89 papers shown
VTQA: Visual Text Question Answering via Entity Alignment and
  Cross-Media Reasoning
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media ReasoningComputer Vision and Pattern Recognition (CVPR), 2023
Kan Chen
Xiangqian Wu
CoGe
170
19
0
05 Mar 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
731
17
0
17 Feb 2023
SlideVQA: A Dataset for Document Visual Question Answering on Multiple
  Images
SlideVQA: A Dataset for Document Visual Question Answering on Multiple ImagesAAAI Conference on Artificial Intelligence (AAAI), 2023
Ryota Tanaka
Kyosuke Nishida
Kosuke Nishida
Taku Hasegawa
Itsumi Saito
Kuniko Saito
245
150
0
12 Jan 2023
A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and
  Future Directions
A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and Future Directions
Dingzirui Wang
Longxu Dou
Wanxiang Che
324
7
0
27 Dec 2022
Enhancing Multi-modal and Multi-hop Question Answering via Structured
  Knowledge and Unified Retrieval-Generation
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-GenerationACM Multimedia (ACM MM), 2022
Qian Yang
Qian Chen
Wen Wang
Baotian Hu
Min Zhang
286
35
0
16 Dec 2022
Training Vision-Language Models with Less Bimodal Supervision
Training Vision-Language Models with Less Bimodal SupervisionConference on Automated Knowledge Base Construction (AKBC), 2022
Elad Segal
Ben Bogin
Jonathan Berant
VLM
129
2
0
01 Nov 2022
PACIFIC: Towards Proactive Conversational Question Answering over
  Tabular and Textual Data in Finance
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in FinanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yang Deng
Wenqiang Lei
Wenxuan Zhang
W. Lam
Tat-Seng Chua
359
66
0
17 Oct 2022
Large Language Models are few(1)-shot Table Reasoners
Large Language Models are few(1)-shot Table ReasonersFindings (Findings), 2022
Wenhu Chen
LMTDReLMLRM
277
198
0
13 Oct 2022
OpenCQA: Open-ended Question Answering with Charts
OpenCQA: Open-ended Question Answering with ChartsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shankar Kantharaj
Do Xuan Long
Rixie Tiffany Ko Leong
J. Tan
Enamul Hoque
Shafiq Joty
173
70
0
12 Oct 2022
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation
  for Video-based Action Anticipation
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action AnticipationFindings (Findings), 2022
Sayontan Ghosh
Tanvi Aggarwal
Minh Hoai
Niranjan Balasubramanian
VLM
231
4
0
12 Oct 2022
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question
  Answering over Images and Text
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Wenhu Chen
Hexiang Hu
Xi Chen
Pat Verga
William W. Cohen
RALM
363
242
0
06 Oct 2022
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic LanguagesInternational Conference on Learning Representations (ICLR), 2022
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
441
274
0
06 Oct 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured
  Mathematical Reasoning
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningInternational Conference on Learning Representations (ICLR), 2022
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLMLRM
544
392
0
29 Sep 2022
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future
  Directions
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions
Bowen Qin
Binyuan Hui
Lihan Wang
Min Yang
Jinyang Li
...
Rongyu Cao
Jian Sun
Luo Si
Fei Huang
Yongbin Li
LMTD
253
79
0
29 Aug 2022
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking
  Experience
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking ExperienceACM Transactions on the Web (TWEB), 2022
Miaoran Li
Baolin Peng
Jianfeng Gao
Zhu Zhang
268
8
0
24 Jun 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Peng Xu
Xiatian Zhu
David Clifton
ViT
572
860
0
13 Jun 2022
MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and
  Textual Data
MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yilun Zhao
Yunxiang Li
Chenying Li
Rui Zhang
AIMat
275
144
0
03 Jun 2022
QAMPARI: An Open-domain Question Answering Benchmark for Questions with
  Many Answers from Multiple Paragraphs
QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple ParagraphsIEEE Games Entertainment Media Conference (GEM), 2022
S. Amouyal
Tomer Wolfson
Ohad Rubin
Ori Yoran
Jonathan Herzig
Jonathan Berant
RALMVLM
499
40
0
25 May 2022
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured
  Electronic Health Records For Medicine Related Queries
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related QueriesInternational Conference on Language Resources and Evaluation (LREC), 2022
Jayetri Bardhan
Anthony Colas
Kirk Roberts
D. Wang
CML
120
18
0
03 May 2022
Conversational Question Answering on Heterogeneous Sources
Conversational Question Answering on Heterogeneous SourcesAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Philipp Christmann
Rishiraj Saha Roy
Gerhard Weikum
254
50
0
25 Apr 2022
Learning to Answer Questions in Dynamic Audio-Visual Scenarios
Learning to Answer Questions in Dynamic Audio-Visual ScenariosComputer Vision and Pattern Recognition (CVPR), 2022
Guangyao Li
Yake Wei
Yapeng Tian
Chenliang Xu
Ji-Rong Wen
Di Hu
297
215
0
26 Mar 2022
Table Structure Recognition with Conditional Attention
Table Structure Recognition with Conditional Attention
Bin Xiao
Murat Simsek
B. Kantarci
Ala Abu Alkheir
LMTD
188
12
0
08 Mar 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding
  with Text-to-Text Language Models
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianbao Xie
Chen Henry Wu
Peng Shi
Ruiqi Zhong
Torsten Scholak
...
Lingpeng Kong
Rui Zhang
Noah A. Smith
Luke Zettlemoyer
Tao Yu
LMTD
348
338
0
16 Jan 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
313
167
0
14 Jan 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media
  Knowledge Extraction and Grounding
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and GroundingAAAI Conference on Artificial Intelligence (AAAI), 2021
Revanth Reddy Gangi Reddy
Xilin Rui
Pengfei Yu
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
Alex Schwing
Heng Ji
266
34
0
20 Dec 2021
Multimodal End-to-End Group Emotion Recognition using Cross-Modal
  Attention
Multimodal End-to-End Group Emotion Recognition using Cross-Modal Attention
Lev Evtodienko
120
6
0
10 Nov 2021
Logic-level Evidence Retrieval and Graph-based Verification Network for
  Table-based Fact Verification
Logic-level Evidence Retrieval and Graph-based Verification Network for Table-based Fact Verification
Qi Shi
Yu Zhang
Qingyu Yin
Ting Liu
289
21
0
14 Sep 2021
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
Michael J.Q. Zhang
Eunsol Choi
RALM
267
173
0
13 Sep 2021
MATE: Multi-view Attention for Table Transformer Efficiency
MATE: Multi-view Attention for Table Transformer EfficiencyConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Julian Martin Eisenschlos
Maharshi Gor
Thomas Müller
William W. Cohen
LMTD
202
101
0
09 Sep 2021
WebQA: Multihop and Multimodal QA
WebQA: Multihop and Multimodal QAComputer Vision and Pattern Recognition (CVPR), 2021
Yingshan Chang
M. Narang
Hisami Suzuki
Guihong Cao
Jianfeng Gao
Yonatan Bisk
LRM
382
113
0
01 Sep 2021
Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models
Multi-modal Retrieval of Tables and Texts Using Tri-encoder ModelsWorkshop on Machine Reading for Question Answering (MRQA), 2021
Bogdan Kostić
Julian Risch
Timo Moller
RALM
451
25
0
09 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
518
562
0
02 Aug 2021
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering
  and Reading Comprehension
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading ComprehensionACM Computing Surveys (CSUR), 2021
Anna Rogers
Matt Gardner
Isabelle Augenstein
377
191
0
27 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
Paul Pu Liang
Yiwei Lyu
Xiang Fan
Zetian Wu
Yun Cheng
...
Peter Wu
Michelle A. Lee
Yuke Zhu
Ruslan Salakhutdinov
Louis-Philippe Morency
VLM
294
224
0
15 Jul 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning SkillsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Ori Yoran
Alon Talmor
Jonathan Berant
ReLMLRM
377
55
0
15 Jul 2021
Question Decomposition with Dependency Graphs
Question Decomposition with Dependency GraphsConference on Automated Knowledge Base Construction (AKBC), 2021
Matan Hasson
Jonathan Berant
GNN
187
10
0
17 Apr 2021
FeTaQA: Free-form Table Question Answering
FeTaQA: Free-form Table Question AnsweringTransactions of the Association for Computational Linguistics (TACL), 2021
Linyong Nan
Chia-Hsuan Hsieh
Ziming Mao
Xi Lin
Neha Verma
...
Isabel Trindade
Renusree Bandaru
Jacob Cunningham
Caiming Xiong
Dragomir R. Radev
LMTD
349
222
0
01 Apr 2021
Challenges in Information-Seeking QA: Unanswerable Questions and
  Paragraph Retrieval
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
307
60
0
22 Oct 2020
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question
  Answering
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question AnsweringInternational Conference on Learning Representations (ICLR), 2019
Akari Asai
Kazuma Hashimoto
Hannaneh Hajishirzi
R. Socher
Caiming Xiong
RALMKELMLRM
531
315
0
24 Nov 2019
Previous
12
Page 2 of 2