Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2005.02472
Cited By
Cross-media Structured Common Space for Multimedia Event Extraction
5 May 2020
Pengfei Yu
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Cross-media Structured Common Space for Multimedia Event Extraction"
50 / 52 papers shown
Title
Evaluation of Finetuned LLMs in AMR Parsing
Shu Han Ho
0
0
0
07 Aug 2025
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Abhishek Dey
Aabha Bothera
Samhita Sarikonda
Rishav Aryan
Sanjay Kumar Podishetty
Akshay Havalgi
Gaurav Singh
Saurabh Srivastava
131
0
0
16 May 2025
Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction
Li Yuan
Yi Cai
Xudong Shen
Qing Li
Qingbao Huang
Zikun Deng
Tao Wang
MoMe
OffRL
MoE
108
0
0
08 May 2025
Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction
Máté Gedeon
RALM
127
0
0
30 Apr 2025
SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection
Yi-Fan Lu
Xian-Ling Mao
Tian Lan
Tong Zhang
Yu-Shi Zhu
Heyan Huang
142
0
0
05 Mar 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
Tanmay Parekh
Yuxuan Dong
Lucas Bandarkar
Artin Kim
I-Hung Hsu
Kai-Wei Chang
Nanyun Peng
113
0
0
24 Feb 2025
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness
Tanmay Parekh
Jeffrey Kwan
Jiarui Yu
Sparsh Johri
Hyosang Ahn
Sreya Muppalla
Kai-Wei Chang
Wei Wang
Nanyun Peng
138
4
0
24 Oct 2024
Beyond Exact Match: Semantically Reassessing Event Extraction by Large Language Models
Yi-Fan Lu
Xian-Ling Mao
Tian Lan
Heyan Huang
Heyan Huang
Xiaoyan Gao
110
0
0
12 Oct 2024
Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs
Mazal Bethany
Emet Bethany
Brandon Wherry
Cho-Yu Chiang
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
LRM
129
1
0
30 Aug 2024
ARMADA: Attribute-Based Multimodal Data Augmentation
Xiaomeng Jin
Jeonghwan Kim
Yu Zhou
Kuan-Hao Huang
Te-Lin Wu
Nanyun Peng
Heng Ji
98
2
0
19 Aug 2024
A Survey on Integrated Sensing, Communication, and Computation
Dingzhu Wen
Yong Zhou
Xiaoyang Li
Yuanming Shi
Kaibin Huang
Khaled B. Letaief
100
47
0
15 Aug 2024
MMUTF: Multimodal Multimedia Event Argument Extraction with Unified Template Filling
Philipp Seeberger
Dominik Wagner
Korbinian Riedhammer
91
1
0
18 Jun 2024
Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles
Abhijnan Nath
Huma Jamil
Shafiuddin Rehan Ahmed
George Baker
Rahul Ghosh
James H. Martin
Nathaniel Blanchard
Nikhil Krishnaswamy
86
2
0
13 Apr 2024
GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling
Hritik Bansal
Po-Nien Kung
P. Brantingham
Weisheng Wang
Miao Zheng
VLM
94
2
0
07 Apr 2024
Event Detection from Social Media for Epidemic Prediction
Tanmay Parekh
Anh Mac
Jiarui Yu
Yuxuan Dong
Syed Shahriar
...
Eric Yang
Kuan-Hao Huang
Wei Wang
Nanyun Peng
Kai-Wei Chang
84
8
0
02 Apr 2024
Improving Event Definition Following For Zero-Shot Event Detection
Zefan Cai
Po-Nien Kung
Ashima Suvarna
Mingyu Derek Ma
Hritik Bansal
Baobao Chang
P. Brantingham
Wei Wang
Nanyun Peng
108
9
0
05 Mar 2024
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Lin Sun
Kai Zhang
Qingyuan Li
Renze Lou
106
15
0
05 Jan 2024
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
Kung-Hsiang Huang
Mingyang Zhou
Hou Pong Chan
Yi R. Fung
Zhenhailong Wang
Lingyu Zhang
Shih-Fu Chang
Chenhui Xu
131
43
0
15 Dec 2023
RESIN-EDITOR: A Schema-guided Hierarchical Event Graph Visualizer and Editor
Khanh Duy Nguyen
Zixuan Zhang
Reece Suchocki
Sha Li
Martha Palmer
S. Brown
Jiawei Han
Heng Ji
86
2
0
05 Dec 2023
Video Summarization: Towards Entity-Aware Captions
Hammad A. Ayyubi
Tianqi Liu
Arsha Nagrani
Xudong Lin
Ruotong Wang
Anurag Arnab
Feng Han
Yukun Zhu
Jialu Liu
Shih-Fu Chang
68
0
0
01 Dec 2023
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
Yangyi Chen
Xingyao Wang
Pengfei Yu
Derek Hoiem
Heng Ji
96
12
0
22 Nov 2023
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
Kuan-Hao Huang
I-Hung Hsu
Tanmay Parekh
Zhiyu Xie
Zixuan Zhang
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
Heng Ji
139
22
0
16 Nov 2023
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment
Jaewoo Lee
Jaehong Yoon
Wonjae Kim
Yunji Kim
Sung Ju Hwang
CLL
143
1
0
12 Oct 2023
Multimodal Question Answering for Unified Information Extraction
Yuxuan Sun
Kai Zhang
Yu-Chuan Su
76
8
0
04 Oct 2023
MultiVENT: Multilingual Videos of Events with Aligned Natural Text
Kate Sanders
David Etter
Reno Kriz
Benjamin Van Durme
VGen
131
7
0
06 Jul 2023
Artificial Intelligence for Emergency Response
Ayan Mukhopadhyay
35
1
0
15 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
106
9
0
15 Jun 2023
Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning
Xiaoming Shi
Siqiao Xue
Kangrui Wang
Fan Zhou
James Y. Zhang
Jun-ping Zhou
Chenhao Tan
Hongyuan Mei
ReLM
LRM
128
57
0
26 May 2023
Multimodal Automated Fact-Checking: A Survey
Mubashara Akhtar
Michael Schlichtkrull
Zhijiang Guo
O. Cocarascu
Elena Simperl
Andreas Vlachos
176
44
0
22 May 2023
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Farhad Moghimifar
Fatemeh Shiri
Van Nguyen
Gholamreza Haffari
Yuanyou Li
VLM
80
3
0
04 May 2023
Understanding Social Media Cross-Modality Discourse in Linguistic Space
Chunpu Xu
Hanzhuo Tan
Jing Li
Piji Li
106
8
0
26 Feb 2023
In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Andrew Lu
Xudong Lin
Yulei Niu
Shih-Fu Chang
106
2
0
06 Jan 2023
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval
Mustafa Shukor
Nicolas Thome
Matthieu Cord
CLIP
CoGe
105
10
0
08 Dec 2022
Video Event Extraction via Tracking Visual States of Arguments
Guang Yang
Pengfei Yu
Jiajie Zhang
Xudong Lin
Shih-Fu Chang
Heng Ji
82
12
0
03 Nov 2022
Language Model Pre-Training with Sparse Latent Typing
Liliang Ren
Zixuan Zhang
H. Wang
Clare R. Voss
Chengxiang Zhai
Heng Ji
138
3
0
23 Oct 2022
Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across Modalities
Hammad A. Ayyubi
Christopher Thomas
Lovish Chum
R. Lokesh
Long Chen
...
Xudong Lin
Xuande Feng
Jaywon Koo
Sounak Ray
Shih-Fu Chang
AI4TS
93
0
0
14 Jun 2022
Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations
R. N. Nandi
Firoj Alam
Preslav Nakov
58
8
0
09 May 2022
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
164
186
0
25 Apr 2022
Fine-Grained Visual Entailment
Christopher Thomas
Yipeng Zhang
Shih-Fu Chang
145
6
0
29 Mar 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
100
184
0
11 Feb 2022
CLIP-Event: Connecting Text and Images with Event Structures
Pengfei Yu
Ruochen Xu
Shuohang Wang
Luowei Zhou
Xudong Lin
Chenguang Zhu
Michael Zeng
Heng Ji
Shih-Fu Chang
VLM
CLIP
101
136
0
13 Jan 2022
What is Event Knowledge Graph: A Survey
Saiping Guan
Xueqi Cheng
Long Bai
Fu Zhang
Zixuan Li
Yutao Zeng
Xiaolong Jin
Jiafeng Guo
85
62
0
31 Dec 2021
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Pengfei Yu
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
Alex Schwing
Heng Ji
102
33
0
20 Dec 2021
Joint Multimedia Event Extraction from Video and Article
Brian Chen
Xudong Lin
Christopher Thomas
Pengfei Yu
Shoya Yoshida
Lovish Chum
Heng Ji
Shih-Fu Chang
VGen
96
26
0
27 Sep 2021
Fine-Grained Chemical Entity Typing with Multimodal Knowledge Representation
Chenkai Sun
Weijian Li
Jinfeng Xiao
Nikolaus Nova Parulian
ChengXiang Zhai
Heng Ji
99
4
0
29 Aug 2021
A Survey on Deep Learning Event Extraction: Approaches and Applications
Qian Li
Jianxin Li
Shuaiyi Nie
Shiyao Cui
Hongzhi Zhang
...
Hao Peng
Shu Guo
Lihong Wang
Amin Beheshti
Philip S. Yu
140
53
0
05 Jul 2021
Video Question Answering with Phrases via Semantic Roles
Arka Sadhu
Kan Chen
Ram Nevatia
66
16
0
08 Apr 2021
Visual Semantic Role Labeling for Video Understanding
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
115
75
0
02 Apr 2021
EventPlus: A Temporal Event Understanding Pipeline
Mingyu Derek Ma
Jiao Sun
Mu Yang
Kung-Hsiang Huang
Nuan Wen
Shikhar Singh
Rujun Han
Nanyun Peng
78
32
0
13 Jan 2021
Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings
Yue Wang
Jing Li
Michael R. Lyu
Irwin King
98
17
0
03 Nov 2020
1
2
Next