Cross-media Structured Common Space for Multimedia Event Extraction

5 May 2020
Pengfei Yu
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
arXiv: 2005.02472

Papers citing "Cross-media Structured Common Space for Multimedia Event Extraction"

50 / 52 papers shown
Title
Evaluation of Finetuned LLMs in AMR Parsing
Shu Han Ho
0
0
0
07 Aug 2025
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Abhishek Dey
Aabha Bothera
Samhita Sarikonda
Rishav Aryan
Sanjay Kumar Podishetty
Akshay Havalgi
Gaurav Singh
Saurabh Srivastava
131
0
0
16 May 2025
Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction
Li Yuan
Yi Cai
Xudong Shen
Qing Li
Qingbao Huang
Zikun Deng
Tao Wang
MoMe, OffRL, MoE
108
0
0
08 May 2025
Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction
Máté Gedeon
RALM
127
0
0
30 Apr 2025
SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection
Yi-Fan Lu
Xian-Ling Mao
Tian Lan
Tong Zhang
Yu-Shi Zhu
Heyan Huang
142
0
0
05 Mar 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
Tanmay Parekh
Yuxuan Dong
Lucas Bandarkar
Artin Kim
I-Hung Hsu
Kai-Wei Chang
Nanyun Peng
113
0
0
24 Feb 2025
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness
Tanmay Parekh
Jeffrey Kwan
Jiarui Yu
Sparsh Johri
Hyosang Ahn
Sreya Muppalla
Kai-Wei Chang
Wei Wang
Nanyun Peng
138
4
0
24 Oct 2024
Beyond Exact Match: Semantically Reassessing Event Extraction by Large Language Models
Yi-Fan Lu
Xian-Ling Mao
Tian Lan
Heyan Huang
Xiaoyan Gao
110
0
0
12 Oct 2024
Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs
Mazal Bethany
Emet Bethany
Brandon Wherry
Cho-Yu Chiang
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
LRM
129
1
0
30 Aug 2024
ARMADA: Attribute-Based Multimodal Data Augmentation
Xiaomeng Jin
Jeonghwan Kim
Yu Zhou
Kuan-Hao Huang
Te-Lin Wu
Nanyun Peng
Heng Ji
98
2
0
19 Aug 2024
A Survey on Integrated Sensing, Communication, and Computation
Dingzhu Wen
Yong Zhou
Xiaoyang Li
Yuanming Shi
Kaibin Huang
Khaled B. Letaief
100
47
0
15 Aug 2024
MMUTF: Multimodal Multimedia Event Argument Extraction with Unified Template Filling
Philipp Seeberger
Dominik Wagner
Korbinian Riedhammer
91
1
0
18 Jun 2024
Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles
Abhijnan Nath
Huma Jamil
Shafiuddin Rehan Ahmed
George Baker
Rahul Ghosh
James H. Martin
Nathaniel Blanchard
Nikhil Krishnaswamy
86
2
0
13 Apr 2024
GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling
Hritik Bansal
Po-Nien Kung
P. Brantingham
Weisheng Wang
Miao Zheng
VLM
94
2
0
07 Apr 2024
Event Detection from Social Media for Epidemic Prediction
Tanmay Parekh
Anh Mac
Jiarui Yu
Yuxuan Dong
Syed Shahriar
...
Eric Yang
Kuan-Hao Huang
Wei Wang
Nanyun Peng
Kai-Wei Chang
84
8
0
02 Apr 2024
Improving Event Definition Following For Zero-Shot Event Detection
Zefan Cai
Po-Nien Kung
Ashima Suvarna
Mingyu Derek Ma
Hritik Bansal
Baobao Chang
P. Brantingham
Wei Wang
Nanyun Peng
108
9
0
05 Mar 2024
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Lin Sun
Kai Zhang
Qingyuan Li
Renze Lou
106
15
0
05 Jan 2024
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
Kung-Hsiang Huang
Mingyang Zhou
Hou Pong Chan
Yi R. Fung
Zhenhailong Wang
Lingyu Zhang
Shih-Fu Chang
Chenhui Xu
131
43
0
15 Dec 2023
RESIN-EDITOR: A Schema-guided Hierarchical Event Graph Visualizer and Editor
Khanh Duy Nguyen
Zixuan Zhang
Reece Suchocki
Sha Li
Martha Palmer
S. Brown
Jiawei Han
Heng Ji
86
2
0
05 Dec 2023
Video Summarization: Towards Entity-Aware Captions
Hammad A. Ayyubi
Tianqi Liu
Arsha Nagrani
Xudong Lin
Ruotong Wang
Anurag Arnab
Feng Han
Yukun Zhu
Jialu Liu
Shih-Fu Chang
68
0
0
01 Dec 2023
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
Yangyi Chen
Xingyao Wang
Pengfei Yu
Derek Hoiem
Heng Ji
96
12
0
22 Nov 2023
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
Kuan-Hao Huang
I-Hung Hsu
Tanmay Parekh
Zhiyu Xie
Zixuan Zhang
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
Heng Ji
139
22
0
16 Nov 2023
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment
Jaewoo Lee
Jaehong Yoon
Wonjae Kim
Yunji Kim
Sung Ju Hwang
CLL
143
1
0
12 Oct 2023
Multimodal Question Answering for Unified Information Extraction
Yuxuan Sun
Kai Zhang
Yu-Chuan Su
76
8
0
04 Oct 2023
MultiVENT: Multilingual Videos of Events with Aligned Natural Text
Kate Sanders
David Etter
Reno Kriz
Benjamin Van Durme
VGen
131
7
0
06 Jul 2023
Artificial Intelligence for Emergency Response
Ayan Mukhopadhyay
35
1
0
15 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
106
9
0
15 Jun 2023
Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning
Xiaoming Shi
Siqiao Xue
Kangrui Wang
Fan Zhou
James Y. Zhang
Jun-ping Zhou
Chenhao Tan
Hongyuan Mei
ReLM, LRM
128
57
0
26 May 2023
Multimodal Automated Fact-Checking: A Survey
Mubashara Akhtar
Michael Schlichtkrull
Zhijiang Guo
O. Cocarascu
Elena Simperl
Andreas Vlachos
176
44
0
22 May 2023
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Farhad Moghimifar
Fatemeh Shiri
Van Nguyen
Gholamreza Haffari
Yuanyou Li
VLM
80
3
0
04 May 2023
Understanding Social Media Cross-Modality Discourse in Linguistic Space
Chunpu Xu
Hanzhuo Tan
Jing Li
Piji Li
106
8
0
26 Feb 2023
In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Andrew Lu
Xudong Lin
Yulei Niu
Shih-Fu Chang
106
2
0
06 Jan 2023
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval
Mustafa Shukor
Nicolas Thome
Matthieu Cord
CLIP, CoGe
105
10
0
08 Dec 2022
Video Event Extraction via Tracking Visual States of Arguments
Guang Yang
Pengfei Yu
Jiajie Zhang
Xudong Lin
Shih-Fu Chang
Heng Ji
82
12
0
03 Nov 2022
Language Model Pre-Training with Sparse Latent Typing
Liliang Ren
Zixuan Zhang
H. Wang
Clare R. Voss
Chengxiang Zhai
Heng Ji
138
3
0
23 Oct 2022
Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across Modalities
Hammad A. Ayyubi
Christopher Thomas
Lovish Chum
R. Lokesh
Long Chen
...
Xudong Lin
Xuande Feng
Jaywon Koo
Sounak Ray
Shih-Fu Chang
AI4TS
93
0
0
14 Jun 2022
Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations
R. N. Nandi
Firoj Alam
Preslav Nakov
58
8
0
09 May 2022
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
164
186
0
25 Apr 2022
Fine-Grained Visual Entailment
Christopher Thomas
Yipeng Zhang
Shih-Fu Chang
145
6
0
29 Mar 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
100
184
0
11 Feb 2022
CLIP-Event: Connecting Text and Images with Event Structures
Pengfei Yu
Ruochen Xu
Shuohang Wang
Luowei Zhou
Xudong Lin
Chenguang Zhu
Michael Zeng
Heng Ji
Shih-Fu Chang
VLM, CLIP
101
136
0
13 Jan 2022
What is Event Knowledge Graph: A Survey
Saiping Guan
Xueqi Cheng
Long Bai
Fu Zhang
Zixuan Li
Yutao Zeng
Xiaolong Jin
Jiafeng Guo
85
62
0
31 Dec 2021
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Pengfei Yu
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
Alex Schwing
Heng Ji
102
33
0
20 Dec 2021
Joint Multimedia Event Extraction from Video and Article
Brian Chen
Xudong Lin
Christopher Thomas
Pengfei Yu
Shoya Yoshida
Lovish Chum
Heng Ji
Shih-Fu Chang
VGen
96
26
0
27 Sep 2021
Fine-Grained Chemical Entity Typing with Multimodal Knowledge Representation
Chenkai Sun
Weijian Li
Jinfeng Xiao
Nikolaus Nova Parulian
ChengXiang Zhai
Heng Ji
99
4
0
29 Aug 2021
A Survey on Deep Learning Event Extraction: Approaches and Applications
Qian Li
Jianxin Li
Shuaiyi Nie
Shiyao Cui
Hongzhi Zhang
...
Hao Peng
Shu Guo
Lihong Wang
Amin Beheshti
Philip S. Yu
140
53
0
05 Jul 2021
Video Question Answering with Phrases via Semantic Roles
Arka Sadhu
Kan Chen
Ram Nevatia
66
16
0
08 Apr 2021
Visual Semantic Role Labeling for Video Understanding
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
115
75
0
02 Apr 2021
EventPlus: A Temporal Event Understanding Pipeline
Mingyu Derek Ma
Jiao Sun
Mu Yang
Kung-Hsiang Huang
Nuan Wen
Shikhar Singh
Rujun Han
Nanyun Peng
78
32
0
13 Jan 2021
Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings
Yue Wang
Jing Li
Michael R. Lyu
Irwin King
98
17
0
03 Nov 2020