The Multi-Modal Video Reasoning and Analyzing Competition
Haoran Peng
He Huang
Li Xu
Tianjiao Li
J. Liu
Hossein Rahmani
Qiuhong Ke
Zhicheng Guo
Cong Wu
Rongchang Li
Mang Ye
Jiahao Wang
Jiaxu Zhang
Yuanzhong Liu
Tao He
Fuwei Zhang
Xianbin Liu
Tao Lin

Abstract
In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summarize the top-performing methods submitted by the participants in this competition and show their results achieved in the competition.
View on arXivComments on this paper