Online Multi-modal Person Search in Videos


8 August 2020
J. Xia
Anyi Rao
Qingqiu Huang
Linning Xu
Jiangtao Wen
Dahua Lin

Papers citing "Online Multi-modal Person Search in Videos"

14 / 14 papers shown
Generative AI for Film Creation: A Survey of Recent Advances
Ruihan Zhang
Borou Yu
Jiajian Min
Yetong Xin
Zheng Wei
...
Sijia Jiang
Peiwen Huang
Na Chen
Xuanxuan Liu
Anyi Rao
VGen
57
0
0
11 Apr 2025
Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges
Victoria Mingote
Alfonso Ortega
A. Miguel
Eduardo Lleida
22
0
0
09 Sep 2024
Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Zheng Liu
Yiwei Li
Qian Cao
Junwen Chen
Tianze Yang
...
John Gibbs
Khaled Rasheed
Ninghao Liu
Gengchen Mai
Tianming Liu
AI4CE
36
10
0
30 Oct 2023
Movie Genre Classification by Language Augmentation and Shot Sampling
Zhongping Zhang
Yiwen Gu
Bryan A. Plummer
Xin Miao
Jiayi Liu
Huayan Wang
VLM
CLIP
11
1
0
24 Mar 2022
AVA-AVD: Audio-Visual Speaker Diarization in the Wild
Eric Z. Xu
Zeyang Song
Satoshi Tsutsui
C. Feng
Mang Ye
Mike Zheng Shou
VGen
16
42
0
29 Nov 2021
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Bernard Ghanem
VGen
29
28
0
12 Sep 2021
Category-Level 6D Object Pose Estimation via Cascaded Relation and Recurrent Reconstruction Networks
Jiaze Wang
Kai-xiang Chen
Qi Dou
3DPC
73
99
0
19 Aug 2021
Learning to Cut by Watching Movies
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Bernard Ghanem
VGen
43
20
0
09 Aug 2021
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Ruijie Tao
Zexu Pan
Rohan Kumar Das
Xinyuan Qian
Mike Zheng Shou
Haizhou Li
15
172
0
14 Jul 2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
Andrew Brown
Vicky Kalogeiton
Andrew Zisserman
CVBM
15
29
0
20 May 2021
HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
Jongyoun Noh
Sanghoon Lee
Bumsub Ham
3DPC
16
126
0
02 Apr 2021
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao
Jiaze Wang
Linning Xu
Xuekun Jiang
Qingqiu Huang
Bolei Zhou
Dahua Lin
18
60
0
08 Aug 2020
MovieNet: A Holistic Dataset for Movie Understanding
Qingqiu Huang
Yu Xiong
Anyi Rao
Jiaze Wang
Dahua Lin
VGen
23
234
0
21 Jul 2020
Learn to Propagate Reliably on Noisy Affinity Graphs
Lei Yang
Qingqiu Huang
Huaiyi Huang
Linning Xu
Dahua Lin
GNN
27
13
0
17 Jul 2020