1907.09702
Cited By
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
IEEE International Conference on Computer Vision (ICCV), 2019
23 July 2019
Tianwei Lin
Xiao-Chang Liu
Xin Li
Errui Ding
Shilei Wen
Papers citing
"BMN: Boundary-Matching Network for Temporal Action Proposal Generation"
50 / 332 papers shown
TBT-Former: Learning Temporal Boundary Distributions for Action Localization
Thisara Rathnayaka
Uthayasanker Thayasivam
167
0
0
01 Dec 2025
Structured Context Learning for Generic Event Boundary Detection
Xin Gu
Congcong Li
Xinyao Wang
Dexiang Hong
Libo Zhang
Tiejian Luo
Longyin Wen
Heng Fan
72
0
0
29 Nov 2025
MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection
H. Lu
Yi Yu
Shijian Lu
Deepu Rajan
Boon Poh Ng
Alex Chichung Kot
Xudong Jiang
Mamba
198
0
0
22 Nov 2025
Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization
Ashutosh Anshul
Shreyas Gopal
D. Rajan
Eng Siong Chng
83
0
0
13 Nov 2025
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
193
0
0
06 Nov 2025
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
Dunjie Lu
Yiheng Xu
Junli Wang
Haoyuan Wu
Xinyuan Wang
...
Yuchen Mao
J. Zhou
Junyang Lin
Binyuan Hui
Tao Yu
137
2
0
22 Oct 2025
Distributed Algorithms for Multi-Agent Multi-Armed Bandits with Collision
Daoyuan Zhou
Xuchuang Wang
L. Yang
Yang Gao
159
1
0
08 Oct 2025
Open-ended Hierarchical Streaming Video Understanding with Vision Language Models
Hyolim Kang
Yunsu Park
Youngbeom Yoo
Yeeun Choi
Seon Joo Kim
AI4TS
VLM
142
3
0
15 Sep 2025
ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Xin Zhang
Jiaming Chu
Jian-jun Zhao
Yuchu Jiang
Xu Yang
Lei Jin
Chi Zhang
Xuelong Li
115
0
0
24 Aug 2025
Generative Model-Based Feature Attention Module for Video Action Analysis
G. Wang
Peng Zhao
Cong Zhao
Jing Huang
Siyan Guo
Shusen Yang
123
0
0
19 Aug 2025
Generic Event Boundary Detection via Denoising Diffusion
Jaejun Hwang
Dayoung Gong
Manjin Kim
Minsu Cho
DiffM
133
0
0
16 Aug 2025
Pindrop it! Audio and Visual Deepfake Countermeasures for Robust Detection and Fine Grained-Localization
Nicholas Klein
Hemlata Tak
James Fullwood
Krishna Regmi
Leonidas Spinoulas
Ganesh Sivaraman
Tianxiang Chen
Elie Khoury
AAML
208
0
0
11 Aug 2025
Localizing Audio-Visual Deepfakes via Hierarchical Boundary Modeling
Xuanjun Chen
Shih-Peng Cheng
Jiawei Du
Lin Zhang
Xiaoxiao Miao
Chung-Che Wang
Haibin Wu
Hung-yi Lee
Jyh-Shing Roger Jang
227
1
0
04 Aug 2025
Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
Zefeng Qian
Xincheng Yao
Yifei Huang
Chongyang Zhang
Jiangyong Ying
Hong Sun
242
1
0
22 Jul 2025
ESG-Net: Event-Aware Semantic Guided Network for Dense Audio-Visual Event Localization
Huilai Li
Yonghao Dang
Ying Xing
Yiming Wang
Jianqin Yin
183
0
0
14 Jul 2025
PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning
Yizhe Li
Sanping Zhou
Zheng Qin
Le Wang
ViT
189
0
0
19 Jun 2025
Action Dubber: Timing Audible Actions via Inflectional Flow
Wenlong Wan
Weiying Zheng
Tianyi Xiang
Guiqing Li
Shengfeng He
174
0
0
16 Jun 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Computer Vision and Pattern Recognition (CVPR), 2025
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
355
1
0
09 May 2025
Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment
João Alves
Pia Haubro Andersen
Rikke Gade
158
0
0
06 May 2025
Deep Learning for Sports Video Event Detection: Tasks, Datasets, Methods, and Challenges
Hao Xu
Arbind Agrahari Baniya
Sam Well
Mohamed Reda Bouadjenek
Richard Dazeley
S. Aryal
AI4TS
330
3
0
06 May 2025
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection
Weijun Zhuang
Qizhang Li
Xin Li
Ming-Yu Liu
Xiaopeng Hong
Feng Gao
Fan Yang
W. Zuo
266
1
0
20 Apr 2025
F³Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos
International Conference on Learning Representations (ICLR), 2025
Zhaoyu Liu
Kan Jiang
Murong Ma
Zhe Hou
Yun Lin
Jin Song Dong
296
3
0
11 Apr 2025
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection
International Conference on Intelligent Computing (ICIC), 2025
Xinnan Zhu
Yicheng Zhu
Tixin Chen
Wentao Wu
Yuanjie Dang
295
1
0
01 Apr 2025
Temporal Action Detection Model Compression by Progressive Block Drop
Computer Vision and Pattern Recognition (CVPR), 2025
Xiaoyong Chen
Yong Guo
Jiaming Liang
Sitong Zhuang
Runhao Zeng
Xiping Hu
302
1
0
21 Mar 2025
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
Chen-Da Liu-Zhang
Lin Sui
Shuming Liu
Fangzhou Mu
Ziyi Wang
Bernard Ghanem
313
3
0
09 Mar 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
297
6
0
27 Feb 2025
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
Arkaprava Sinha
Monish Soundar Raj
Pu Wang
Ahmed Helmy
Srijan Das
Mamba
576
5
0
10 Jan 2025
Action-Agnostic Point-Level Supervision for Temporal Action Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shuhei M. Yoshida
Takashi Shibata
M. Terao
Takayuki Okatani
Masashi Sugiyama
268
3
0
31 Dec 2024
DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization
C. Koutlis
Symeon Papadopoulos
418
7
0
15 Nov 2024
PESFormer: Boosting Macro- and Micro-expression Spotting with Direct Timestamp Encoding
Wang-Wang Yu
Kai-Fu Yang
Xiangrui Hu
Jingwen Jiang
Hong-Mei Yan
Yong-Jie Li
191
0
0
24 Oct 2024
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
Ning Wang
Yun Xiao
Xiaopeng Peng
Xiaojun Chang
Xuanhong Wang
Dingyi Fang
363
4
0
20 Oct 2024
Zero-shot Action Localization via the Confidence of Large Vision-Language Models
Josiah Aklilu
Xiaohan Wang
Serena Yeung-Levy
328
1
0
18 Oct 2024
Temporal2Seq: A Unified Framework for Temporal Video Understanding Tasks
Min Yang
Zichen Zhang
Limin Wang
AI4TS
218
0
0
27 Sep 2024
HAVANA: Hierarchical stochastic neighbor embedding for Accelerated Video ANnotAtions
Alexandru Bobe
Jan van Gemert
200
0
0
16 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
244
0
0
06 Sep 2024
Open-Vocabulary Action Localization with Iterative Visual Prompting
IEEE Access, 2024
Naoki Wake
Atsushi Kanehira
Kazuhiro Sasabuchi
Jun Takamatsu
Katsushi Ikeuchi
VLM
368
1
0
30 Aug 2024
Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
International Conference on Pattern Recognition (ICPR), 2024
Jia-Run Du
Kun-Yu Lin
Jingke Meng
Wei-Shi Zheng
218
0
0
25 Aug 2024
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
ACM Multimedia (MM), 2024
Geuntaek Lim
Hyunwoo Kim
Joonsoo Kim
Yukyung Choi
286
7
0
12 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
European Conference on Computer Vision (ECCV), 2024
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
241
3
0
06 Aug 2024
SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting
Yicheng Deng
Hideaki Hayashi
Hajime Nagahara
288
3
0
30 Jul 2024
Harnessing Temporal Causality for Advanced Temporal Action Detection
Shuming Liu
Lin Sui
Chen-Da Liu-Zhang
Fangzhou Mu
Chen Zhao
Guohao Li
CML
312
5
0
25 Jul 2024
Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization
Junyan Wu
Wei Lu
Xiangyang Luo
Rui Yang
Qian Wang
Xiaochun Cao
249
11
0
23 Jul 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
169
1
0
17 Jul 2024
Rethinking the Architecture Design for Efficient Generic Event Boundary Detection
Ziwei Zheng
Zechuan Zhang
Yulin Wang
Shiji Song
Gao Huang
Le Yang
182
6
0
17 Jul 2024
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
Feixiang Zhou
Bryan M. Williams
Hossein Rahmani
209
3
0
10 Jul 2024
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
Jeongseok Hyun
Su Ho Han
Hyolim Kang
Joon-Young Lee
Seon Joo Kim
VLM
269
3
0
09 Jul 2024
MMAD: Multi-label Micro-Action Detection in Videos
Kun Li
Pengyu Liu
D. Guo
Guoliang Chen
Zhiliang Wu
Hehe Fan
Meng Wang
500
19
0
07 Jul 2024
Fine-grained Dynamic Network for Generic Event Boundary Detection
Ziwei Zheng
Lijun He
Le Yang
Fan Li
195
2
0
05 Jul 2024
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Le Yang
Ziwei Zheng
Yizeng Han
Hao-Ran Cheng
Shiji Song
Gao Huang
Fan Li
299
21
0
03 Jul 2024
The Solution for Temporal Sound Localisation Task of ICCV 1st Perception Test Challenge 2023
Yurui Huang
Yang Yang
Shou Chen
Xiangyu Wu
Qingguo Chen
Jianfeng Lu
169
0
0
01 Jul 2024