Dense Regression Network for Video Grounding

Computer Vision and Pattern Recognition (CVPR), 2020
7 April 2020
Runhao Zeng
Haoming Xu
Wenbing Huang
Peihao Chen
Zhuliang Yu
Chuang Gan

Papers citing "Dense Regression Network for Video Grounding"

50 / 170 papers shown
Who Can We Trust? Scope-Aware Video Moment Retrieval with Multi-Agent Conflict
Chaochen Wu
Guan Luo
Meiyun Zuo
Zhitao Fan
173
0
0
01 Nov 2025
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An
Kristen Grauman
Mamba
295
0
0
27 Oct 2025
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning
Zhengxuan Wei
Jiajin Tang
Sibei Yang
VLM
198
1
0
22 Oct 2025
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Zhuo Cao
Heming Du
Bingqing Zhang
Xin Yu
Xue Li
Sen Wang
159
1
0
20 Oct 2025
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Shraman Pramanick
E. Mavroudi
Yale Song
Rama Chellappa
Lorenzo Torresani
Triantafyllos Afouras
272
3
0
19 Oct 2025
From Learning to Mastery: Achieving Safe and Efficient Real-World Autonomous Driving with Human-In-The-Loop Reinforcement Learning
Li Zeqiao
Wang Yijing
Wang Haoyu
Li Zheng
Li Peng
Liu Wenfei
Zuo Zhiqiang
256
6
0
07 Oct 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
Jiajin Tang
Zhengxuan Wei
Yuchen Zhu
Cheng Shi
Guanbin Li
Guanbin Li
Sibei Yang
PINN
357
3
0
28 Sep 2025
TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs
Yunheng Li
Jing Cheng
Shaoyong Jia
Hangyi Kuang
Shaohui Jiao
Qibin Hou
Ming-Ming Cheng
AI4TS
VLM
312
9
0
22 Sep 2025
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan
Fabian Caba Heilbron
Bernard Ghanem
Josef Sivic
Bryan C. Russell
209
1
0
16 Sep 2025
OVG-HQ: Online Video Grounding with Hybrid-modal Queries
Runhao Zeng
Jiaqi Mao
Minghao Lai
Minh Hieu Phan
Yanjie Dong
Wei Wang
Qi Chen
Xiping Hu
187
0
0
16 Aug 2025
TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding
Chaohong Guo
Xun Mo
Yongwei Nie
Xuemiao Xu
Chao Xu
Fei Richard Yu
Chengjiang Long
LRM
250
3
0
11 Aug 2025
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
Computer Vision and Pattern Recognition (CVPR), 2025
Zijia Lu
A S M Iftekhar
Gaurav Mittal
Tianjian Meng
Xiawei Wang
Cheng Zhao
Rohith Kukkala
Ehsan Elhamifar
Mei Chen
301
3
0
22 May 2025
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection
Weijun Zhuang
Qizhang Li
Xin Li
Ming-Yu Liu
Xiaopeng Hong
Feng Gao
Fan Yang
W. Zuo
313
1
0
20 Apr 2025
Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval
WonJun Moon
Cheol-Ho Cho
Woojin Jun
Minho Shim
Taeoh Kim
Inwoong Lee
Dongyoon Wee
Jae-Pil Heo
372
4
0
17 Apr 2025
Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking
H. Tran
Tinh-Anh Nguyen-Nhu
Huu-Phong Phan-Nguyen
T. Nguyen
Nhat-Minh Nguyen-Dich
Anh Dao
Huy-Duc Do
Quan Nguyen
Hoang M. Le
Quang-Vinh Dinh
279
3
0
11 Apr 2025
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
Computer Vision and Pattern Recognition (CVPR), 2025
Hao Du
Bo Wu
Yan Lu
Zhendong Mao
267
2
0
08 Apr 2025
MCAT: Visual Query-Based Localization of Standard Anatomical Clips in Fetal Ultrasound Videos Using Multi-Tier Class-Aware Token Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2025
Divyanshu Mishra
Pramit Saha
He Zhao
Netzahualcoyotl Hernandez-Cruz
Olga Patey
A. Papageorghiou
J. A. Noble
213
1
0
08 Apr 2025
Learning Activity View-invariance Under Extreme Viewpoint Changes via Curriculum Knowledge Distillation
Arjun Somayazulu
E. Mavroudi
Changan Chen
Lorenzo Torresani
Kristen Grauman
226
1
0
07 Apr 2025
OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
Jiali Yao
Xinran Deng
Xin Gu
Mengrui Dai
Bing Fan
Zhipeng Zhang
Yan Huang
Heng Fan
L. Zhang
447
5
0
13 Mar 2025
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
Chen-Da Liu-Zhang
Lin Sui
Shuming Liu
Fangzhou Mu
Ziyi Wang
Bernard Ghanem
384
4
0
09 Mar 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
411
5
0
18 Jan 2025
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhuo Cao
Bingqing Zhang
Heming Du
Xin Yu
Xue Li
Sen Wang
386
20
0
18 Dec 2024
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval
Dhiman Paul
Md Rizwan Parvez
Nabeel Mohammed
Shafin Rahman
VGen
395
5
0
02 Dec 2024
Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the Wild
Peijun Bao
Chenqi Kong
Zihao Shao
Boon Poh Ng
Meng Hwa Er
Alex C. Kot
353
4
0
01 Dec 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Computer Vision and Pattern Recognition (CVPR), 2024
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Lin Wang
Joey Tianyi Zhou
Chen Chen
LRM
470
14
0
15 Nov 2024
ActPrompt: In-Domain Feature Adaptation via Action Cues for Video Temporal Grounding
Yubin Wang
Xinyang Jiang
De Cheng
Dongsheng Li
Cairong Zhao
VLM
270
2
0
13 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
ACM Multimedia (MM), 2024
Chaolei Tan
Zihang Lin
Junfu Pu
Chen Ma
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
463
3
0
03 Aug 2024
Temporally Grounding Instructional Diagrams in Unconstrained Videos
Jiahao Zhang
Frederic Z. Zhang
Cristian Rodriguez
Yizhak Ben-Shabat
A. Cherian
Stephen Gould
402
4
0
16 Jul 2024
Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment
Hao Fei
Shengqiong Wu
Meishan Zhang
Hao Fei
Tat-Seng Chua
Shuicheng Yan
AI4TS
310
72
0
27 Jun 2024
Chrono: A Simple Blueprint for Representing Time in MLLMs
Boris Meinardus
Anil Batra
Anna Rohrbach
Marcus Rohrbach
MLLM
VLM
666
4
0
26 Jun 2024
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval
Weitong Cai
Jiabo Huang
Shaogang Gong
Hailin Jin
Yang Liu
263
11
0
25 Jun 2024
Localizing Events in Videos with Multimodal Queries
Computer Vision and Pattern Recognition (CVPR), 2024
Gengyuan Zhang
Mang Ling Ada Fok
Yan Xia
Yansong Tang
Zorah Lähner
Juil Sock
Volker Tresp
Jindong Gu
405
7
0
14 Jun 2024
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Daizong Liu
Yang Liu
Wencan Huang
Wei Hu
LM&Ro
422
35
0
09 Jun 2024
Simplify Implant Depth Prediction as Video Grounding: A Texture Perceive Implant Depth Prediction Network
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Xinquan Yang
Xuguang Li
Xiaoling Luo
Leilei Zeng
Yudi Zhang
Linlin Shen
Yongqiang Deng
MedIm
210
3
0
07 Jun 2024
Video Anomaly Detection in 10 Years: A Survey and Outlook
Moshira Abdalla
Sajid Javed
Muaz Al Radi
Anwaar Ulhaq
Naoufel Werghi
372
37
0
29 May 2024
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
Jin Yang
Ping Wei
Huan Li
Ziyang Ren
292
33
0
14 Apr 2024
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng
Yujie Zhong
Chengjian Feng
Lin Ma
573
16
0
07 Apr 2024
SnAG: Scalable and Accurate Video Grounding
Computer Vision and Pattern Recognition (CVPR), 2024
Fangzhou Mu
Sicheng Mo
Yin Li
404
34
0
02 Apr 2024
SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding
Wenrui Li
Xiaopeng Hong
Ruiqin Xiong
Xiaopeng Fan
Mamba
399
29
0
01 Apr 2024
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
Ahmad A Mahmood
Ashmal Vayani
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
LRM
546
13
0
21 Mar 2024
Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding
Jingjing Hu
Dan Guo
Kun Li
Zhan Si
Xun Yang
Xiaojun Chang
Meng Wang
376
23
0
21 Mar 2024
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Computer Vision and Pattern Recognition (CVPR), 2024
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
418
10
0
18 Mar 2024
Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting
Zhuliang Yu
Guohao Chen
Jiaxiang Wu
Yifan Zhang
Yaofo Chen
Peilin Zhao
Shuaicheng Niu
TTA
OOD
391
13
0
18 Mar 2024
Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement
Danyang Hou
Liang Pang
Huawei Shen
Xueqi Cheng
377
9
0
21 Feb 2024
Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in Video
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zhaobo Qi
Yibo Yuan
Xiaowen Ruan
Shuhui Wang
Weigang Zhang
Qingming Huang
351
16
0
15 Jan 2024
Commonsense for Zero-Shot Natural Language Video Localization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Meghana Holla
Ismini Lourentzou
388
5
0
29 Dec 2023
Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos
Houlun Chen
Xin Wang
Hong Chen
Zihan Song
Jia Jia
Wenwu Zhu
LRM
271
19
0
28 Dec 2023
LLM4VG: Large Language Models Evaluation for Video Grounding
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Zihan Song
Yuwei Zhou
Wenwu Zhu
437
11
0
21 Dec 2023
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding
Haifeng Huang
Yang Zhao
Zehan Wang
Yan Xia
Zhou Zhao
302
1
0
21 Dec 2023
DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Thong Nguyen
Xiaobao Wu
Xinshuai Dong
Cong-Duy Nguyen
See-Kiong Ng
Anh Tuan Luu
316
11
0
05 Dec 2023