Dense Regression Network for Video Grounding

Computer Vision and Pattern Recognition (CVPR), 2020

7 April 2020

Chuang Gan

Papers citing "Dense Regression Network for Video Grounding"

50 / 170 papers shown

Who Can We Trust? Scope-Aware Video Moment Retrieval with Multi-Agent Conflict

177

01 Nov 2025

HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling

Joungbin An

Kristen Grauman

Mamba

297

27 Oct 2025

Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning

200

22 Oct 2025

When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions

166

20 Oct 2025

Enrich and Detect: Video Temporal Grounding with Multimodal LLMs

Triantafyllos Afouras

277

19 Oct 2025

From Learning to Mastery: Achieving Safe and Efficient Real-World Autonomous Driving with Human-In-The-Loop Reinforcement Learning

265

07 Oct 2025

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

366

28 Sep 2025

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

320

22 Sep 2025

ResidualViT for Efficient Temporally Dense Video Encoding

224

16 Sep 2025

OVG-HQ: Online Video Grounding with Hybrid-modal Queries

190

16 Aug 2025

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

251

11 Aug 2025

DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long VideosComputer Vision and Pattern Recognition (CVPR), 2025

318

22 May 2025

Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection

318

20 Apr 2025

Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval

381

17 Apr 2025

Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking

H. Tran

Tinh-Anh Nguyen-Nhu

Huu-Phong Phan-Nguyen

T. Nguyen

Nhat-Minh Nguyen-Dich

279

11 Apr 2025

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video SituationComputer Vision and Pattern Recognition (CVPR), 2025

271

08 Apr 2025

MCAT: Visual Query-Based Localization of Standard Anatomical Clips in Fetal Ultrasound Videos Using Multi-Tier Class-Aware Token TransformerAAAI Conference on Artificial Intelligence (AAAI), 2025

Divyanshu Mishra

Pramit Saha

He Zhao

Netzahualcoyotl Hernandez-Cruz

Olga Patey

A. Papageorghiou

J. A. Noble

214

08 Apr 2025

Learning Activity View-invariance Under Extreme Viewpoint Changes via Curriculum Knowledge Distillation

228

07 Apr 2025

OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding

448

13 Mar 2025

TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos

401

09 Mar 2025

LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection

413

18 Jan 2025

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

397

18 Dec 2024

VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval

405

02 Dec 2024

Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the Wild

362

01 Dec 2024

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel LevelComputer Vision and Pattern Recognition (CVPR), 2024

473

15 Nov 2024

ActPrompt: In-Domain Feature Adaptation via Action Cues for Video Temporal Grounding

Yubin Wang

277

13 Aug 2024

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and SynopsesACM Multimedia (MM), 2024

478

03 Aug 2024

Temporally Grounding Instructional Diagrams in Unconstrained Videos

Yizhak Ben-Shabat

408

16 Jul 2024

Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment

Hao Fei

Meishan Zhang

311

27 Jun 2024

Chrono: A Simple Blueprint for Representing Time in MLLMs

681

26 Jun 2024

MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval

Shaogang Gong

265

25 Jun 2024

Localizing Events in Videos with Multimodal QueriesComputer Vision and Pattern Recognition (CVPR), 2024

Yan Xia

Volker Tresp

409

14 Jun 2024

A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future DirectionsIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

Wei Hu

425

09 Jun 2024

Simplify Implant Depth Prediction as Video Grounding: A Texture Perceive Implant Depth Prediction NetworkInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024

Linlin Shen

216

07 Jun 2024

Video Anomaly Detection in 10 Years: A Survey and Outlook

373

29 May 2024

Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

294

14 Apr 2024

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Yingsen Zeng

Yujie Zhong

Chengjian Feng

Lin Ma

575

07 Apr 2024

SnAG: Scalable and Accurate Video GroundingComputer Vision and Pattern Recognition (CVPR), 2024

Fangzhou Mu

Sicheng Mo

Yin Li

425

02 Apr 2024

SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding

401

01 Apr 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding

Ahmad A Mahmood

Ashmal Vayani

Muzammal Naseer

Salman Khan

Fahad Shahbaz Khan

LRM

558

21 Mar 2024

Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding

Xiaojun Chang

Meng Wang

380

21 Mar 2024

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph GroundingComputer Vision and Pattern Recognition (CVPR), 2024

422

18 Mar 2024

Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting

392

18 Mar 2024

Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement

377

21 Feb 2024

Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in VideoAAAI Conference on Artificial Intelligence (AAAI), 2024

363

15 Jan 2024

Commonsense for Zero-Shot Natural Language Video LocalizationAAAI Conference on Artificial Intelligence (AAAI), 2023

Meghana Holla

Ismini Lourentzou

398

29 Dec 2023

Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos

277

28 Dec 2023

LLM4VG: Large Language Models Evaluation for Video Grounding

438

21 Dec 2023

Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding

Zhou Zhao

303

21 Dec 2023

DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language GroundingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

See-Kiong Ng

319

05 Dec 2023