arXiv: 2107.08430

YOLOX: Exceeding YOLO Series in 2021

18 July 2021

ArXiv (abs) | PDF | HTML | GitHub (9857★)

Papers citing "YOLOX: Exceeding YOLO Series in 2021"

50 / 869 papers shown

Concept-based Explainable Data Mining with VLM for 3D Detection

222

0

0

05 Dec 2025

From Detection to Association: Learning Discriminative Object Embeddings for Multi-Object Tracking

360

0

0

02 Dec 2025

SelfMOTR: Revisiting MOTR with Self-Generating Detection Priors

Johannes Stegmaier

447

0

0

25 Nov 2025

StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections

Matvei Shelukhan

Karina Kvanchiani

462

0

0

25 Nov 2025

A Tri-Modal Dataset and a Baseline System for Tracking Unmanned Aerial Vehicles

136

1

0

23 Nov 2025

OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding

110

0

0

21 Nov 2025

MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots

Javier Alonso-Mora

170

0

0

21 Nov 2025

Real-Time 3D Object Detection with Inference-Aligned Learning

266

0

0

20 Nov 2025

PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation

Information Fusion (Inf. Fusion), 2025

321

0

0

20 Nov 2025

Enhancing Multi-Camera Gymnast Tracking Through Domain Knowledge Integration

137

7

0

20 Nov 2025

Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection

Spyridon Loukovitis

Vasileios Karampinis

Athanasios Voulodimos

115

0

0

19 Nov 2025

PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking

149

0

0

17 Nov 2025

Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images

264

0

0

13 Nov 2025

On the Interplay between Positional Encodings, Morphological Complexity, and Word Order Flexibility

Kushal Tatariya

Miryam de Lhoneux

137

0

0

11 Nov 2025

Zero-Shot Multi-Animal Tracking in the Wild

Jan Frederik Meier

158

0

0

04 Nov 2025

Contrast-Guided Cross-Modal Distillation for Thermal Object Detection

206

0

0

03 Nov 2025

World Simulation with Video Foundation Models for Physical AI

...

633

53

0

28 Oct 2025

DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios

147

0

0

27 Oct 2025

Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists

Eduardo R. Corral-Soto

158

1

0

23 Oct 2025

Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges

204

0

0

23 Oct 2025

ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection

233

1

0

17 Oct 2025

Valeo Near-Field: a novel dataset for pedestrian intent detection

Antonyo Musabini

Rachid Benmokhtar

Jagdish Bhanushali

Bertrand Luvison

Xavier Perrotton

150

0

0

17 Oct 2025

DMTrack: Deformable State-Space Modeling for UAV Multi-Object Tracking with Kalman Fusion and Uncertainty-Aware Association

152

0

0

15 Oct 2025

An Analytical Framework to Enhance Autonomous Vehicle Perception for Smart Cities

Hesham El-Sayed

116

1

0

15 Oct 2025

DEF-YOLO: Leveraging YOLO for Concealed Weapon Detection in Thermal Imaging

Arnav Ramamoorthy

158

1

0

15 Oct 2025

SpikePool: Event-driven Spiking Transformer with Pooling Attention

Priyadarshini Panda

126

0

0

14 Oct 2025

MultiFoodhat: A potential new paradigm for intelligent food quality inspection

170

0

0

14 Oct 2025

Adap-RPF: Adaptive Trajectory Sampling for Robot Person Following in Dynamic Crowded Environments

142

0

0

13 Oct 2025

Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking

Charalambos Poullis

294

0

0

10 Oct 2025

PRNet: Original Information Is All You Have

110

1

0

10 Oct 2025

SPICE: Simple and Practical Image Clarification and Enhancement

Alexander Belyaev

Pierre-Alain Fayolle

137

0

0

09 Oct 2025

Explaining raw data complexity to improve satellite onboard processing

Marjorie Bellizzi

Benjamin Francesconi

173

0

0

08 Oct 2025

StereoSync: Spatially-Aware Stereo Audio Generation from Video

Christian Marinoni

R. F. Gramaccioni

Takashi Shibuya

Danilo Comminiello

135

2

0

07 Oct 2025

Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests

186

0

0

01 Oct 2025

FracDetNet: Advanced Fracture Detection via Dual-Focus Attention and Multi-scale Calibration in Medical X-ray Imaging

112

0

0

27 Sep 2025

Real-Time Object Detection Meets DINOv3

ObjD 3DH PINN VLM

542

17

0

25 Sep 2025

CompressAI-Vision: Open-source software to evaluate compression methods for computer vision tasks

Chris Rosewarne

305

2

0

25 Sep 2025

X-Streamer: Unified Human World Modeling with Audiovisual Interaction

318

8

0

25 Sep 2025

Punching Above Precision: Small Quantized Model Distillation with Learnable Regularizer

Md Abdur Rahaman

M. J. Aashik Rasool

165

2

0

25 Sep 2025

Visual Detector Compression via Location-Aware Discriminant Analysis

138

2

0

22 Sep 2025

SFN-YOLO: Towards Free-Range Poultry Detection via Scale-aware Fusion Networks

81

0

0

21 Sep 2025

Task-Aware Image Signal Processor for Advanced Visual Perception

168

0

0

17 Sep 2025

VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement

196

0

0

17 Sep 2025

Multi-animal tracking in Transition: Comparative Insights into Established and Emerging Methods

Smart Agricultural Technology (SAT), 2025

Anne Marthe Sophie Ngo Bibinbe

Jamie Ahloy-Dallaire

262

0

0

15 Sep 2025

Motion Estimation for Multi-Object Tracking using KalmanNet with Semantic-Independent Encoding

227

0

0

14 Sep 2025

An HMM-based framework for identity-aware long-term multi-object tracking from sparse and uncertain identification: use case on long-term tracking in livestock

Anne Marthe Sophie Ngo Bibinbe

Jamie Ahloy-Dallaire

269

0

0

12 Sep 2025

Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation

253

0

0

11 Sep 2025

Model-Agnostic Open-Set Air-to-Air Visual Object Detection for Reliable UAV Perception

Spyridon Loukovitis

Anastasios Arsenos

Vasileios Karampinis

Athanasios Voulodimos

132

2

0

11 Sep 2025

GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts

Patsorn Sangkloy

195

1

0

10 Sep 2025

TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery

295

0

0

07 Sep 2025
