2005.03572
Cited By
v4 (latest)
Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation
7 May 2020
Zhaohui Zheng
Ping Wang
Dongwei Ren
Wei Liu
Rongguang Ye
Q. Hu
W. Zuo
VLM
Github (334★)
Papers citing
"Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation"
50 / 83 papers shown
SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards
Hunar Batra
Haoqin Tu
Hardy Chen
Yuanze Lin
Cihang Xie
Ronald Clark
OffRL
ReLM
LRM
423
7
0
10 Nov 2025
OOS-DSD: Improving Out-of-stock Detection in Retail Images using Auxiliary Tasks
Franko Šikić
Sven Lončarić
169
0
0
18 Oct 2025
TY-RIST: Tactical YOLO Tricks for Real-time Infrared Small Target Detection
Abdulkarim Atrash
Omar Moured
Yufan Chen
Jiaming Zhang
Seyda Ertekin
Omur Ugur
138
2
0
26 Sep 2025
TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery
Feng Shen
Jiaming Cui
Shuai Zhou
Wenqiang Li
Ruifeng Qin
305
0
0
07 Sep 2025
Two-Stage Framework for Efficient UAV-Based Wildfire Video Analysis with Adaptive Compression and Fire Source Detection
Yanbing Bai
Rui-Yang Ju
Lemeng Zhao
Junjie Hu
Jianchao Bi
Erick Mas
Shunichi Koshimura
189
1
0
22 Aug 2025
Inter-Class Relational Loss for Small Object Detection: A Case Study on License Plates
Dian Ning
Dong Seog Han
144
0
0
20 Aug 2025
LayoutRectifier: An Optimization-based Post-processing for Graphic Design Layout Generation
I-Chao Shen
Ariel Shamir
Takeo Igarashi
282
4
0
15 Aug 2025
Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking
Zewei Wu
César Teixeira
Wei Ke
Zhang Xiong
223
1
0
07 Aug 2025
Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering
Yan Gong
M. Ben-Chen
Hao Liu
Gao Yongsheng
Lei Yang
N. Wang
Ziying Song
Haoqun Ma
254
3
0
01 Aug 2025
A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles
Jiayuan Wang
Farhad Pourpanah
Q. M. Jonathan Wu
Ning Zhang
179
1
0
29 Jul 2025
Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection
Aayush Atul Verma
Arpitsinh Vaghela
Bharatesh Chakravarthi
Kaustav Chanda
Yezhou Yang
3DPC
164
0
0
20 Jul 2025
InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization
Haoyuan Liu
Hiroshi Watanabe
253
3
0
16 Jul 2025
When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish Tracking
Weiran Li
Yeqiang Liu
Qiannan Guo
Yijie Wei
Hwa Liang Leo
Zhenbo Li
247
1
0
08 Jul 2025
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
Lidong Lu
Guo Chen
Ruoyao Xiao
Yicheng Liu
Tong Lu
VLM
LRM
477
10
0
05 Jun 2025
Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections
Xiao Ni
Carsten Kuehnel
Xiaoyi Jiang
484
1
0
13 May 2025
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
409
0
0
09 May 2025
Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning
Journal of systems architecture (JSA), 2025
Yuanbing Ouyang
Yizhuo Liang
Qingpeng Li
Xinfei Guo
Yiming Luo
Di Wu
Hao Wang
Yushan Pan
ViT
VLM
347
0
0
25 Apr 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
443
2
0
23 Apr 2025
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity Recognition
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Jian Zhang
Tianqing Zhang
Qi Li
Hongwei Wang
258
0
0
19 Apr 2025
Light-YOLOv8-Flame: A Lightweight High-Performance Flame Detection Algorithm
Jiawei Lan
Zhibiao Wang
Haoyang Yu
Ye Tao
Wenhua Cui
490
3
0
11 Apr 2025
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset
Ling Feng
Tianyu Xie
Wei Ma
Ruijie Fu
Yujiao Shi
Jun Li
Bei Zhou
204
0
0
27 Mar 2025
Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks
Lukás Gajdosech
Hassan Ali
Jan-Gerrit Habekost
Martin Madaras
Matthias Kerzel
Stefan Wermter
411
0
0
06 Mar 2025
Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection
Riccardo De Monte
Davide Dalle Pezze
Gian Antonio Susto
CLL
394
2
0
06 Mar 2025
DashCop: Automated E-ticket Generation for Two-Wheeler Traffic Violations Using Dashcam Videos
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Deepti Rawat
Keshav Gupta
Aryamaan Basu Roy
Ravi Kiran Sarvadevabhatla
382
1
0
01 Mar 2025
Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training
Signal, Image and Video Processing (SIVP), 2024
Zi-Wei Sun
Ze-Xi Hua
Heng-Chao Li
Yan Li
313
1
0
09 Dec 2024
HyperDefect-YOLO: Enhance YOLO with HyperGraph Computation for Industrial Defect Detection
Zuo Zuo
Jiahao Dong
Yue Gao
Zongze Wu
282
2
0
05 Dec 2024
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images
Lars Nieradzik
Henrike Stephani
Jördis Sieburg-Rockel
Stephanie Helmling
Andrea Olbrich
Stephanie Wrage
J. Keuper
290
3
0
18 Nov 2024
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
Zhenyu Yan
Jiangming Wang
Aoqiang Wang
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
410
2
0
13 Oct 2024
Accelerating Non-Maximum Suppression: A Graph Theory Perspective
Neural Information Processing Systems (NeurIPS), 2024
King-Siong Si
Lu Sun
Weizhan Zhang
Tieliang Gong
Jiahao Wang
Jiang Liu
Hao Sun
181
7
0
30 Sep 2024
TSdetector: Temporal-Spatial Self-correction Collaborative Learning for Colonoscopy Video Detection
Kaini Wang
Haolin Wang
Guang-Quan Zhou
Yangang Wang
Ling Yang
Yang Chen
Shuo Li
250
0
0
30 Sep 2024
YOLOv8-ResCBAM: YOLOv8 Based on An Effective Attention Module for Pediatric Wrist Fracture Detection
Rui-Yang Ju
Chun-Tse Chien
Jen-Shiun Chiang
233
12
0
27 Sep 2024
AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way
Sining Huang
Yukun Song
Yixiao Kang
Chang Yu
3DH
326
24
0
22 Sep 2024
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
Computer Vision and Pattern Recognition (CVPR), 2024
Rolandos Alexandros Potamias
Jinglei Zhang
Jiankang Deng
Stefanos Zafeiriou
3DH
493
89
0
18 Sep 2024
When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking
Emirhan Bayar
Cemal Aker
VOT
344
2
0
10 Sep 2024
NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar
Runwei Guan
Tao Huang
Liye Jia
Haocheng Zhao
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yutao Yue
459
9
0
30 Aug 2024
CORT: Class-Oriented Real-time Tracking for Embedded Systems
Edoardo Cittadini
Alessandro De Siena
Giorgio Buttazzo
VOT
367
0
0
20 Jul 2024
Towards Reflected Object Detection: A Benchmark
Yiquan Wu
Zhongtian Wang
You Wu
Ling Huang
Hui Zhou
Shuiwang Li
ObjD
277
3
0
08 Jul 2024
Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection
Weiwei Duan
Luping Ji
Shengjia Chen
Sicheng Zhu
Mao Ye
423
59
0
11 Jun 2024
Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection
Zhengji Li
Xi Xiao
Jiacheng Xie
Yuxiao Fan
Wentao Wang
Gang Chen
Liqiang Zhang
Tianyang Wang
198
6
0
28 May 2024
A Multimodal Learning-based Approach for Autonomous Landing of UAV
Francisco Neves
Luís Branco
Maria Pereira
R. Claro
Andry Pinto
120
4
0
21 May 2024
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection
Image and Vision Computing (IVC), 2024
Siliang Ma
Yong Xu
352
8
0
16 May 2024
Real-time Lane-wise Traffic Monitoring in Optimal ROIs
Mei Qiu
Wei Lin
Lauren Christopher
S. Chien
Yaobin Chen
Shu Hu
231
5
0
29 Mar 2024
Cannabis Seed Variant Detection using Faster R-CNN
Toqi Tahamid Sarker
Taminul Islam
Khaled R Ahmed
175
4
0
15 Mar 2024
SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph
Zhuohang Jiang
Bingkui Tong
Xia Du
Ahmed Alhammadi
Jizhe Zhou
400
0
0
14 Mar 2024
Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenges
Daniel Jakab
B. Deegan
Sushil Sharma
E. Grua
Jonathan Horgan
Enda Ward
Pepijn Van De Ven
Anthony G. Scanlan
Ciarán Eising
265
21
0
19 Feb 2024
PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering
Rong Huang
Haichuan Lin
Chuanzhang Chen
Kang Zhang
Wei Zeng
203
47
0
30 Jan 2024
The Method of Detecting Flying Birds in Surveillance Video Based on Their Characteristics
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2024
Ziwei Sun
Zexi Hua
Hengchao Li
Yan Li
234
0
0
08 Jan 2024
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation
Baris Can Cam
Kemal Oksuz
Fehmi Kahraman
Z. S. Baltaci
Sinan Kalkan
Emre Akbas
191
0
0
28 Dec 2023
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities
Runwei Guan
Haocheng Zhao
Shanliang Yao
Ka Lok Man
Xiaohui Zhu
...
Yong Yue
Jeremy S. Smith
Eng Gee Lim
Weiping Ding
Yutao Yue
280
8
0
14 Dec 2023
Lightweight Full-Convolutional Siamese Tracker
Knowledge-Based Systems (KBS), 2023
Yunfeng Li
Bo Wang
Xueyi Wu
Zhuoyan Liu
Ye Li
360
26
0
09 Oct 2023