Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1405.3531
Cited By
Return of the Devil in the Details: Delving Deep into Convolutional Nets
14 May 2014
Ken Chatfield
Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
FAtt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Return of the Devil in the Details: Delving Deep into Convolutional Nets"
50 / 547 papers shown
Title
Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters
Bartosz Ptak
Marek Kraft
24
0
0
28 Apr 2025
Acute Lymphoblastic Leukemia Diagnosis Employing YOLOv11, YOLOv8, ResNet50, and Inception-ResNet-v2 Deep Learning Models
Alaa Awad
Salah A. Aly
59
0
0
13 Feb 2025
Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga
Takara Taniguchi
Ryosuke Furuta
31
1
0
08 Oct 2024
CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning
Junghun Oh
Sungyong Baik
Kyoung Mu Lee
CLL
42
3
0
08 Oct 2024
Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Sung Won Han
SLR
50
0
0
12 Sep 2024
From Radiologist Report to Image Label: Assessing Latent Dirichlet Allocation in Training Neural Networks for Orthopedic Radiograph Classification
Jakub Olczak
Max Gordon
20
0
0
22 Aug 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
42
4
0
04 Jul 2024
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani
Amit Raj
Kevis-Kokitsi Maninis
Abhishek Kar
Yuanzhen Li
Michael Rubinstein
Deqing Sun
Leonidas J. Guibas
Justin Johnson
Varun Jampani
40
79
0
12 Apr 2024
Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization
Yuhang Li
Xin Dong
Chen Chen
Jingtao Li
Yuxin Wen
Michael Spranger
Lingjuan Lyu
DiffM
32
4
0
28 Mar 2024
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
52
51
0
28 Mar 2024
Block Selective Reprogramming for On-device Training of Vision Transformers
Sreetama Sarkar
Souvik Kundu
Kai Zheng
P. Beerel
37
2
0
25 Mar 2024
Federated Learning Method for Preserving Privacy in Face Recognition System
Enoch Solomon
Abraham Woubie
FedML
41
3
0
08 Mar 2024
An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains
George Eskandar
Chongzhe Zhang
Abhishek Kaushik
Karim Guirguis
Mohamed Sayed
Bin Yang
OOD
3DPC
51
9
0
27 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
32
40
0
15 Feb 2024
BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision
Xin Zhao
Shiyu Hu
Yipei Wang
Jing Zhang
Yimin Hu
...
Haibin Ling
Yin Li
Renshu Li
Kun Liu
Jiadong Li
AAML
43
12
0
07 Feb 2024
JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial Example
B. Tondi
Wei Guo
Mauro Barni
AAML
17
0
0
02 Jan 2024
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale Dataset
Lei Liu
Mengya Zhang
Chenglong Li
Chenglong Li
Jin Tang
33
4
0
22 Dec 2023
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang
Boyuan Yang
Wei Gao
32
18
0
21 Dec 2023
Simple Transferability Estimation for Regression Tasks
Cuong N. Nguyen
Phong Tran
L. Ho
Vu C. Dinh
Anh Tran
Tal Hassner
Cuong V Nguyen
17
2
0
01 Dec 2023
Just Add
π
π
π
! Pose Induced Video Transformers for Understanding Activities of Daily Living
Dominick Reilly
Srijan Das
ViT
38
17
0
30 Nov 2023
Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval
Xiu-Shen Wei
Yang Shen
Xuhao Sun
Peng Wang
Yuxin Peng
22
10
0
21 Nov 2023
Predicting Spine Geometry and Scoliosis from DXA Scans
A. Jamaludin
T. Kadir
E. Clark
Andrew Zisserman
MedIm
14
3
0
15 Nov 2023
PockEngine: Sparse and Efficient Fine-tuning in a Pocket
Ligeng Zhu
Lanxiang Hu
Ji Lin
Wei-Chen Wang
Wei-Ming Chen
Chuang Gan
Song Han
30
19
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
41
51
0
26 Oct 2023
End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition
Emilian-Claudiu Muanescu
Ruazvan-Alexandru Smuadu
Andrei-Marius Avram
Dumitru-Clementin Cercel
Florin-Catalin Pop
38
0
0
07 Oct 2023
Transferability of Representations Learned using Supervised Contrastive Learning Trained on a Multi-Domain Dataset
Alvin De Jun Tan
Clement Tan
C. Yeo
40
0
0
27 Sep 2023
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Yongyuan Li
Xiuyuan Qin
Chao Liang
Mingqiang Wei
27
3
0
14 Sep 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
27
5
0
17 Aug 2023
A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection in Industrial Time Series: Methods, Applications, and Directions
Peng Yan
Ahmed Abdulkadir
Paul-Philipp Luley
Matthias Rosenthal
Gerrit A. Schatte
Benjamin Grewe
Thilo Stadelmann
AI4TS
36
58
0
11 Jul 2023
Large-scale unsupervised audio pre-training for video-to-speech synthesis
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
37
3
0
27 Jun 2023
Do as I can, not as I get
Shangfei Zheng
Hongzhi Yin
Tong Chen
Quoc Viet Hung Nguyen
Wei Chen
Lei Zhao
26
1
0
17 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
33
4
0
15 Jun 2023
Diffusion Model for Dense Matching
Jisu Nam
Gyuseong Lee
Sunwoo Kim
Ines Hyeonsu Kim
Hyoungwon Cho
Seyeong Kim
Seung Wook Kim
DiffM
26
9
0
30 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
24
71
0
09 May 2023
Physical Knowledge Enhanced Deep Neural Network for Sea Surface Temperature Prediction
Yuxin Meng
Feng Gao
Eric Rigall
Ran Dong
Junyu Dong
Q. Du
29
20
0
19 Apr 2023
A Study on Bias and Fairness In Deep Speaker Recognition
Amirhossein Hajavi
Ali Etemad
27
2
0
14 Mar 2023
The challenge of representation learning: Improved accuracy in deep vision models does not come with better predictions of perceptual similarity
F. Günther
Marco Marelli
M. Petilli
SSL
15
0
0
13 Mar 2023
Scalable Weight Reparametrization for Efficient Transfer Learning
Byeonggeun Kim
Juntae Lee
Seunghan Yang
Simyung Chang
OffRL
16
0
0
26 Feb 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
26
8
0
25 Feb 2023
Blind Omnidirectional Image Quality Assessment: Integrating Local Statistics and Global Semantics
Wei Zhou
Zhou Wang
13
4
0
24 Feb 2023
Random Padding Data Augmentation
Nan Yang
Laicheng Zhong
Fan Huang
Dong Yuan
Wei Bao
28
2
0
17 Feb 2023
Semi-Supervised Visual Tracking of Marine Animals using Autonomous Underwater Vehicles
Levi Cai
Nathan McGuire
R. Hanlon
T. Mooney
Yogesh A. Girdhar
33
31
0
14 Feb 2023
LipFormer: Learning to Lipread Unseen Speakers based on Visual-Landmark Transformers
Feng Xue
Yu Li
Deyin Liu
Yincen Xie
Lin Wu
Richang Hong
41
12
0
04 Feb 2023
Does progress on ImageNet transfer to real-world datasets?
Alex Fang
Simon Kornblith
Ludwig Schmidt
VLM
29
34
0
11 Jan 2023
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng
Ziyang Chen
Andrew Owens
31
71
0
04 Jan 2023
Vocabulary-informed Zero-shot and Open-set Learning
Yanwei Fu
Xiaomei Wang
Hanze Dong
Yu-Gang Jiang
Meng Wang
Xiangyang Xue
Leonid Sigal
VLM
21
18
0
03 Jan 2023
Multi-view Tracking, Re-ID, and Social Network Analysis of a Flock of Visually Similar Birds in an Outdoor Aviary
Shiting Xiao
Yufu Wang
A. Perkes
Bernd Pfrommer
Marc F. Schmidt
Kostas Daniilidis
M. Badger
24
12
0
01 Dec 2022
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
J. Bornschein
Alexandre Galashov
Ross Hemsley
Amal Rannen-Triki
Yutian Chen
...
Angeliki Lazaridou
Yee Whye Teh
Andrei A. Rusu
Razvan Pascanu
MarcÁurelio Ranzato
OOD
VLM
AI4TS
39
17
0
15 Nov 2022
SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers
Alessandro Arezzo
Stefano Berretti
ViT
27
15
0
04 Nov 2022
Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval
Yufeng Shi
Xinge You
Jiamiao Xu
Feng Zheng
Qinmu Peng
Weihua Ou
9
0
0
26 Sep 2022
1
2
3
4
...
9
10
11
Next