ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1405.3531
  4. Cited By
Return of the Devil in the Details: Delving Deep into Convolutional Nets

Return of the Devil in the Details: Delving Deep into Convolutional Nets

14 May 2014
Ken Chatfield
Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
    FAtt
ArXivPDFHTML

Papers citing "Return of the Devil in the Details: Delving Deep into Convolutional Nets"

50 / 547 papers shown
Title
Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters
Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters
Bartosz Ptak
Marek Kraft
24
0
0
28 Apr 2025
Acute Lymphoblastic Leukemia Diagnosis Employing YOLOv11, YOLOv8, ResNet50, and Inception-ResNet-v2 Deep Learning Models
Acute Lymphoblastic Leukemia Diagnosis Employing YOLOv11, YOLOv8, ResNet50, and Inception-ResNet-v2 Deep Learning Models
Alaa Awad
Salah A. Aly
59
0
0
13 Feb 2025
Learning Gaussian Data Augmentation in Feature Space for One-shot Object
  Detection in Manga
Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga
Takara Taniguchi
Ryosuke Furuta
31
1
0
08 Oct 2024
CLOSER: Towards Better Representation Learning for Few-Shot
  Class-Incremental Learning
CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning
Junghun Oh
Sungyong Baik
Kyoung Mu Lee
CLL
42
3
0
08 Oct 2024
Universal Pooling Method of Multi-layer Features from Pretrained Models
  for Speaker Verification
Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Sung Won Han
SLR
50
0
0
12 Sep 2024
From Radiologist Report to Image Label: Assessing Latent Dirichlet
  Allocation in Training Neural Networks for Orthopedic Radiograph
  Classification
From Radiologist Report to Image Label: Assessing Latent Dirichlet Allocation in Training Neural Networks for Orthopedic Radiograph Classification
Jakub Olczak
Max Gordon
20
0
0
22 Aug 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and
  Accuracy Scaling
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
42
4
0
04 Jul 2024
Probing the 3D Awareness of Visual Foundation Models
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani
Amit Raj
Kevis-Kokitsi Maninis
Abhishek Kar
Yuanzhen Li
Michael Rubinstein
Deqing Sun
Leonidas J. Guibas
Justin Johnson
Varun Jampani
40
79
0
12 Apr 2024
Is Synthetic Image Useful for Transfer Learning? An Investigation into
  Data Generation, Volume, and Utilization
Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization
Yuhang Li
Xin Dong
Chen Chen
Jingtao Li
Yuxin Wen
Michael Spranger
Lingjuan Lyu
DiffM
32
4
0
28 Mar 2024
Tiny Machine Learning: Progress and Futures
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
52
51
0
28 Mar 2024
Block Selective Reprogramming for On-device Training of Vision
  Transformers
Block Selective Reprogramming for On-device Training of Vision Transformers
Sreetama Sarkar
Souvik Kundu
Kai Zheng
P. Beerel
37
2
0
25 Mar 2024
Federated Learning Method for Preserving Privacy in Face Recognition
  System
Federated Learning Method for Preserving Privacy in Face Recognition System
Enoch Solomon
Abraham Woubie
FedML
41
3
0
08 Mar 2024
An Empirical Study of the Generalization Ability of Lidar 3D Object
  Detectors to Unseen Domains
An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains
George Eskandar
Chongzhe Zhang
Abhishek Kaushik
Karim Guirguis
Mohamed Sayed
Bin Yang
OOD
3DPC
51
9
0
27 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for
  Semantically-Consistent Text-to-Image Personalization
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
32
40
0
15 Feb 2024
BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for
  Robust Vision
BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision
Xin Zhao
Shiyu Hu
Yipei Wang
Jing Zhang
Yimin Hu
...
Haibin Ling
Yin Li
Renshu Li
Kun Liu
Jiadong Li
AAML
43
12
0
07 Feb 2024
JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial
  Example
JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial Example
B. Tondi
Wei Guo
Mauro Barni
AAML
17
0
0
02 Jan 2024
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A
  Large-Scale Dataset
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale Dataset
Lei Liu
Mengya Zhang
Chenglong Li
Chenglong Li
Jin Tang
33
4
0
22 Dec 2023
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic
  Tensor Selection
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang
Boyuan Yang
Wei Gao
32
18
0
21 Dec 2023
Simple Transferability Estimation for Regression Tasks
Simple Transferability Estimation for Regression Tasks
Cuong N. Nguyen
Phong Tran
L. Ho
Vu C. Dinh
Anh Tran
Tal Hassner
Cuong V Nguyen
17
2
0
01 Dec 2023
Just Add $π$! Pose Induced Video Transformers for Understanding
  Activities of Daily Living
Just Add πππ! Pose Induced Video Transformers for Understanding Activities of Daily Living
Dominick Reilly
Srijan Das
ViT
38
17
0
30 Nov 2023
Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale
  Fine-Grained Image Retrieval
Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval
Xiu-Shen Wei
Yang Shen
Xuhao Sun
Peng Wang
Yuxin Peng
22
10
0
21 Nov 2023
Predicting Spine Geometry and Scoliosis from DXA Scans
Predicting Spine Geometry and Scoliosis from DXA Scans
A. Jamaludin
T. Kadir
E. Clark
Andrew Zisserman
MedIm
14
3
0
15 Nov 2023
PockEngine: Sparse and Efficient Fine-tuning in a Pocket
PockEngine: Sparse and Efficient Fine-tuning in a Pocket
Ligeng Zhu
Lanxiang Hu
Ji Lin
Wei-Chen Wang
Wei-Ming Chen
Chuang Gan
Song Han
30
19
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
41
51
0
26 Oct 2023
End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation
  and Lateral Inhibition
End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition
Emilian-Claudiu Muanescu
Ruazvan-Alexandru Smuadu
Andrei-Marius Avram
Dumitru-Clementin Cercel
Florin-Catalin Pop
38
0
0
07 Oct 2023
Transferability of Representations Learned using Supervised Contrastive
  Learning Trained on a Multi-Domain Dataset
Transferability of Representations Learned using Supervised Contrastive Learning Trained on a Multi-Domain Dataset
Alvin De Jun Tan
Clement Tan
C. Yeo
40
0
0
27 Sep 2023
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for
  Arbitrary Talking Face Generation Methods
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Yongyuan Li
Xiuyuan Qin
Chao Liang
Mingqiang Wei
27
3
0
14 Sep 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and
  Generation
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
27
5
0
17 Aug 2023
A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection
  in Industrial Time Series: Methods, Applications, and Directions
A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection in Industrial Time Series: Methods, Applications, and Directions
Peng Yan
Ahmed Abdulkadir
Paul-Philipp Luley
Matthias Rosenthal
Gerrit A. Schatte
Benjamin Grewe
Thilo Stadelmann
AI4TS
36
58
0
11 Jul 2023
Large-scale unsupervised audio pre-training for video-to-speech
  synthesis
Large-scale unsupervised audio pre-training for video-to-speech synthesis
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
37
3
0
27 Jun 2023
Do as I can, not as I get
Do as I can, not as I get
Shangfei Zheng
Hongzhi Yin
Tong Chen
Quoc Viet Hung Nguyen
Wei Chen
Lei Zhao
26
1
0
17 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in
  Vision Transformers
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
33
4
0
15 Jun 2023
Diffusion Model for Dense Matching
Diffusion Model for Dense Matching
Jisu Nam
Gyuseong Lee
Sunwoo Kim
Ines Hyeonsu Kim
Hyoungwon Cho
Seyeong Kim
Seung Wook Kim
DiffM
26
9
0
30 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future
  Trends
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
24
71
0
09 May 2023
Physical Knowledge Enhanced Deep Neural Network for Sea Surface
  Temperature Prediction
Physical Knowledge Enhanced Deep Neural Network for Sea Surface Temperature Prediction
Yuxin Meng
Feng Gao
Eric Rigall
Ran Dong
Junyu Dong
Q. Du
29
20
0
19 Apr 2023
A Study on Bias and Fairness In Deep Speaker Recognition
A Study on Bias and Fairness In Deep Speaker Recognition
Amirhossein Hajavi
Ali Etemad
27
2
0
14 Mar 2023
The challenge of representation learning: Improved accuracy in deep
  vision models does not come with better predictions of perceptual similarity
The challenge of representation learning: Improved accuracy in deep vision models does not come with better predictions of perceptual similarity
F. Günther
Marco Marelli
M. Petilli
SSL
15
0
0
13 Mar 2023
Scalable Weight Reparametrization for Efficient Transfer Learning
Scalable Weight Reparametrization for Efficient Transfer Learning
Byeonggeun Kim
Juntae Lee
Seunghan Yang
Simyung Chang
OffRL
16
0
0
26 Feb 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
26
8
0
25 Feb 2023
Blind Omnidirectional Image Quality Assessment: Integrating Local
  Statistics and Global Semantics
Blind Omnidirectional Image Quality Assessment: Integrating Local Statistics and Global Semantics
Wei Zhou
Zhou Wang
13
4
0
24 Feb 2023
Random Padding Data Augmentation
Random Padding Data Augmentation
Nan Yang
Laicheng Zhong
Fan Huang
Dong Yuan
Wei Bao
28
2
0
17 Feb 2023
Semi-Supervised Visual Tracking of Marine Animals using Autonomous
  Underwater Vehicles
Semi-Supervised Visual Tracking of Marine Animals using Autonomous Underwater Vehicles
Levi Cai
Nathan McGuire
R. Hanlon
T. Mooney
Yogesh A. Girdhar
33
31
0
14 Feb 2023
LipFormer: Learning to Lipread Unseen Speakers based on Visual-Landmark
  Transformers
LipFormer: Learning to Lipread Unseen Speakers based on Visual-Landmark Transformers
Feng Xue
Yu Li
Deyin Liu
Yincen Xie
Lin Wu
Richang Hong
41
12
0
04 Feb 2023
Does progress on ImageNet transfer to real-world datasets?
Does progress on ImageNet transfer to real-world datasets?
Alex Fang
Simon Kornblith
Ludwig Schmidt
VLM
29
34
0
11 Jan 2023
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng
Ziyang Chen
Andrew Owens
31
71
0
04 Jan 2023
Vocabulary-informed Zero-shot and Open-set Learning
Vocabulary-informed Zero-shot and Open-set Learning
Yanwei Fu
Xiaomei Wang
Hanze Dong
Yu-Gang Jiang
Meng Wang
Xiangyang Xue
Leonid Sigal
VLM
21
18
0
03 Jan 2023
Multi-view Tracking, Re-ID, and Social Network Analysis of a Flock of
  Visually Similar Birds in an Outdoor Aviary
Multi-view Tracking, Re-ID, and Social Network Analysis of a Flock of Visually Similar Birds in an Outdoor Aviary
Shiting Xiao
Yufu Wang
A. Perkes
Bernd Pfrommer
Marc F. Schmidt
Kostas Daniilidis
M. Badger
24
12
0
01 Dec 2022
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision
  Research
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
J. Bornschein
Alexandre Galashov
Ross Hemsley
Amal Rannen-Triki
Yutian Chen
...
Angeliki Lazaridou
Yee Whye Teh
Andrei A. Rusu
Razvan Pascanu
MarcÁurelio Ranzato
OOD
VLM
AI4TS
39
17
0
15 Nov 2022
SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker
  Embedding and Vision Transformers
SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers
Alessandro Arezzo
Stefano Berretti
ViT
27
15
0
04 Nov 2022
Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired
  Unsupervised Cross-Modal Retrieval
Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval
Yufeng Shi
Xinge You
Jiamiao Xu
Feng Zheng
Qinmu Peng
Weihua Ou
9
0
0
26 Sep 2022
1234...91011
Next