1608.03983
Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 1,220 papers shown
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Vibha Belavadi
Tushar Vatsa
Dewang Sultania
Suhas Suresha
Ishita Verma
C. L. P. Chen
Tracy Holloway King
Michael Friedrich
SyDa
23
0
0
15 May 2025
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang
Howe Tissue
Lu Wang
Linjing Li
D. Zeng
CLL
29
0
0
12 May 2025
CogniSNN: A First Exploration to Random Graph Architecture based Spiking Neural Networks with Enhanced Expandability and Neuroplasticity
Yongsheng Huang
Peibo Duan
Zhipeng Liu
Kai Sun
Changsheng Zhang
Bin Zhang
Mingkun Xu
GNN
50
0
0
09 May 2025
VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction
Noah Frahm
Dongxu Zhao
Andrea Dunn Beltran
Ron Alterovitz
Jan-Michael Frahm
Junier Oliva
Roni Sengupta
122
0
0
09 May 2025
Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection
Hanzhe Liang
Aoran Wang
Jie Zhou
Xin Jin
C. Gao
Jinbao Wang
21
0
0
09 May 2025
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
Tom Sander
Moritz Tenthoff
Kay Wohlfarth
Christian Wöhler
31
0
0
08 May 2025
Image Restoration via Multi-domain Learning
Xingyu Jiang
Ning Gao
Xiuhui Zhang
Hongkun Dou
Shaowen Fu
Xiaoqing Zhong
H. Li
Yue Deng
ViT
34
0
0
07 May 2025
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
Zixiang Ai
Zichen Liu
Yuanhang Lei
Zhenyu Cui
Xu Zou
Jiahuan Zhou
29
0
0
07 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
RAIL: Region-Aware Instructive Learning for Semi-Supervised Tooth Segmentation in CBCT
Chuyu Zhao
Hao Huang
Jiashuo Guo
Ziyu Shen
Zhongwei Zhou
Jie Liu
Zekuan Yu
45
0
0
06 May 2025
PASCAL: Precise and Efficient ANN-SNN Conversion using Spike Accumulation and Adaptive Layerwise Activation
Pranav Ramesh
Gopalakrishnan Srinivasan
29
0
0
03 May 2025
A Neural Architecture Search Method using Auxiliary Evaluation Metric based on ResNet Architecture
Shang Wang
Huanrong Tang
Jianquan Ouyang
41
0
0
02 May 2025
MemeBLIP2: A novel lightweight multimodal system to detect harmful memes
Jiaqi Liu
Ran Tong
Aowei Shen
Shuzheng Li
Changlin Yang
Lisha Xu
VLM
77
0
0
29 Apr 2025
Image Interpolation with Score-based Riemannian Metrics of Diffusion Models
Shinnosuke Saito
Takashi Matsubara
DiffM
82
1
0
28 Apr 2025
Learning Efficiency Meets Symmetry Breaking
Yingbin Bai
Sylvie Thiébaux
Felipe Trevizan
32
0
0
28 Apr 2025
A Comparison-Relationship-Surrogate Evolutionary Algorithm for Multi-Objective Optimization
Christopher M. Pierce
Young-Kee Kim
Ivan Bazarov
28
0
0
28 Apr 2025
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report
Paul Kassianik
Baturay Saglam
Alexander Chen
Blaine Nelson
Anu Vellore
...
Hyrum Anderson
Kojin Oshiba
Omar Santos
Yaron Singer
Amin Karbasi
PILM
61
0
0
28 Apr 2025
MERA: Multimodal and Multiscale Self-Explanatory Model with Considerably Reduced Annotation for Lung Nodule Diagnosis
Jiahao Lu
Chong Yin
Silvia Ingala
Kenny Erleben
M. Nielsen
S. Darkner
49
0
0
27 Apr 2025
Co-Training with Active Contrastive Learning and Meta-Pseudo-Labeling on 2D Projections for Deep Semi-Supervised Learning
David Aparco-Cardenas
Jancarlo F. Gomes
Alexandre X. Falcão
Pedro J. de Rezende
41
0
0
25 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
48
0
0
22 Apr 2025
HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis
Xuanhua Yin
Dingxin Zhang
Jianhui Yu
Weidong Cai
25
0
0
19 Apr 2025
FocusedAD: Character-centric Movie Audio Description
Xiaojun Ye
C. Wang
Yiren Song
Sheng Zhou
Liangcheng Li
Jiajun Bu
VGen
53
0
0
16 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
60
0
0
15 Apr 2025
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
Erin Carson
Xinye Chen
49
0
0
10 Apr 2025
Impact of Language Guidance: A Reproducibility Study
Cherish Puniani
Advika Sinha
Shree Singhi
Aayan Yadav
VLM
44
0
0
10 Apr 2025
Spline-based Transformers
Prashanth Chandran
Agon Serifi
Markus Gross
Moritz Bächer
38
0
0
03 Apr 2025
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
Xingguang Zhang
Nicholas Chimitt
Xijun Wang
Yu Yuan
Stanley H. Chan
36
0
0
03 Apr 2025
MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
Trung Thanh Nguyen
Yasutomo Kawanishi
Vijay John
Takahiro Komamizu
Ichiro Ide
41
0
0
03 Apr 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Hongyi Liu
Andrew Wen
Shaochen Zhong
Hanjie Chen
OffRL
ReLM
LRM
74
26
0
20 Mar 2025
Hybrid Agents for Image Restoration
Bingchen Li
X. Li
Yiting Lu
Zhibo Chen
80
1
0
13 Mar 2025
Poly-MgNet: Polynomial Building Blocks in Multigrid-Inspired ResNets
Antonia van Betteray
Matthias Rottmann
Karsten Kahl
48
0
0
13 Mar 2025
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Xuying Zhang
Yutong Liu
Yangguang Li
Renrui Zhang
Y. Liu
...
Wanli Ouyang
Zhiwei Xiong
Peng Gao
Qibin Hou
Ming-Ming Cheng
118
3
0
13 Mar 2025
Implicit Contrastive Representation Learning with Guided Stop-gradient
Byeongchan Lee
Sehyun Lee
SSL
87
2
0
12 Mar 2025
Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies
Chen Xu
Tony Nguyen
Emma Dixon
Christopher Rodriguez
Patrick "Tree" Miller
Robert Lee
Paarth Shah
Rares Ambrus
Haruki Nishimura
Masha Itkina
OffRL
78
2
0
11 Mar 2025
ISP-AD: A Large-Scale Real-World Dataset for Advancing Industrial Anomaly Detection with Synthetic and Real Defects
Paul J. Krassnig
Dieter P. Gruber
125
0
0
06 Mar 2025
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data
Haoxin Li
Boyang Li
CoGe
69
0
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
62
0
0
03 Mar 2025
MRI super-resolution reconstruction using efficient diffusion probabilistic model with residual shifting
Mojtaba Safari
Shansong Wang
Zach Eidex
Qiang Li
Erik H. Middlebrooks
D. Yu
Xiaofeng Yang
MedIm
81
1
0
03 Mar 2025
MFSR-GAN: Multi-Frame Super-Resolution with Handheld Motion Modeling
Fadeel Sher Khan
Joshua Ebenezer
Hamid Sheikh
Seok-Jun Lee
67
0
0
28 Feb 2025
HVI: A New Color Space for Low-light Image Enhancement
Qingsen Yan
Yixu Feng
Cheng Zhang
Guansong Pang
Kangbiao Shi
Peng Wu
Wei Dong
Jinqiu Sun
Yanning Zhang
41
5
0
27 Feb 2025
Kanana: Compute-efficient Bilingual Language Models
Kanana LLM Team
Yunju Bak
Hojin Lee
Minho Ryu
Jiyeon Ham
...
Daniel Lee
Minchul Lee
M. Lee
Shinbok Lee
Gaeun Seo
88
1
0
26 Feb 2025
Sample Selection via Contrastive Fragmentation for Noisy Label Regression
C. Kim
Sangwoo Moon
Jihwan Moon
Dongyeon Woo
Gunhee Kim
NoLa
52
0
0
25 Feb 2025
Retrieval-Augmented Speech Recognition Approach for Domain Challenges
Peng Shen
Xugang Lu
Hisashi Kawai
RALM
60
0
0
24 Feb 2025
Disentangling Visual Transformers: Patch-level Interpretability for Image Classification
Guillaume Jeanneret
Loïc Simon
F. Jurie
ViT
44
0
0
24 Feb 2025
Patch Stitching Data Augmentation for Cancer Classification in Pathology Images
Jiamu Wang
Chang-Su Kim
Jin Tae Kwak
MedIm
28
1
0
22 Feb 2025
Exploiting Deblurring Networks for Radiance Fields
Haeyun Choi
Heemin Yang
Janghyeok Han
Sunghyun Cho
52
0
0
20 Feb 2025
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees
Yongtao Wu
Luca Viano
Yihang Chen
Zhenyu Zhu
Kimon Antonakopoulos
Quanquan Gu
V. Cevher
49
0
0
18 Feb 2025
The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation
Reza Moravej
Saurabh Bodhe
Zhanguang Zhang
Didier Chetelat
Dimitrios Tsaras
Yingxue Zhang
Hui-Ling Zhen
Jianye Hao
M. Yuan
50
1
0
17 Feb 2025
An Efficient Row-Based Sparse Fine-Tuning
Cen-Jhih Li
Aditya Bhaskara
52
0
0
17 Feb 2025
Increasing Both Batch Size and Learning Rate Accelerates Stochastic Gradient Descent
Hikaru Umeda
Hideaki Iiduka
67
2
0
17 Feb 2025