Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1608.03983
Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 1,274 papers shown
Title
MoSt-DSA: Modeling Motion and Structural Interactions for Direct Multi-Frame Interpolation in DSA Images
Ziyang Xu
Huangxuan Zhao
Ziwei Cui
Wenyu Liu
Chuansheng Zheng
Xinggang Wang
25
1
0
09 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
40
3
0
09 Jul 2024
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala
Alind Khare
Animesh Agrawal
Igor Fedorov
Hugo Latapie
Myungjin Lee
Alexey Tumanov
CLL
37
0
0
08 Jul 2024
HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion
Junhao Su
Chenghao He
Feiyu Zhu
Xiaojie Xu
Dongzhi Guan
Chenyang Si
48
2
0
08 Jul 2024
T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy
Fan Duan
Jiahao Yu
Li Chen
3DPC
33
0
0
06 Jul 2024
Improving ensemble extreme precipitation forecasts using generative artificial intelligence
Yingkai Sha
R. Sobash
David John Gagne II
25
0
0
05 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
39
7
0
05 Jul 2024
SE(3)-Hyena Operator for Scalable Equivariant Learning
Artem Moskalev
Mangal Prakash
Rui Liao
Tommaso Mansi
44
2
0
01 Jul 2024
Swish-T : Enhancing Swish Activation with Tanh Bias for Improved Neural Network Performance
Youngmin Seo
Jinha Kim
Unsang Park
26
0
0
01 Jul 2024
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
28
1
0
24 Jun 2024
Compressing Search with Language Models
Thomas Mulc
Jennifer L. Steele
60
1
0
24 Jun 2024
FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding
Mingkun Wang
Xiaoguang Ren
Ruochun Jin
Minglong Li
Xiaochuan Zhang
Changqian Yu
Mingxu Wang
Wenjing Yang
45
1
0
20 Jun 2024
Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET
Symac Kim
Junho Moon
Haejun Chung
Ikbeom Jang
MedIm
44
0
0
18 Jun 2024
Temporal Lidar Depth Completion
Pietari Kaskela
Philipp Fischer
Timo Roman
3DV
23
0
0
17 Jun 2024
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
Ashish Kumar
Daneul Kim
Jaesik Park
Laxmidhar Behera
34
1
0
16 Jun 2024
Gradient-based Learning in State-based Potential Games for Self-Learning Production Systems
Steve Yuwono
Marlon Löppenberg
Dorothea Schwung
Andreas Schwung
26
2
0
14 Jun 2024
PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement
Wei-Tung Lin
Yong-Xiang Lin
Jyun-Wei Chen
Kai-Lung Hua
Mamba
23
5
0
12 Jun 2024
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization
Jiaxin Deng
Junbiao Pang
Baochang Zhang
64
1
0
12 Jun 2024
Enhancing End-to-End Autonomous Driving with Latent World Model
Yingyan Li
Lue Fan
Jiawei He
Yuqi Wang
Yuntao Chen
Zhaoxiang Zhang
Tieniu Tan
72
8
0
12 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
34
2
0
11 Jun 2024
Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits
G. Gala
Cassio de Campos
Antonio Vergari
Erik Quaeghebeur
TPM
63
4
0
10 Jun 2024
Attention as a Hypernetwork
Simon Schug
Seijin Kobayashi
Yassir Akram
João Sacramento
Razvan Pascanu
GNN
37
3
0
09 Jun 2024
Feature contamination: Neural networks learn uncorrelated features and fail to generalize
Tianren Zhang
Chujie Zhao
Guanyu Chen
Yizhou Jiang
Feng Chen
OOD
MLT
OODD
74
3
0
05 Jun 2024
TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems
Ryo Yonetani
39
1
0
05 Jun 2024
Polynomial-Augmented Neural Networks (PANNs) with Weak Orthogonality Constraints for Enhanced Function and PDE Approximation
Madison Cooley
Shandian Zhe
Robert M. Kirby
Varun Shankar
54
1
0
04 Jun 2024
Self-Improving Robust Preference Optimization
Eugene Choi
Arash Ahmadian
Matthieu Geist
Oilvier Pietquin
M. G. Azar
28
8
0
03 Jun 2024
Robust Classification by Coupling Data Mollification with Label Smoothing
Markus Heinonen
Ba-Hien Tran
Michael Kampffmeyer
Maurizio Filippone
67
0
0
03 Jun 2024
Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction
Cheng Xu
Fei Hou
Wencheng Wang
Hong Qin
Zhebin Zhang
Ying He
53
1
0
01 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
50
1
0
31 May 2024
LCQ: Low-Rank Codebook based Quantization for Large Language Models
Wen-Pu Cai
Wu-Jun Li
Wu-Jun Li
MQ
38
0
0
31 May 2024
Scaling White-Box Transformers for Vision
Jinrui Yang
Xianhang Li
Druv Pai
Yuyin Zhou
Yi-An Ma
Yaodong Yu
Cihang Xie
ViT
41
9
0
30 May 2024
EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos
Masashi Hatano
Ryo Hachiuma
Hideo Saito
EgoV
29
3
0
30 May 2024
Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training
Jinxia Yang
Bing-Huang Su
Wayne Xin Zhao
Ji-Rong Wen
32
2
0
30 May 2024
AdaFisher: Adaptive Second Order Optimization via Fisher Information
Damien Martins Gomes
Yanlei Zhang
Eugene Belilovsky
Guy Wolf
Mahdi S. Hosseini
ODL
76
2
0
26 May 2024
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Byung-Kwan Lee
Chae Won Kim
Beomchan Park
Yonghyun Ro
MLLM
LRM
31
17
0
24 May 2024
Infinite-Dimensional Feature Interaction
Chenhui Xu
Fuxun Yu
Maoliang Li
Zihao Zheng
Zirui Xu
Jinjun Xiong
Xiang Chen
34
1
0
22 May 2024
Conditioning diffusion models by explicit forward-backward bridging
Adrien Corenflos
Zheng Zhao
Simo Särkkä
Jens Sjölund
Thomas B. Schon
DiffM
53
5
0
22 May 2024
StarLKNet: Star Mixup with Large Kernel Networks for Palm Vein Identification
Xin Jin
Hongyu Zhu
M. El-Yacoubi
Hongchao Liao
Huafeng Qin
Yun Jiang
35
6
0
21 May 2024
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography
Shantanu Ghosh
Clare B. Poynton
Shyam Visweswaran
Kayhan Batmanghelich
VLM
37
8
0
20 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
48
1
0
17 May 2024
Quantum Vision Transformers for Quark-Gluon Classification
Marçal Comajoan Cara
Gopal Ramesh Dahale
Zhongtian Dong
Roy T. Forestano
S. Gleyzer
...
Kyoungchul Kong
Tom Magorsch
Konstantin T. Matchev
Katia Matcheva
Eyup B. Unlu
42
9
0
16 May 2024
Compression-Realized Deep Structural Network for Video Quality Enhancement
Hanchi Sun
Xiaohong Liu
Xinyang Jiang
Yifei Shen
Dongsheng Li
Xiongkuo Min
Guangtao Zhai
30
1
0
10 May 2024
Model-based reinforcement learning for protein backbone design
Frederic Renard
Cyprien Courtot
Alfredo Reichlin
Oliver Bent
43
0
0
03 May 2024
Reinforcement Learning-Guided Semi-Supervised Learning
Marzi Heidari
Hanping Zhang
Yuhong Guo
OffRL
27
0
0
02 May 2024
Enhancing User Experience in On-Device Machine Learning with Gated Compression Layers
Haiguang Li
Usama Pervaiz
Joseph Antognini
Michal Matuszak
Lawrence Au
Gilles Roux
T. Thormundsson
36
0
0
02 May 2024
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Antonio Emanuele Cinà
Jérôme Rony
Maura Pintor
Luca Demetrio
Ambra Demontis
Battista Biggio
Ismail Ben Ayed
Fabio Roli
ELM
AAML
SILM
44
6
0
30 Apr 2024
Inexact subgradient methods for semialgebraic functions
Jérôme Bolte
Tam Le
Éric Moulines
Edouard Pauwels
44
2
0
30 Apr 2024
ShadowMaskFormer: Mask Augmented Patch Embeddings for Shadow Removal
Zhuohao Li
Guoyang Xie
Guannan Jiang
Zhichao Lu
31
3
0
29 Apr 2024
CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention
Damith Chamalke Senadeera
Xiaoyun Yang
Dimitrios Kollias
Gregory G. Slabaugh
32
0
0
27 Apr 2024
Temporal Scaling Law for Large Language Models
Yizhe Xiong
Xiansheng Chen
Xin Ye
Hui Chen
Zijia Lin
...
Zhenpeng Su
Wei Huang
Jianwei Niu
J. Han
Guiguang Ding
43
9
0
27 Apr 2024
Previous
1
2
3
4
5
...
24
25
26
Next