
Title |
|---|
![]() Stochastic Weakly Convex Optimization Beyond Lipschitz ContinuityInternational Conference on Machine Learning (ICML), 2024 Wenzhi Gao Qi Deng |
![]() Masked Audio Generation using a Single Non-Autoregressive TransformerInternational Conference on Learning Representations (ICLR), 2024 |
![]() Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches
and a Head-Mounted CameraComputer Vision and Pattern Recognition (CVPR), 2024 |
![]() Locally Optimal Descent for Dynamic Stepsize SchedulingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 |
![]() Small-scale proxies for large-scale Transformer training instabilitiesInternational Conference on Learning Representations (ICLR), 2023 |
![]() Ego3DPose: Capturing 3D Cues from Binocular Egocentric ViewsACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023 |
![]() Adaptive Proximal Gradient Method for Convex OptimizationNeural Information Processing Systems (NeurIPS), 2023 |
![]() Adaptive Federated Learning with Auto-Tuned ClientsInternational Conference on Learning Representations (ICLR), 2023 |
![]() Prodigy: An Expeditiously Adaptive Parameter-Free LearnerInternational Conference on Machine Learning (ICML), 2023 |
![]() Simple and Controllable Music GenerationNeural Information Processing Systems (NeurIPS), 2023 |
![]() Mechanic: A Learning Rate TunerNeural Information Processing Systems (NeurIPS), 2023 |
![]() DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent
MethodNeural Information Processing Systems (NeurIPS), 2023 |
![]() MoMo: Momentum Models for Adaptive Learning RatesInternational Conference on Machine Learning (ICML), 2023 |
![]() Random Function DescentNeural Information Processing Systems (NeurIPS), 2023 |
![]() DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size ScheduleInternational Conference on Machine Learning (ICML), 2023 |