Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.06675
Cited By
Symbolic Discovery of Optimization Algorithms
13 February 2023
Xiangning Chen
Chen Liang
Da Huang
Esteban Real
Kaiyuan Wang
Yao Liu
Hieu H. Pham
Xuanyi Dong
Thang Luong
Cho-Jui Hsieh
Yifeng Lu
Quoc V. Le
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Symbolic Discovery of Optimization Algorithms"
50 / 194 papers shown
Title
tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs)
Junhua Zeng
Chao Li
Zhun Sun
Qibin Zhao
Guoxu Zhou
32
4
0
04 Feb 2024
MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters
Arsalan Sharifnassab
Saber Salehkaleybar
Richard Sutton
27
3
0
04 Feb 2024
Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise
Kwangjun Ahn
Zhiyu Zhang
Yunbum Kook
Yan Dai
37
11
0
02 Feb 2024
Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence
Yichuan Deng
Zhao-quan Song
Chiwun Yang
24
1
0
02 Feb 2024
Reconstructing the Invisible: Video Frame Restoration through Siamese Masked Conditional Variational Autoencoder
Yongchen Zhou
Richard Jiang
19
0
0
18 Jan 2024
AutoFT: Learning an Objective for Robust Fine-Tuning
Caroline Choi
Yoonho Lee
Annie S. Chen
Allan Zhou
Aditi Raghunathan
Chelsea Finn
OOD
37
0
0
18 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
19
2
0
18 Jan 2024
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket
Zhaokun Zhou
Kaiwei Che
Wei Fang
Keyu Tian
Yuesheng Zhu
Shuicheng Yan
Yonghong Tian
Liuliang Yuan
ViT
37
27
0
04 Jan 2024
fMPI: Fast Novel View Synthesis in the Wild with Layered Scene Representations
Jonas Kohler
Nicolas Griffiths Sanchez
Luca Cavalli
Catherine Herold
Albert Pumarola
Alberto Garcia Garcia
Ali K. Thabet
16
1
0
26 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Bernard Ghanem
Pablo Arbelaez
Ali K. Thabet
21
12
0
19 Dec 2023
Paloma: A Benchmark for Evaluating Language Model Fit
Ian H. Magnusson
Akshita Bhagia
Valentin Hofmann
Luca Soldaini
A. Jha
...
Iz Beltagy
Hanna Hajishirzi
Noah A. Smith
Kyle Richardson
Jesse Dodge
132
21
0
16 Dec 2023
AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions
Esteban Real
Yao Chen
Mirko Rossini
Connal de Souza
Manav Garg
Akhil Verghese
Moritz Firsching
Quoc V. Le
E. D. Cubuk
David H. Park
11
1
0
13 Dec 2023
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models
Yubin Wang
Xinyang Jiang
De Cheng
Dongsheng Li
Cairong Zhao
VLM
38
15
0
11 Dec 2023
Kandinsky 3.0 Technical Report
V.Ya. Arkhipkin
Andrei Filatov
Viacheslav Vasilev
Anastasia Maltseva
Said Azizov
Igor Pavlov
Julia Agafonova
Andrey Kuznetsov
Denis Dimitrov
DiffM
28
10
0
06 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Alan L. Yuille
Cihang Xie
VLM
49
7
0
04 Dec 2023
Sample Efficient Preference Alignment in LLMs via Active Exploration
Viraj Mehta
Vikramjeet Das
Ojash Neopane
Yijia Dai
Ilija Bogunovic
Ilija Bogunovic
W. Neiswanger
Stefano Ermon
Jeff Schneider
Willie Neiswanger
OffRL
25
12
0
01 Dec 2023
Generalisable Agents for Neural Network Optimisation
Kale-ab Tessera
C. Tilbury
Sasha Abramowitz
Ruan de Kock
Omayma Mahjoub
Benjamin Rosman
Sara Hooker
Arnu Pretorius
AI4CE
9
0
0
30 Nov 2023
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Aditi Jha
Sam Havens
Jeremey Dohmann
Alex Trott
Jacob P. Portes
ALM
19
11
0
22 Nov 2023
Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research
Bardia Khosravi
Frank Li
Theo Dapamede
Pouria Rouzrokh
Cooper Gamble
...
C. Wyles
Andrew B. Sellergren
S. Purkayastha
Bradley J. Erickson
J. Gichoya
MedIm
27
17
0
15 Nov 2023
Controlling the Output of a Generative Model by Latent Feature Vector Shifting
Róbert Belanec
Peter Lacko
Kristína Malinovská
14
1
0
15 Nov 2023
Plum: Prompt Learning using Metaheuristic
Rui Pan
Shuo Xing
Shizhe Diao
Wenhe Sun
Xiang Liu
Kashun Shum
Renjie Pi
Jipeng Zhang
Tong Zhang
VLM
OffRL
LRM
29
6
0
14 Nov 2023
A Coefficient Makes SVRG Effective
Yida Yin
Zhiqiu Xu
Zhiyuan Li
Trevor Darrell
Zhuang Liu
20
1
0
09 Nov 2023
Outliers with Opposing Signals Have an Outsized Effect on Neural Network Optimization
Elan Rosenfeld
Andrej Risteski
25
10
0
07 Nov 2023
OmniVec: Learning robust representations with cross modal sharing
Siddharth Srivastava
Gaurav Sharma
SSL
21
64
0
07 Nov 2023
Signal Processing Meets SGD: From Momentum to Filter
Zhipeng Yao
Guisong Chang
Jiaqi Zhang
Qi Zhang
Dazhou Li
Yu Zhang
ODL
24
0
0
06 Nov 2023
Closing the Gap Between the Upper Bound and the Lower Bound of Adam's Iteration Complexity
Bohan Wang
Jingwen Fu
Huishuai Zhang
Nanning Zheng
Wei-Neng Chen
8
16
0
27 Oct 2023
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
15
1
0
25 Oct 2023
Rethinking SIGN Training: Provable Nonconvex Acceleration without First- and Second-Order Gradient Lipschitz
Tao Sun
Congliang Chen
Peng Qiao
Li Shen
Xinwang Liu
Dongsheng Li
28
3
0
23 Oct 2023
NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving
Kaustab Pal
Aditya Sharma
Mohd. Omama
Parth N. Shah
K. M. Krishna
11
0
0
19 Oct 2023
Fractional Concepts in Neural Networks: Enhancing Activation Functions
Zahra Alijani
Vojtech Molek
15
0
0
18 Oct 2023
An Automatic Learning Rate Schedule Algorithm for Achieving Faster Convergence and Steeper Descent
Zhao-quan Song
Chiwun Yang
19
9
0
17 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
28
11
0
14 Oct 2023
Adam-family Methods with Decoupled Weight Decay in Deep Learning
Kuang-Yu Ding
Nachuan Xiao
Kim-Chuan Toh
16
3
0
13 Oct 2023
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
Zhikai Li
Xiaoxuan Liu
Banghua Zhu
Zhen Dong
Qingyi Gu
Kurt Keutzer
MQ
27
7
0
11 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
24
262
0
10 Oct 2023
Learning to Decode the Surface Code with a Recurrent, Transformer-Based Neural Network
Johannes Bausch
Andrew W. Senior
Francisco J. H. Heras
Thomas Edlich
Alex Davies
...
C. Gidney
Demis Hassabis
Sergio Boixo
Hartmut Neven
Pushmeet Kohli
14
32
0
09 Oct 2023
Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Lizhang Chen
Bo Liu
Kaizhao Liang
Qian Liu
ODL
19
15
0
09 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Tom Sherborne
Naomi Saphra
Pradeep Dasigi
Hao Peng
32
4
0
05 Oct 2023
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
Zekun Wang
Zhongyuan Peng
Haoran Que
Jiaheng Liu
Wangchunshu Zhou
...
Wanli Ouyang
Ke Xu
Wenhu Chen
Jie Fu
Junran Peng
LLMAG
36
80
0
01 Oct 2023
Masked Autoencoders are Scalable Learners of Cellular Morphology
Oren Z. Kraus
Kian Kenyon-Dean
Saber Saberian
Maryam Fallah
Peter McLean
...
Chi Vicky Cheng
Kristen Morse
Maureen Makes
Ben Mabey
Berton A. Earnshaw
19
14
0
27 Sep 2023
Physics Informed Neural Network Code for 2D Transient Problems (PINN-2DT) Compatible with Google Colab
Pawel Maczuga
Maciej Sikora
Maciej Skoczeñ
Przemyslaw Ro.znawski
Filip Tluszcz
Marcin Szubert
Marcin Lo's
W. Dzwinel
K. Pingali
Maciej Paszyñski
AI4CE
14
0
0
24 Sep 2023
ThinResNet: A New Baseline for Structured Convolutional Networks Pruning
Hugo Tessier
Ghouti Boukli Hacene
Vincent Gripon
8
1
0
22 Sep 2023
Traveling Words: A Geometric Interpretation of Transformers
Raul Molina
22
4
0
13 Sep 2023
Using Reed-Muller Codes for Classification with Rejection and Recovery
Daniel Fentham
David Parker
Mark Ryan
25
0
0
12 Sep 2023
Convergence Analysis of Decentralized ASGD
Mauro Dalle Lucca Tosi
Martin Theobald
21
2
0
07 Sep 2023
nanoT5: A PyTorch Framework for Pre-training and Fine-tuning T5-style Models with Limited Resources
Piotr Nawrot
AI4CE
17
5
0
05 Sep 2023
AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Lei Guan
ODL
11
3
0
05 Sep 2023
NLLB-CLIP -- train performant multilingual image retrieval model on a budget
Alexander Visheratin
VLM
24
17
0
04 Sep 2023
ExMobileViT: Lightweight Classifier Extension for Mobile Vision Transformer
Gyeongdong Yang
Yungwook Kwon
Hyunjin Kim
ViT
13
1
0
04 Sep 2023
Emergence of Segmentation with Minimalistic White-Box Transformers
Yaodong Yu
Tianzhe Chu
Shengbang Tong
Ziyang Wu
Druv Pai
Sam Buchanan
Y. Ma
ViT
17
22
0
30 Aug 2023
Previous
1
2
3
4
Next