1511.08228
Cited By
Neural GPUs Learn Algorithms
25 November 2015
Lukasz Kaiser
Ilya Sutskever
Papers citing
"Neural GPUs Learn Algorithms"
50 / 97 papers shown
Title
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Naomi Saphra
LRM
41
1
0
24 Feb 2025
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions
Hui Lu
A. A. Salah
Ronald Poppe
SLR
35
5
0
18 Mar 2024
The Expected Loss of Preconditioned Langevin Dynamics Reveals the Hessian Rank
Amitay Bar
Rotem Mulayoff
T. Michaeli
Ronen Talmon
66
0
0
21 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
43
1
0
01 Feb 2024
Optimizing Large Language Models to Expedite the Development of Smart Contracts
Nii Osae Osae Dade
Margaret Lartey-Quaye
Emmanuel Teye-Kofi Odonkor
Paul Ammah
35
4
0
08 Oct 2023
Neural Algorithmic Reasoning Without Intermediate Supervision
Gleb Rodionov
Liudmila Prokhorenkova
OffRL
LRM
OOD
41
10
0
23 Jun 2023
SALSA VERDE: a machine learning attack on Learning With Errors with sparse small secrets
Cathy Li
Emily Wenger
Zeyuan Allen-Zhu
François Charton
Kristin E. Lauter
AAML
33
10
0
20 Jun 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
34
4
0
22 May 2023
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models
Alberto Testolin
AIMat
37
20
0
14 Mar 2023
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis
Ling-Hao Chen
Alberto Testolin
34
1
0
17 Jan 2023
Rationalizing Predictions by Adversarial Information Calibration
Lei Sha
Oana-Maria Camburu
Thomas Lukasiewicz
30
4
0
15 Jan 2023
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Ippei Fujisawa
Ryota Kanai
ELM
LRM
28
4
0
14 Nov 2022
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Surbhi Goel
Sham Kakade
Adam Tauman Kalai
Cyril Zhang
34
1
0
01 Sep 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
38
160
0
11 Jul 2022
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation
Samuel Cognolato
Alberto Testolin
42
7
0
06 Jul 2022
Neural Networks and the Chomsky Hierarchy
Grégoire Delétang
Anian Ruoss
Jordi Grau-Moya
Tim Genewein
L. Wenliang
...
Chris Cundy
Marcus Hutter
Shane Legg
Joel Veness
Pedro A. Ortega
UQCV
109
133
0
05 Jul 2022
The CLRS Algorithmic Reasoning Benchmark
Petar Veličković
Adria Puigdomenech Badia
David Budden
Razvan Pascanu
Andrea Banino
Mikhail Dashevskiy
R. Hadsell
Charles Blundell
163
89
0
31 May 2022
Highly Accurate FMRI ADHD Classification using time distributed multi modal 3D CNNs
Christopher Sims
MedIm
21
3
0
24 May 2022
A Probabilistic Interpretation of Transformers
Alexander Shim
43
1
0
28 Apr 2022
HyperNCA: Growing Developmental Networks with Neural Cellular Automata
Elias Najarro
Shyam Sudhakaran
Claire Glanois
S. Risi
39
14
0
25 Apr 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
28
15
0
07 Mar 2022
End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking
Arpit Bansal
Avi Schwarzschild
Eitan Borgnia
Z. Emam
Furong Huang
Micah Goldblum
Tom Goldstein
LRM
19
24
0
11 Feb 2022
Deep Symbolic Regression for Recurrent Sequences
Stéphane d'Ascoli
Pierre-Alexandre Kamienny
Guillaume Lample
François Charton
47
54
0
12 Jan 2022
Linear algebra with transformers
François Charton
AIMat
31
56
0
03 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
...
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
ReLM
LRM
98
707
0
30 Nov 2021
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
30
93
0
10 Nov 2021
State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks
P. Ofner
Roman Kern
30
1
0
18 Oct 2021
Pretrained Language Models are Symbolic Mathematics Solvers too!
Kimia Noorbakhsh
Modar Sulaiman
M. Sharifi
Kallol Roy
Pooyan Jamshidi
LRM
28
18
0
07 Oct 2021
Learning to Synthesize Programs as Interpretable and Generalizable Policies
Dweep Trivedi
Jesse Zhang
Shao-Hua Sun
Joseph J. Lim
NAI
24
72
0
31 Aug 2021
The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
ViT
30
129
0
26 Aug 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
86
5,161
0
07 Jul 2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks
Avi Schwarzschild
Eitan Borgnia
Arjun Gupta
Furong Huang
U. Vishkin
Micah Goldblum
Tom Goldstein
24
74
0
08 Jun 2021
Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Daniel Tanneberg
Elmar Rueckert
Jan Peters
38
6
0
17 May 2021
Neural Algorithmic Reasoning
Petar Velickovic
Charles Blundell
NAI
OOD
25
99
0
06 May 2021
CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model with Attention for Predicting Trends of Financial Markets
Jia Wang
Tong Sun
Benyuan Liu
Yu Cao
Hongwei Zhu
AI4TS
39
64
0
08 Apr 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks
Rodrigo Nogueira
Zhiying Jiang
Jimmy J. Li
LRM
24
123
0
25 Feb 2021
Combinatorial optimization and reasoning with graph neural networks
Quentin Cappart
Didier Chételat
Elias Boutros Khalil
Andrea Lodi
Christopher Morris
Petar Velickovic
AI4CE
37
352
0
18 Feb 2021
Neural Sequence-to-grid Module for Learning Symbolic Rules
Segwang Kim
Hyoungwook Nam
Joonyoung Kim
Kyomin Jung
NAI
72
11
0
13 Jan 2021
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
233
255
0
09 Dec 2020
Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks
David Bieber
Charles Sutton
Hugo Larochelle
Daniel Tarlow
GNN
27
43
0
23 Oct 2020
It's Hard for Neural Networks To Learn the Game of Life
Jacob Mitchell Springer
Garrett Kenyon
27
21
0
03 Sep 2020
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures
Daniel Furrer
Marc van Zee
Nathan Scales
Nathanael Scharli
CoGe
26
113
0
17 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving
Pasquale Minervini
Sebastian Riedel
Pontus Stenetorp
Edward Grefenstette
Tim Rocktaschel
LRM
45
96
0
13 Jul 2020
Hierarchically Compositional Tasks and Deep Convolutional Networks
Arturo Deza
Q. Liao
Andrzej Banburski
T. Poggio
BDL
OOD
33
2
0
24 Jun 2020
Neural Execution Engines: Learning to Execute Subroutines
Yujun Yan
Kevin Swersky
Danai Koutra
Parthasarathy Ranganathan
Milad Hashemi
NAI
16
40
0
15 Jun 2020
Learning advanced mathematical computations from examples
François Charton
Amaury Hayat
Guillaume Lample
PINN
23
4
0
11 Jun 2020
Neural Power Units
Niklas Heim
Tomás Pevný
Václav Smídl
29
9
0
02 Jun 2020
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths
Andreas Robinson
42
0
0
18 Mar 2020
It's Not What Machines Can Learn, It's What We Cannot Teach
Gal Yehuda
Moshe Gabel
Assaf Schuster
FaML
19
37
0
21 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
27
138
0
18 Feb 2020