Neural GPUs Learn Algorithms

25 November 2015

Papers citing "Neural GPUs Learn Algorithms"

50 / 97 papers shown

Title
Distributional Scaling Laws for Emergent Capabilities Rosie Zhao Tian Qin David Alvarez-Melis Sham Kakade Naomi Saphra LRM 41 1 0 24 Feb 2025
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions Hui Lu A. A. Salah Ronald Poppe SLR 35 5 0 18 Mar 2024
The Expected Loss of Preconditioned Langevin Dynamics Reveals the Hessian Rank Amitay Bar Rotem Mulayoff T. Michaeli Ronen Talmon 66 0 0 21 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt Jishnu Ray Chowdhury Cornelia Caragea 43 1 0 01 Feb 2024
Optimizing Large Language Models to Expedite the Development of Smart Contracts Nii Osae Osae Dade Margaret Lartey-Quaye Emmanuel Teye-Kofi Odonkor Paul Ammah 35 4 0 08 Oct 2023
Neural Algorithmic Reasoning Without Intermediate Supervision Gleb Rodionov Liudmila Prokhorenkova OffRL LRM OOD 41 10 0 23 Jun 2023
SALSA VERDE: a machine learning attack on Learning With Errors with sparse small secrets Cathy Li Emily Wenger Zeyuan Allen-Zhu François Charton Kristin E. Lauter AAML 33 10 0 20 Jun 2023
Neural Machine Translation for Code Generation K. Dharma Clayton T. Morrison 34 4 0 22 May 2023
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models Alberto Testolin AIMat 37 20 0 14 Mar 2023
Learning to solve arithmetic problems with a virtual abacus Flavio Petruzzellis Ling-Hao Chen Alberto Testolin 34 1 0 17 Jan 2023
Rationalizing Predictions by Adversarial Information Calibration Lei Sha Oana-Maria Camburu Thomas Lukasiewicz 30 4 0 15 Jan 2023
Logical Tasks for Measuring Extrapolation and Rule Comprehension Ippei Fujisawa Ryota Kanai ELM LRM 28 4 0 14 Nov 2022
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms Surbhi Goel Sham Kakade Adam Tauman Kalai Cyril Zhang 34 1 0 01 Sep 2022
Exploring Length Generalization in Large Language Models Cem Anil Yuhuai Wu Anders Andreassen Aitor Lewkowycz Vedant Misra V. Ramasesh Ambrose Slone Guy Gur-Ari Ethan Dyer Behnam Neyshabur ReLM LRM 38 160 0 11 Jul 2022
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation Samuel Cognolato Alberto Testolin 42 7 0 06 Jul 2022
Neural Networks and the Chomsky Hierarchy Grégoire Delétang Anian Ruoss Jordi Grau-Moya Tim Genewein L. Wenliang ... Chris Cundy Marcus Hutter Shane Legg Joel Veness Pedro A. Ortega UQCV 109 133 0 05 Jul 2022
The CLRS Algorithmic Reasoning Benchmark Petar Velivcković Adria Puigdomenech Badia David Budden Razvan Pascanu Andrea Banino Mikhail Dashevskiy R. Hadsell Charles Blundell 163 89 0 31 May 2022
Highly Accurate FMRI ADHD Classification using time distributed multi modal 3D CNNs Christopher Sims MedIm 21 3 0 24 May 2022
A Probabilistic Interpretation of Transformers Alexander Shim 43 1 0 28 Apr 2022
HyperNCA: Growing Developmental Networks with Neural Cellular Automata Elias Najarro Shyam Sudhakaran Claire Glanois S. Risi 39 14 0 25 Apr 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions David Bieber Rishab Goel Daniel Zheng Hugo Larochelle Daniel Tarlow 28 15 0 07 Mar 2022
End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking Arpit Bansal Avi Schwarzschild Eitan Borgnia Z. Emam Furong Huang Micah Goldblum Tom Goldstein LRM 19 24 0 11 Feb 2022
Deep Symbolic Regression for Recurrent Sequences Stéphane dÁscoli Pierre-Alexandre Kamienny Guillaume Lample Franccois Charton 47 54 0 12 Jan 2022
Linear algebra with transformers Franccois Charton AIMat 31 56 0 03 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language Models Maxwell Nye Anders Andreassen Guy Gur-Ari Henryk Michalewski Jacob Austin ... Aitor Lewkowycz Maarten Bosma D. Luan Charles Sutton Augustus Odena ReLM LRM 98 707 0 30 Nov 2021
Gradients are Not All You Need Luke Metz C. Freeman S. Schoenholz Tal Kachman 30 93 0 10 Nov 2021
State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks P. Ofner Roman Kern 30 1 0 18 Oct 2021
Pretrained Language Models are Symbolic Mathematics Solvers too! Kimia Noorbakhsh Modar Sulaiman M. Sharifi Kallol Roy Pooyan Jamshidi LRM 28 18 0 07 Oct 2021
Learning to Synthesize Programs as Interpretable and Generalizable Policies Dweep Trivedi Jesse Zhang Shao-Hua Sun Joseph J. Lim NAI 24 72 0 31 Aug 2021
The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers Róbert Csordás Kazuki Irie Jürgen Schmidhuber ViT 30 129 0 26 Aug 2021
Evaluating Large Language Models Trained on Code Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Pondé ... Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever Wojciech Zaremba ELM ALM 86 5,161 0 07 Jul 2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks Avi Schwarzschild Eitan Borgnia Arjun Gupta Furong Huang U. Vishkin Micah Goldblum Tom Goldstein 24 74 0 08 Jun 2021
Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers Daniel Tanneberg Elmar Rueckert Jan Peters 38 6 0 17 May 2021
Neural Algorithmic Reasoning Petar Velickovic Charles Blundell NAI OOD 25 99 0 06 May 2021
CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model with Attention for Predicting Trends of Financial Markets Jia Wang Tong Sun Benyuan Liu Yu Cao Hongwei Zhu AI4TS 39 64 0 08 Apr 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks Rodrigo Nogueira Zhiying Jiang Jimmy J. Li LRM 24 123 0 25 Feb 2021
Combinatorial optimization and reasoning with graph neural networks Quentin Cappart Didier Chételat Elias Boutros Khalil Andrea Lodi Christopher Morris Petar Velickovic AI4CE 37 352 0 18 Feb 2021
Neural Sequence-to-grid Module for Learning Symbolic Rules Segwang Kim Hyoungwook Nam Joonyoung Kim Kyomin Jung NAI 72 11 0 13 Jan 2021
On the Binding Problem in Artificial Neural Networks Klaus Greff Sjoerd van Steenkiste Jürgen Schmidhuber OCL 233 255 0 09 Dec 2020
Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks David Bieber Charles Sutton Hugo Larochelle Daniel Tarlow GNN 27 43 0 23 Oct 2020
It's Hard for Neural Networks To Learn the Game of Life Jacob Mitchell Springer Garrett Kenyon 27 21 0 03 Sep 2020
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures Daniel Furrer Marc van Zee Nathan Scales Nathanael Scharli CoGe 26 113 0 17 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving Pasquale Minervini Sebastian Riedel Pontus Stenetorp Edward Grefenstette Tim Rocktaschel LRM 45 96 0 13 Jul 2020
Hierarchically Compositional Tasks and Deep Convolutional Networks Arturo Deza Q. Liao Andrzej Banburski T. Poggio BDL OOD 33 2 0 24 Jun 2020
Neural Execution Engines: Learning to Execute Subroutines Yujun Yan Kevin Swersky Danai Koutra Parthasarathy Ranganathan Milad Hashemi NAI 16 40 0 15 Jun 2020
Learning advanced mathematical computations from examples Franccois Charton Amaury Hayat Guillaume Lample PINN 23 4 0 11 Jun 2020
Neural Power Units Niklas Heim Tomás Pevný Václav Smídl 29 9 0 02 Jun 2020
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths Andreas Robinson 42 0 0 18 Mar 2020
It's Not What Machines Can Learn, It's What We Cannot Teach Gal Yehuda Moshe Gabel Assaf Schuster FaML 19 37 0 21 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation Shu Yang Yuxin Wang Xiaowen Chu VLM AI4TS AI4CE 27 138 0 18 Feb 2020