1511.02301
Cited By
v4 (latest)
The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations
7 November 2015
Felix Hill
Antoine Bordes
S. Chopra
Jason Weston
RALM
Papers citing "The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations"
50 / 309 papers shown
Title
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
Anand Gopalakrishnan
Róbert Csordás
Jürgen Schmidhuber
M. C. Mozer
72
0
0
05 Sep 2025
Influence-driven Curriculum Learning for Pre-training on Limited Data
Loris Schoenegger
Lukas Thoma
Terra Blevins
Benjamin Roth
164
0
0
21 Aug 2025
Attention-Only Transformers via Unrolled Subspace Denoising
Peng Wang
Yifu Lu
Yaodong Yu
Druv Pai
Qing Qu
Yi Ma
ViT
235
3
0
04 Jun 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Robert Bamler
494
161
0
10 Apr 2025
Do Construction Distributions Shape Formal Language Learning In German BabyLMs?
Bastian Bunzeck
Daniel Duran
Sina Zarrieß
262
7
0
14 Mar 2025
Building a Rich Dataset to Empower the Persian Question Answering Systems
Mohsen Yazdinejad
Marjan Kaedi
166
0
0
31 Dec 2024
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
Patrick Haller
Jonas Golde
Alan Akbik
215
0
0
20 Dec 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Michael Y. Hu
Aaron Mueller
Candace Ross
Adina Williams
Tal Linzen
Chengxu Zhuang
Robert Bamler
Leshem Choshen
Alex Warstadt
Ethan Gotlieb Wilcox
415
35
0
06 Dec 2024
AntLM: Bridging Causal and Masked Language Models
Xinru Yu
Bin Guo
Shiwei Luo
Jiadong Wang
Changzhi Sun
Man Lan
CLL
281
3
0
04 Dec 2024
DRS: Deep Question Reformulation With Structured Output
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Nanyun Peng
Kai-Wei Chang
KELM
527
3
0
27 Nov 2024
When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?
Srikrishna Iyer
FedML
365
0
0
25 Nov 2024
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models
Nam V. Nguyen
Thong T. Doan
Luong Tran
Van Nguyen
Quang Pham
MoE
516
4
0
01 Nov 2024
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Weize Chen
Qixin Xu
Chen Qian
Cheng Yang
Zhiyuan Liu
Maosong Sun
LLMAG
203
14
0
10 Oct 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang
Qin Jin
351
14
0
26 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
222
0
0
04 Aug 2024
Gradient-based inference of abstract task representations for generalization in neural networks
Ali Hummos
Felipe del-Rio
Brabeeba Mien Wang
Julio Hurtado
Cristian B. Calderon
G. Yang
166
4
0
24 Jul 2024
MoEUT: Mixture-of-Experts Universal Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
Christopher Potts
Christopher D. Manning
MoE
178
28
0
25 May 2024
Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Chenhao Cui
Yufan Jiang
Shuangzhi Wu
Zhoujun Li
FaML
134
0
0
27 Apr 2024
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs
Kanishka Misra
Kyle Mahowald
402
36
0
28 Mar 2024
SyllabusQA: A Course Logistics Question Answering Dataset
Nigel Fernandez
Alexander Scarlatos
Andrew Lan
180
11
0
03 Mar 2024
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
259
6
0
26 Feb 2024
Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education
Rui Yang
Boming Yang
Sixun Ouyang
Tianwei She
Aosong Feng
Yuang Jiang
Freddy Lecue
Jinghui Lu
Irene Li
AI4Ed
186
7
0
22 Feb 2024
Triple-Encoders: Representations That Fire Together, Wire Together
Justus-Jonas Erker
Florian Mai
Nils Reimers
Gerasimos Spanakis
Iryna Gurevych
180
2
0
19 Feb 2024
Distractor Generation for Multiple-Choice Questions: A Survey of Methods, Datasets, and Evaluation
Elaf Alhazmi
Quan Z. Sheng
W. Zhang
Munazza Zaib
A. Alhazmi
AI4Ed
184
18
0
02 Feb 2024
DsDm: Model-Aware Dataset Selection with Datamodels
International Conference on Machine Learning (ICML), 2024
Logan Engstrom
Axel Feldmann
Aleksander Madry
OODD
218
86
0
23 Jan 2024
Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints
IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023
Kunpeng Guo
Dennis Diefenbach
Antoine Gourru
Christophe Gravier
271
2
0
17 Jan 2024
Building Efficient and Effective OpenQA Systems for Low-Resource Languages
Emrah Budur
Rıza Özçelik
Dilara Soylu
Omar Khattab
Tunga Güngör
Christopher Potts
205
7
0
07 Jan 2024
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Neural Information Processing Systems (NeurIPS), 2023
Róbert Csordás
Piotr Piekos
Kazuki Irie
Jürgen Schmidhuber
MoE
159
27
0
13 Dec 2023
Making Translators Privacy-aware on the User's Side
Ryoma Sato
133
2
0
07 Dec 2023
Not all layers are equally as important: Every Layer Counts BERT
Lucas Georges Gabriel Charpentier
David Samuel
220
24
0
03 Nov 2023
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou
Yutao Zhu
Zhipeng Chen
Wentong Chen
Wayne Xin Zhao
Xu Chen
Yankai Lin
Ji-Rong Wen
Jiawei Han
ELM
292
209
0
03 Nov 2023
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
127
4
0
30 Oct 2023
Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lixing Zhu
Runcong Zhao
Lin Gui
Yulan He
185
9
0
28 Oct 2023
BabyStories: Can Reinforcement Learning Teach Baby Language Models to Write Better Stories?
Xingmeng Zhao
Tongnian Wang
Sheri Osborn
Anthony Rios
106
11
0
25 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Yang Li
Lidia S. Chao
DeLMO
282
87
0
23 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
178
3
0
17 Oct 2023
Envisioning Narrative Intelligence: A Creative Visual Storytelling Anthology
International Conference on Human Factors in Computing Systems (CHI), 2023
Brett A. Halperin
S. Lukin
CoGe
153
29
0
06 Oct 2023
A Data Source for Reasoning Embodied Agents
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jack Lanchantin
Sainbayar Sukhbaatar
Gabriel Synnaeve
Yuxuan Sun
Kavya Srinet
Arthur Szlam
LM&Ro
LRM
160
10
0
14 Sep 2023
Prompt2Model: Generating Deployable Models from Natural Language Instructions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vijay Viswanathan
Chenyang Zhao
Amanda Bertsch
Tongshuang Wu
Graham Neubig
132
48
0
23 Aug 2023
Teach model to answer questions after comprehending the document
Ruiqing Sun
Ping Jian
FaML
128
0
0
18 Jul 2023
Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Vijeta Deshpande
Dan Pechi
Shree Thatte
Vladislav Lialin
Anna Rumshisky
238
11
0
26 May 2023
Can Large Language Models Capture Dissenting Human Voices?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Noah Lee
Na Min An
James Thorne
ALM
254
38
0
23 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fangkai Yang
Lu Wang
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
205
68
0
19 May 2023
Entity Tracking in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Najoung Kim
Sebastian Schuster
277
29
0
03 May 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
490
155
0
28 Apr 2023
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Morris Alper
Michael Fiman
Hadar Averbuch-Elor
VLM
LRM
165
18
0
21 Mar 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
575
17
0
17 Feb 2023
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Alex Warstadt
Leshem Choshen
Aaron Mueller
Adina Williams
Ethan Gotlieb Wilcox
Chengxu Zhuang
165
67
0
27 Jan 2023
RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2022
Shinhyeok Oh
HyeongRae Noh
Yoonseok Hong
Insoo Oh
195
0
0
15 Dec 2022
Open-world Story Generation with Structured Knowledge Enhancement: A Comprehensive Survey
Yuxin Wang
Jieru Lin
Zhiwei Yu
Wei Hu
Börje F. Karlsson
254
29
0
09 Dec 2022