Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking

30 November 2021
Ronen Tamari
Kyle Richardson
Aviad Sar-Shalom
Noam Kahlon
Nelson F. Liu
Reut Tsarfaty
Dafna Shahaf

Papers citing "Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking"

11 / 11 papers shown
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Mete Ismayilzada
Debjit Paul
Syrielle Montariol
Mor Geva
Antoine Bosselut
LRM
12
4
0
23 Oct 2023
Explainable Verbal Reasoner Plus (EVR+): A Natural Language Reasoning Framework that Supports Diverse Compositional Reasoning
Zhengzhong Liang
Zeyu Zhang
Steven Bethard
Mihai Surdeanu
ReLM
LRM
13
1
0
28 Apr 2023
Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Kyle Richardson
Ronen Tamari
Oren Sultan
Reut Tsarfaty
Dafna Shahaf
Ashish Sabharwal
KELM
13
7
0
15 Nov 2022
What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment
Matthew Finlayson
Kyle Richardson
Ashish Sabharwal
Peter Clark
16
12
0
19 Apr 2022
Pushing the Limits of Rule Reasoning in Transformers through Natural Language Satisfiability
Kyle Richardson
Ashish Sabharwal
ReLM
LRM
17
24
0
16 Dec 2021
ReaSCAN: Compositional Reasoning in Language Grounding
Zhengxuan Wu
Elisa Kreiss
Desmond C. Ong
Christopher Potts
CoGe
LRM
21
22
0
18 Sep 2021
PonderNet: Learning to Ponder
Andrea Banino
Jan Balaguer
Charles Blundell
PINN
AIMat
92
80
0
12 Jul 2021
DynaSent: A Dynamic Benchmark for Sentiment Analysis
Christopher Potts
Zhengxuan Wu
Atticus Geiger
Douwe Kiela
219
76
0
30 Dec 2020
To Test Machine Comprehension, Start by Defining Comprehension
Jesse Dunietz
Greg Burnham
Akash Bharadwaj
Owen Rambow
Jennifer Chu-Carroll
D. Ferrucci
FaML
52
64
0
04 May 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
Simpler Context-Dependent Logical Forms via Model Projections
R. Long
Panupong Pasupat
Percy Liang
196
102
0
16 Jun 2016