ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.00692
  4. Cited By
Stress Test Evaluation for Natural Language Inference

Stress Test Evaluation for Natural Language Inference

2 June 2018
Aakanksha Naik
Abhilasha Ravichander
Norman M. Sadeh
Carolyn Rose
Graham Neubig
    ELM
ArXivPDFHTML

Papers citing "Stress Test Evaluation for Natural Language Inference"

37 / 237 papers shown
Title
Probing the Probing Paradigm: Does Probing Accuracy Entail Task
  Relevance?
Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?
Abhilasha Ravichander
Yonatan Belinkov
Eduard H. Hovy
26
123
0
02 May 2020
Neural Natural Language Inference Models Partially Embed Theories of
  Lexical Entailment and Negation
Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation
Atticus Geiger
Kyle Richardson
Christopher Potts
8
4
0
30 Apr 2020
Elastic weight consolidation for better bias inoculation
Elastic weight consolidation for better bias inoculation
James Thorne
Andreas Vlachos
17
11
0
29 Apr 2020
The Curse of Performance Instability in Analysis Datasets: Consequences,
  Source, and Suggestions
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions
Xiang Zhou
Yixin Nie
Hao Tan
Joey Tianyi Zhou
12
40
0
28 Apr 2020
On Adversarial Examples for Biomedical NLP Tasks
On Adversarial Examples for Biomedical NLP Tasks
Vladimir Araujo
Andrés Carvallo
Carlos Aspillaga
Denis Parra
MedIm
AAML
OOD
9
13
0
23 Apr 2020
Pretrained Transformers Improve Out-of-Distribution Robustness
Pretrained Transformers Improve Out-of-Distribution Robustness
Dan Hendrycks
Xiaoyuan Liu
Eric Wallace
Adam Dziedzic
R. Krishnan
D. Song
OOD
10
428
0
13 Apr 2020
Overestimation of Syntactic Representationin Neural Language Models
Overestimation of Syntactic Representationin Neural Language Models
Jordan Kodner
Nitish Gupta
18
12
0
10 Apr 2020
Translation Artifacts in Cross-lingual Transfer Learning
Translation Artifacts in Cross-lingual Transfer Learning
Mikel Artetxe
Gorka Labaka
Eneko Agirre
19
114
0
09 Apr 2020
Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature
  and PRESupposition
Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition
Paloma Jeretic
Alex Warstadt
Suvrat Bhooshan
Adina Williams
ReLM
AI4CE
23
109
0
07 Apr 2020
Evaluating Models' Local Decision Boundaries via Contrast Sets
Evaluating Models' Local Decision Boundaries via Contrast Sets
Matt Gardner
Yoav Artzi
Victoria Basmova
Jonathan Berant
Ben Bogin
...
Sanjay Subramanian
Reut Tsarfaty
Eric Wallace
Ally Zhang
Ben Zhou
ELM
35
84
0
06 Apr 2020
TyDi QA: A Benchmark for Information-Seeking Question Answering in
  Typologically Diverse Languages
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
35
591
0
10 Mar 2020
HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in
  Natural Language Inference
HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in Natural Language Inference
Tianyu Liu
Xin Zheng
Baobao Chang
Zhifang Sui
43
23
0
05 Mar 2020
Stress Test Evaluation of Transformer-based Models in Natural Language
  Understanding Tasks
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
39
31
0
14 Feb 2020
Adversarial Filters of Dataset Biases
Adversarial Filters of Dataset Biases
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
36
220
0
10 Feb 2020
Stance Detection Benchmark: How Robust Is Your Stance Detection?
Stance Detection Benchmark: How Robust Is Your Stance Detection?
Benjamin Schiller
Johannes Daxenberger
Iryna Gurevych
11
95
0
06 Jan 2020
What Does My QA Model Know? Devising Controlled Probes using Expert
  Knowledge
What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge
Kyle Richardson
Ashish Sabharwal
25
45
0
31 Dec 2019
Adversarial Analysis of Natural Language Inference Systems
Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien
Jugal Kalita
AAML
36
12
0
07 Dec 2019
Question Answering for Privacy Policies: Combining Computational and
  Legal Perspectives
Question Answering for Privacy Policies: Combining Computational and Legal Perspectives
Abhilasha Ravichander
A. Black
Shomir Wilson
Thomas B. Norton
Norman M. Sadeh
AILaw
17
104
0
03 Nov 2019
Posing Fair Generalization Tasks for Natural Language Inference
Posing Fair Generalization Tasks for Natural Language Inference
Atticus Geiger
Ignacio Cases
L. Karttunen
Christopher Potts
17
48
0
03 Nov 2019
Adversarial Music: Real World Audio Adversary Against Wake-word
  Detection System
Adversarial Music: Real World Audio Adversary Against Wake-word Detection System
Juncheng Billy Li
Shuhui Qu
Xinjian Li
Joseph Szurley
J. Zico Kolter
Florian Metze
AAML
10
63
0
31 Oct 2019
Adversarial NLI: A New Benchmark for Natural Language Understanding
Adversarial NLI: A New Benchmark for Natural Language Understanding
Yixin Nie
Adina Williams
Emily Dinan
Joey Tianyi Zhou
Jason Weston
Douwe Kiela
28
977
0
31 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled
  Variance in Adversarial Datasets
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
19
37
0
21 Oct 2019
MonaLog: a Lightweight System for Natural Language Inference Based on
  Monotonicity
MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity
Hai Hu
Qi Chen
Kyle Richardson
A. Mukherjee
L. Moss
Sandra Kübler
14
41
0
19 Oct 2019
SesameBERT: Attention for Anywhere
SesameBERT: Attention for Anywhere
Ta-Chun Su
Hsiang-Chih Cheng
28
7
0
08 Oct 2019
Improving Generalization by Incorporating Coverage in Natural Language
  Inference
Improving Generalization by Incorporating Coverage in Natural Language Inference
N. Moosavi
Prasetya Ajie Utama
Andreas Rucklé
Iryna Gurevych
NAI
9
4
0
19 Sep 2019
Probing Natural Language Inference Models through Semantic Fragments
Probing Natural Language Inference Models through Semantic Fragments
Kyle Richardson
Hai Hu
L. Moss
Ashish Sabharwal
6
148
0
16 Sep 2019
A Logic-Driven Framework for Consistency of Neural Models
A Logic-Driven Framework for Consistency of Neural Models
Tao Li
Vivek Gupta
Maitrey Mehta
Vivek Srikumar
AI4CE
18
101
0
31 Aug 2019
Unlearn Dataset Bias in Natural Language Inference by Fitting the
  Residual
Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual
He He
Sheng Zha
Haohan Wang
16
197
0
28 Aug 2019
Can neural networks understand monotonicity reasoning?
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
22
80
0
15 Jun 2019
SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for
  Evaluating Natural Language Inference
SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference
Martin Schmitt
Hinrich Schütze
14
14
0
04 Jun 2019
Misleading Failures of Partial-input Baselines
Misleading Failures of Partial-input Baselines
Shi Feng
Eric Wallace
Jordan L. Boyd-Graber
25
0
0
14 May 2019
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
17
2,255
0
02 May 2019
HELP: A Dataset for Identifying Shortcomings of Neural Models in
  Monotonicity Reasoning
HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
11
61
0
27 Apr 2019
Several Experiments on Investigating Pretraining and Knowledge-Enhanced
  Models for Natural Language Inference
Several Experiments on Investigating Pretraining and Knowledge-Enhanced Models for Natural Language Inference
Tianda Li
Xiao-Dan Zhu
Quan Liu
Qian Chen
Zhigang Chen
Si Wei
22
17
0
27 Apr 2019
Probing What Different NLP Tasks Teach Machines about Function Word
  Comprehension
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension
Najoung Kim
Roma Patel
Adam Poliak
Alex Jinpeng Wang
Patrick Xia
...
Alexis Ross
Tal Linzen
Benjamin Van Durme
Samuel R. Bowman
Ellie Pavlick
20
105
0
25 Apr 2019
Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets
Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets
Nelson F. Liu
Roy Schwartz
Noah A. Smith
AAML
23
104
0
04 Apr 2019
Hypothesis Only Baselines in Natural Language Inference
Hypothesis Only Baselines in Natural Language Inference
Adam Poliak
Jason Naradowsky
Aparajita Haldar
Rachel Rudinger
Benjamin Van Durme
190
576
0
02 May 2018
Previous
12345