Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

25 January 2024

Isabelle Augenstein

Papers citing "Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models"

Title | Authors | Tags | Counts | Date
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking | Jabez Magomere, Elena Kochkina, Samuel Mensah, Simerjot Kaur, Charese Smiley | | 15 0 0 | 22 Apr 2025
reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs | Zhaofeng Wu, Michihiro Yasunaga, Andrew Cohen, Yoon Kim, Asli Celikyilmaz, Marjan Ghazvininejad | | 29 1 0 | 14 Mar 2025
SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs | Samir Abdaljalil, Hasan Kurban, Parichit Sharma, Erchin Serpedin, Rachad Atat | HILM | 46 0 0 | 07 Mar 2025
Towards Logically Consistent Language Models via Probabilistic Reasoning | Diego Calanzone, Stefano Teso, Antonio Vergari | LRM, HILM | 18 2 0 | 19 Apr 2024
How often are errors in natural language reasoning due to paraphrastic variability? | Neha Srikanth, Marine Carpuat, Rachel Rudinger | LRM | 19 2 0 | 17 Apr 2024
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models | Julia Rozanova, Marco Valentino, André Freitas | CML | 19 1 0 | 03 Apr 2024
Large Language Models are Zero-Shot Reasoners | Takeshi Kojima, S. Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa | ReLM, LRM | 291 2,712 0 | 24 May 2022
Probing Classifiers: Promises, Shortcomings, and Advances | Yonatan Belinkov | | 221 291 0 | 24 Feb 2021
UnNatural Language Inference | Koustuv Sinha, Prasanna Parthasarathi, Joelle Pineau, Adina Williams | | 196 94 0 | 30 Dec 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding | Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman | ELM | 294 6,927 0 | 20 Apr 2018