ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.14440
  4. Cited By
Semantic Sensitivities and Inconsistent Predictions: Measuring the
  Fragility of NLI Models

Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

25 January 2024
Erik Arakelyan
Zhaoqi Liu
Isabelle Augenstein
    AAML
ArXivPDFHTML

Papers citing "Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models"

10 / 10 papers shown
Title
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
15
0
0
22 Apr 2025
reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs
reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs
Zhaofeng Wu
Michihiro Yasunaga
Andrew Cohen
Yoon Kim
Asli Celikyilmaz
Marjan Ghazvininejad
29
1
0
14 Mar 2025
SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs
Samir Abdaljalil
Hasan Kurban
Parichit Sharma
Erchin Serpedin
Rachad Atat
HILM
46
0
0
07 Mar 2025
Towards Logically Consistent Language Models via Probabilistic Reasoning
Towards Logically Consistent Language Models via Probabilistic Reasoning
Diego Calanzone
Stefano Teso
Antonio Vergari
LRM
HILM
18
2
0
19 Apr 2024
How often are errors in natural language reasoning due to paraphrastic
  variability?
How often are errors in natural language reasoning due to paraphrastic variability?
Neha Srikanth
Marine Carpuat
Rachel Rudinger
LRM
19
2
0
17 Apr 2024
Estimating the Causal Effects of Natural Logic Features in
  Transformer-Based NLI Models
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models
Julia Rozanova
Marco Valentino
André Freitas
CML
19
1
0
03 Apr 2024
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Probing Classifiers: Promises, Shortcomings, and Advances
Probing Classifiers: Promises, Shortcomings, and Advances
Yonatan Belinkov
221
291
0
24 Feb 2021
UnNatural Language Inference
UnNatural Language Inference
Koustuv Sinha
Prasanna Parthasarathi
Joelle Pineau
Adina Williams
196
94
0
30 Dec 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
1