This is not a Dataset: A Large Negation Benchmark to Challenge Large
Language Models

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

24 October 2023

Iker García-Ferrero

Itziar Gonzalez-Dios

Papers citing "This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models"

10 / 10 papers shown

Title
Reasoning Capabilities and Invariability of Large Language Models Alessandro Raganato Rafael Peñaloza Marco Viviani G. Pasi ReLM LRM 80 0 0 01 May 2025
From No to Know: Taxonomy, Challenges, and Opportunities for Negation Understanding in Multimodal Foundation Models Mayank Vatsa Aparna Bharati S. Mittal Richa Singh 53 0 0 10 Feb 2025
Know "No'' Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP J. Park Jungbeom Lee Jongyoon Song Sangwon Yu Dahuin Jung Sungroh Yoon 45 0 0 19 Jan 2025
Generating Diverse Negations from Affirmative Sentences Darian Rodriguez Vasquez Afroditi Papadaki 37 0 0 30 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints Thomas Palmeira Ferraz Kartik Mehta Yu-Hsiang Lin Haw-Shiuan Chang Shereen Oraby Sijia Liu Vivek Subramanian Tagyoung Chung Mohit Bansal Nanyun Peng 48 7 0 09 Oct 2024
Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack Xiaoyue Xu Qinyuan Ye Xiang Ren 38 6 0 23 Jul 2024
Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4 Lydia Uhler Verena Jordan Jürgen Buder Markus Huff F. Papenmeier 20 0 0 25 Apr 2024
Suppressing Pink Elephants with Direct Principle Feedback Louis Castricato Nathan Lile Suraj Anand Hailey Schoelkopf Siddharth Verma Stella Biderman 58 9 0 12 Feb 2024
Not another Negation Benchmark: The NaN-NLI Test Suite for Sub-clausal Negation Thinh Hung Truong Yulia Otmakhova Tim Baldwin Trevor Cohn Jey Han Lau Karin Verspoor 55 21 0 06 Oct 2022
Neural versus Phrase-Based Machine Translation Quality: a Case Study L. Bentivogli Arianna Bisazza Mauro Cettolo Marcello Federico 191 328 0 16 Aug 2016