2002.04108
v3 (latest)
Adversarial Filters of Dataset Biases
International Conference on Machine Learning (ICML), 2020
10 February 2020
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
Papers citing
"Adversarial Filters of Dataset Biases"
50 / 169 papers shown
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
Hongwei Liu
J. Liu
Shudong Liu
Haodong Duan
Yuqiang Li
...
Conghui He
Qi Zhang
Songyang Zhang
Lei Bai
Kai Chen
LRM
ALM
ELM
533
2
0
18 Nov 2025
Towards Human-AI Synergy in Requirements Engineering: A Framework and Preliminary Study
International Conference on Intelligent Data Science Technologies and Applications (IDSTA), 2025
Mateen Ahmed Abbasi
Petri Ihantola
T. Mikkonen
Niko Mäkitalo
144
0
0
28 Oct 2025
SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust Classification
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
403
1
0
17 Jun 2025
Improving the OOD Performance of Closed-Source LLMs on NLI Through Strategic Data Selection
Joe Stacey
Lisa Alazraki
Aran Ubhi
Beyza Ermis
Aaron Mueller
Marek Rei
457
0
0
26 May 2025
DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake Attribution
IEEE transactions on multimedia (TMM), 2025
Ming-Hui Liu
Xiao-Qian Liu
Xin Luo
Xin-Shun Xu
296
4
0
07 May 2025
MINERVA: Evaluating Complex Video Reasoning
Arsha Nagrani
Sachit Menon
Ahmet Iscen
Shyamal Buch
Ramin Mehran
...
Yukun Zhu
Carl Vondrick
Mikhail Sirotenko
Cordelia Schmid
Tobias Weyand
427
18
0
01 May 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
640
1
0
25 Apr 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
568
1
0
24 Apr 2025
Cancer-Myth: Evaluating Large Language Models on Patient Questions with False Presuppositions
Peng Guo
Tianqi Chen
Ching Ying Lin
Jade Law
Mazen Jizzini
Jorge J. Nieva
Ruishan Liu
Robin Jia
446
1
0
15 Apr 2025
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
Computer Vision and Pattern Recognition (CVPR), 2025
Hao Du
Bo Wu
Yan Lu
Zhendong Mao
270
2
0
08 Apr 2025
Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing
Vishnu Asutosh Dasu
Md Rafi Ur Rashid
Vipul Gupta
Saeid Tizpaz-Niari
Gang Tan
AAML
564
2
0
20 Mar 2025
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation
Jie He
Tao Wang
Deyi Xiong
Qun Liu
ELM
LRM
436
35
0
05 Mar 2025
Addressing Bias in Generative AI: Challenges and Research Opportunities in Information Management
Information Manager (The) (TIM), 2025
Xiahua Wei
Naveen Kumar
Han Zhang
442
52
0
22 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
943
27
0
31 Dec 2024
The Evolution and Future Perspectives of Artificial Intelligence Generated Content
Chengzhang Zhu
Luobin Cui
Ying Tang
Jiacun Wang
468
4
0
02 Dec 2024
SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts
International Conference on Computational Linguistics (COLING), 2024
Aihua Pei
Zehua Yang
Shunan Zhu
Ruoxi Cheng
Ju Jia
AAML
464
6
0
01 Dec 2024
Diagnosing Medical Datasets with Training Dynamics
Laura Wenderoth
180
0
0
03 Nov 2024
Improving Model Evaluation using SMART Filtering of Benchmark Datasets
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Vipul Gupta
Candace Ross
David Pantoja
R. Passonneau
Megan Ung
Adina Williams
790
17
0
26 Oct 2024
Disentangling Hate Across Target Identities
Yiping Jin
Leo Wanner
Aneesh Moideen Koya
263
1
0
14 Oct 2024
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Shashank Yadav
Rohan Tomar
Garvit Jain
Chirag Ahooja
Shubham Chaudhary
Charles Elkan
345
1
0
05 Oct 2024
The Hard Positive Truth about Vision-Language Compositionality
European Conference on Computer Vision (ECCV), 2024
Amita Kamath
Cheng-Yu Hsieh
Kai-Wei Chang
Ranjay Krishna
CLIP
CoGe
VLM
327
18
0
26 Sep 2024
Misrepresented Technological Solutions in Imagined Futures: The Origins and Dangers of AI Hype in the Research Community
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Savannah Thais
171
7
0
08 Aug 2024
Compare without Despair: Reliable Preference Evaluation with Generation Separability
Sayan Ghosh
Tejas Srinivasan
Swabha Swayamdipta
463
4
0
02 Jul 2024
Learning to Correct for QA Reasoning with Black-box LLMs
Jaehyung Kim
Dongyoung Kim
Yiming Yang
LRM
285
8
0
26 Jun 2024
KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs
Aihua Pei
Zehua Yang
Shunan Zhu
Ruoxi Cheng
Ju Jia
Lina Wang
370
2
0
16 Jun 2024
Towards Trustworthy AI: A Review of Ethical and Robust Large Language Models
Meftahul Ferdaus
Mahdi Abdelguerfi
Elias Ioup
Kendall N. Niles
Ken Pathak
Steve Sloan
458
35
0
01 Jun 2024
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models
Anthony Sarah
S. N. Sridhar
Maciej Szankin
Sairam Sundaresan
345
12
0
28 May 2024
Designing NLP Systems That Adapt to Diverse Worldviews
Claudiu Creanga
Liviu P. Dinu
262
3
0
18 May 2024
From Transformers to LLMs: A Systematic Survey of Efficiency Considerations in NLP
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
557
12
0
15 May 2024
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Melissa Ailem
Katerina Marazopoulou
Charlotte Siska
James Bono
423
40
0
25 Apr 2024
SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs
Jaehyung Kim
Jaehyun Nam
Sangwoo Mo
Jongjin Park
Sang-Woo Lee
Minjoon Seo
Jung-Woo Ha
Jinwoo Shin
AIFin
RALM
ELM
382
89
0
17 Apr 2024
Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations
Abhishek Anand
Negar Mokhberian
Prathyusha Naresh Kumar
Anweasha Saha
Zihao He
Ashwin Rao
Fred Morstatter
Kristina Lerman
288
12
0
06 Mar 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
369
6
0
20 Feb 2024
Revisiting the Dataset Bias Problem from a Statistical Perspective
European Conference on Artificial Intelligence (ECAI), 2024
Kien Do
D. Nguyen
Hung Le
T. Le
Dang Nguyen
Haripriya Harikumar
T. Tran
Santu Rana
Svetha Venkatesh
230
0
0
05 Feb 2024
Influence Scores at Scale for Efficient Language Data Sampling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nikhil Anand
Joshua Tan
Maria Minakova
TDI
312
4
0
27 Nov 2023
Data Similarity is Not Enough to Explain Language Model Performance
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Gregory Yauney
Emily Reif
David M. Mimno
255
11
0
15 Nov 2023
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuhang Zhou
Paiheng Xu
Xiaoyu Liu
Bang An
Wei Ai
Furong Huang
LRM
507
52
0
15 Nov 2023
Measuring Adversarial Datasets
Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
272
1
0
06 Nov 2023
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Indira Sen
Dennis Assenmacher
Mattia Samory
Isabelle Augenstein
Wil M.P. van der Aalst
Claudia Wagner
576
32
0
02 Nov 2023
Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Osman Batur İnce
Tanin Zeraati
Semih Yagcioglu
Yadollah Yaghoobzadeh
Erkut Erdem
Aykut Erdem
210
4
0
18 Oct 2023
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haochen Shi
Weiqi Wang
Tianqing Fang
Baixuan Xu
Wenxuan Ding
Xin Liu
Yangqiu Song
296
7
0
17 Oct 2023
An Investigation of LLMs' Inefficacy in Understanding Converse Relations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chengwen Qi
Bowen Li
Binyuan Hui
Bailin Wang
Jinyang Li
Jinwang Wu
Yuanjun Laili
326
14
0
08 Oct 2023
Mitigating Shortcuts in Language Models with Soft Label Encoding
International Conference on Language Resources and Evaluation (LREC), 2023
Zirui He
Huiqi Deng
Haiyan Zhao
Ninghao Liu
Jundong Li
218
2
0
17 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
262
2
0
08 Sep 2023
Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Joshua Forster Feinglass
Yezhou Yang
237
2
0
01 Sep 2023
Targeted Data Augmentation for bias mitigation
Agnieszka Mikołajczyk-Bareła
M. Ferlin
M. Grochowski
214
3
0
22 Aug 2023
A survey on bias in machine learning research
Agnieszka Mikołajczyk-Bareła
M. Grochowski
AI4CE
FaML
365
9
0
22 Aug 2023
Exploring Format Consistency for Instruction Tuning
Shi Liang
Runchu Tian
Kunlun Zhu
Yujia Qin
Huadong Wang
Xin Cong
Zhiyuan Liu
Xiaojiang Liu
Maosong Sun
ALM
320
16
0
28 Jul 2023
A Survey on Out-of-Distribution Evaluation of Neural NLP Models
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Xinzhe Li
Ming Liu
Shang Gao
Wray Buntine
291
24
0
27 Jun 2023
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality
Neural Information Processing Systems (NeurIPS), 2023
Cheng-Yu Hsieh
Jieyu Zhang
Zixian Ma
Aniruddha Kembhavi
Ranjay Krishna
CoGe
363
214
0
26 Jun 2023