TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection

12 February 2024

Papers citing "TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection"

Title
Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Sungwon Park, Sungwon Han, Xing Xie, Jae-Gil Lee, Meeyoung Cha | - | 46 1 0 | 17 Jun 2024
Robust Domain Misinformation Detection via Multi-modal Feature Alignment | Hui Liu, Wenya Wang, Hao Sun, Anderson de Rezende Rocha, Haoliang Li | - | 30 11 0 | 24 Nov 2023
ZYN: Zero-Shot Reward Models with Yes-No Questions for RLAIF | Víctor Gallego | SyDa | 33 4 0 | 11 Aug 2023
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Kevin Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt | - | 210 486 0 | 01 Nov 2022
Rethinking Attention-Model Explainability through Faithfulness Violation Test | Y. Liu, Haoliang Li, Yangyang Guo, Chen Kong, Jing Li, Shiqi Wang | FAtt | 116 41 0 | 28 Jan 2022
Trustworthy AI: From Principles to Practices | Bo-wen Li, Peng Qi, Bo Liu, Shuai Di, Jingen Liu, Jiquan Pei, Jinfeng Yi, Bowen Zhou | - | 108 349 0 | 04 Oct 2021
Making Pre-trained Language Models Better Few-shot Learners | Tianyu Gao, Adam Fisch, Danqi Chen | - | 241 1,898 0 | 31 Dec 2020
Scaling Laws for Neural Language Models | Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei | - | 220 4,424 0 | 23 Jan 2020