arXiv:2402.07776
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
12 February 2024
Hui Liu
Wenya Wang
Haoru Li
Haoliang Li
Papers citing "TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection"
8 / 8 papers shown
Title
Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection
Sungwon Park
Sungwon Han
Xing Xie
Jae-Gil Lee
Meeyoung Cha
46
1
0
17 Jun 2024
Robust Domain Misinformation Detection via Multi-modal Feature Alignment
Hui Liu
Wenya Wang
Hao Sun
Anderson de Rezende Rocha
Haoliang Li
30
11
0
24 Nov 2023
ZYN: Zero-Shot Reward Models with Yes-No Questions for RLAIF
Víctor Gallego
SyDa
33
4
0
11 Aug 2023
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small
Kevin Wang
Alexandre Variengien
Arthur Conmy
Buck Shlegeris
Jacob Steinhardt
210
486
0
01 Nov 2022
Rethinking Attention-Model Explainability through Faithfulness Violation Test
Y. Liu
Haoliang Li
Yangyang Guo
Chen Kong
Jing Li
Shiqi Wang
FAtt
116
41
0
28 Jan 2022
Trustworthy AI: From Principles to Practices
Bo-wen Li
Peng Qi
Bo Liu
Shuai Di
Jingen Liu
Jiquan Pei
Jinfeng Yi
Bowen Zhou
108
349
0
04 Oct 2021
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
241
1,898
0
31 Dec 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
4,424
0
23 Jan 2020