Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.10414
Cited By
On Calibration of LLM-based Guard Models for Reliable Content Moderation
14 October 2024
Hongfu Liu
Hengguan Huang
Hao Wang
Xiangming Gu
Ye Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On Calibration of LLM-based Guard Models for Reliable Content Moderation"
1 / 1 papers shown
Title
GuardReasoner: Towards Reasoning-based LLM Safeguards
Yue Liu
Hongcheng Gao
Shengfang Zhai
Jun-Xiong Xia
Tianyi Wu
Zhiwei Xue
Y. Chen
Kenji Kawaguchi
Jiaheng Zhang
Bryan Hooi
AI4TS
LRM
106
13
0
30 Jan 2025
1