Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.04044
Cited By
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models
7 November 2023
Haoran Li
Dadi Guo
Donghao Li
Wei Fan
Qi Hu
Xin Liu
Chunkit Chan
Duanyi Yao
Yuan Yao
Yangqiu Song
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models"
10 / 10 papers shown
Title
Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents
C. L. P. Chen
Zhiping Zhang
Ibrahim Khalilov
Bingcan Guo
Simret Araya Gebreegziabher
Yanfang Ye
Ziang Xiao
Yaxing Yao
Tianshi Li
T. Li
LLMAG
ELM
87
0
0
24 Apr 2025
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Yujun Zhou
Jingdong Yang
Kehan Guo
Pin-Yu Chen
Tian Gao
...
Tian Gao
Werner Geyer
Nuno Moniz
Nitesh V Chawla
Xiangliang Zhang
35
4
0
18 Oct 2024
Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence
Haoran Li
Mingshi Xu
Yangqiu Song
77
43
0
04 May 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
134
344
0
13 Oct 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
278
3,835
0
18 Apr 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
267
1,798
0
14 Dec 2020
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
404
2,576
0
03 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
1