2402.16893
Cited By
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
23 February 2024
Shenglai Zeng
Jiankun Zhang
Pengfei He
Yue Xing
Yiding Liu
Han Xu
Jie Ren
Shuaiqiang Wang
Dawei Yin
Yi Chang
Jiliang Tang
SILM
Papers citing
"The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)"
50 / 51 papers shown
Title
PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization
Y. Wang
Yanan Cao
Yubing Ren
Fang Fang
Zheng-Shen Lin
Binxing Fang
PILM
26
0
0
15 May 2025
Securing RAG: A Risk Assessment and Mitigation Framework
Lukas Ammann
Sara Ott
Christoph R. Landolt
Marco P. Lehmann
SILM
14
0
0
13 May 2025
Privacy Risks and Preservation Methods in Explainable Artificial Intelligence: A Scoping Review
Sonal Allana
Mohan Kankanhalli
Rozita Dara
27
0
0
05 May 2025
A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage
Rui Xin
Niloofar Mireshghallah
Shuyue Stella Li
Michael Duan
Hyunwoo Kim
Yejin Choi
Yulia Tsvetkov
Sewoong Oh
Pang Wei Koh
69
1
0
28 Apr 2025
Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation
Qianren Mao
Qili Zhang
Hanwen Hao
Zhentao Han
Runhua Xu
...
Bo Li
Y. Song
Jin Dong
Jianxin Li
Philip S. Yu
66
0
0
27 Apr 2025
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
Bang An
Shiyue Zhang
Mark Dredze
54
0
0
25 Apr 2025
Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks
Yang Janet Liu
Bingjie Yan
Tianyuan Zou
Jianqing Zhang
Zixuan Gu
...
J. Li
Xiaozhou Ye
Ye Ouyang
Qiang Yang
Y. Zhang
ALM
95
1
0
24 Apr 2025
Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey
Aoran Gan
Hao Yu
Kai Zhang
Qi Liu
Wenyu Yan
Zhenya Huang
Shiwei Tong
Guoping Hu
RALM
3DV
38
0
0
21 Apr 2025
The Other Side of the Coin: Exploring Fairness in Retrieval-Augmented Generation
Z. Zhang
Ning Li
Qi Liu
Rui Li
W. Gao
Qingyang Mao
Zhenya Huang
Baosheng Yu
Dacheng Tao
RALM
34
0
0
11 Apr 2025
Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation
Yucheng Chu
Peng He
Hang Li
Haoyu Han
Kaiqi Yang
Yu Xue
Tingting Li
Joseph Krajcik
Jiliang Tang
AI4Ed
39
0
0
07 Apr 2025
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack
Cheng Wang
Yiwei Wang
Yujun Cai
Bryan Hooi
AAML
54
0
0
27 Mar 2025
Empowering GraphRAG with Knowledge Filtering and Integration
Kai Guo
Harry Shomer
Shenglai Zeng
Haoyu Han
Yu Wang
Jiliang Tang
61
0
0
18 Mar 2025
Privacy-Aware RAG: Secure and Isolated Knowledge Retrieval
Pengcheng Zhou
Yinglun Feng
Zhongliang Yang
SILM
58
0
0
17 Mar 2025
Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation
Yinuo Liu
Zenghui Yuan
Guiyao Tie
Jiawen Shi
Lichao Sun
Neil Zhenqiang Gong
36
1
0
08 Mar 2025
Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication
Daniil Filienko
Mahek Nizar
Javier Roberti
Denise Galdamez
Haroon Jakher
Sarah Iribarren
Weichao Yuwen
Martine De Cock
LM&MA
31
0
0
28 Feb 2025
Mitigating the Privacy Issues in Retrieval-Augmented Generation (RAG) via Pure Synthetic Data
Shenglai Zeng
Jiankun Zhang
Pengfei He
J. Ren
Tianqi Zheng
Hanqing Lu
Han Xu
Hui Liu
Yue Xing
Jiliang Tang
132
9
0
21 Feb 2025
Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks
Ang Li
Yin Zhou
Vethavikashini Chithrra Raghuram
Tom Goldstein
Micah Goldblum
AAML
71
7
0
12 Feb 2025
URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT
Long Nguyen
Tho Quan
41
1
0
27 Jan 2025
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Zhuoran Jin
Hongbang Yuan
Tianyi Men
Pengfei Cao
Yubo Chen
Kang-Jun Liu
Jun Zhao
ALM
82
7
0
18 Dec 2024
RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Yihang Cheng
Lan Zhang
Junyang Wang
Mu Yuan
Yunhao Yao
77
0
0
17 Dec 2024
Towards Action Hijacking of Large Language Model-based Agent
Yuyang Zhang
Kangjie Chen
Xudong Jiang
Yuxiang Sun
Run Wang
Lina Wang
LLMAG
AAML
73
2
0
14 Dec 2024
Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective
Shenglai Zeng
Jiankun Zhang
Bingheng Li
Yuping Lin
Tianqi Zheng
...
Hui Liu
Yue Xing
Monica Xiao Cheng
Jiliang Tang
RALM
66
3
0
21 Nov 2024
RAG-Thief: Scalable Extraction of Private Data from Retrieval-Augmented Generation Applications with Agent-based Attacks
Changyue Jiang
Xudong Pan
Geng Hong
Chenfu Bao
Min Yang
SILM
72
9
0
21 Nov 2024
SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach
Ruoxi Sun
Jiamin Chang
Hammond Pearce
Chaowei Xiao
B. Li
Qi Wu
Surya Nepal
Minhui Xue
30
0
0
17 Nov 2024
Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors
Yuefeng Peng
Junda Wang
Hong-ye Yu
Amir Houmansadr
SILM
50
2
0
03 Nov 2024
Mask-based Membership Inference Attacks for Retrieval-Augmented Generation
Mingrui Liu
Sixiao Zhang
Cheng Long
AAML
50
2
0
26 Oct 2024
LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies
Ameer Hamza
Abdullah
Yong Hyun Ahn
Sungyoung Lee
Seong Tae Kim
24
2
0
07 Oct 2024
Ward: Provable RAG Dataset Inference via LLM Watermarks
Nikola Jovanović
Robin Staab
Maximilian Baader
Martin Vechev
86
1
0
04 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
69
7
0
03 Oct 2024
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Yujia Zhou
Yan Liu
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Zheng Liu
Chaozhuo Li
Zhicheng Dou
Tsung-Yi Ho
Philip S. Yu
3DV
RALM
50
26
0
16 Sep 2024
Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking
Stav Cohen
Ron Bitton
Ben Nassi
34
4
0
12 Sep 2024
Optimizing RAG Techniques for Automotive Industry PDF Chatbots: A Case Study with Locally Deployed Ollama Models
Fei Liu
Zejun Kang
Xing Han
37
4
0
12 Aug 2024
ConfusedPilot: Confused Deputy Risks in RAG-based LLMs
Ayush RoyChowdhury
Mulong Luo
Prateek Sahu
Sarbartha Banerjee
Mohit Tiwari
SILM
38
0
0
09 Aug 2024
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
Md. Nazmus Sakib
Md Athikul Islam
Royal Pathak
Md Mashrur Arifin
ALM
PILM
29
2
0
01 Aug 2024
Blockchain for Large Language Model Security and Safety: A Holistic Survey
Caleb Geren
Amanda Board
Gaby G. Dagher
Tim Andersen
Jun Zhuang
44
6
0
26 Jul 2024
SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest
Christoforus Yoga Haryanto
Minh Hieu Vu
Trung Duc Nguyen
Emily Lomempow
Yulia Nurliana
Sona Taheri
22
2
0
01 Jul 2024
Seeing Is Believing: Black-Box Membership Inference Attacks Against Retrieval Augmented Generation
Y. Li
Gaoyang Liu
Yang Yang
Chen Wang
AAML
33
3
0
27 Jun 2024
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
Zhepei Wei
Wei-Lin Chen
Yu Meng
RALM
53
12
0
19 Jun 2024
Unique Security and Privacy Threats of Large Language Model: A Comprehensive Survey
Shang Wang
Tianqing Zhu
Bo Liu
Ming Ding
Xu Guo
Dayong Ye
Wanlei Zhou
Philip S. Yu
PILM
57
16
0
12 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
42
22
0
04 Jun 2024
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
Maya Anderson
Guy Amit
Abigail Goldsteen
AAML
37
13
0
30 May 2024
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models
Wenqi Fan
Yujuan Ding
Liang-bo Ning
Shijie Wang
Hengyun Li
Dawei Yin
Tat-Seng Chua
Qing Li
RALM
3DV
38
181
0
10 May 2024
Introducing Super RAGs in Mistral 8x7B-v1
Ayush Thakur
Raghav Gupta
VLM
33
2
0
13 Apr 2024
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Zhenting Qi
Hanlin Zhang
Eric Xing
Sham Kakade
Hima Lakkaraju
SILM
40
17
0
27 Feb 2024
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
31
3
0
31 Jul 2023
Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems using Differentially Private Language Models
Aldo G. Carranza
Rezsa Farahani
Natalia Ponomareva
Alexey Kurakin
Matthew Jagielski
Milad Nasr
SyDa
12
7
0
10 May 2023
Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
Xin Cheng
Di Luo
Xiuying Chen
Lemao Liu
Dongyan Zhao
Rui Yan
RALM
145
91
0
03 May 2023
Memorization in NLP Fine-tuning Methods
Fatemehsadat Mireshghallah
Archit Uniyal
Tianhao Wang
David E. Evans
Taylor Berg-Kirkpatrick
AAML
61
39
0
25 May 2022
TEM: High Utility Metric Differential Privacy on Text
Ricardo Silva Carvalho
Theodore Vasiloudis
Oluwaseyi Feyisetan
34
36
0
16 Jul 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
237
588
0
14 Jul 2021