Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1606.03475
Cited By
De-identification of Patient Notes with Recurrent Neural Networks
10 June 2016
Franck Dernoncourt
Ji Young Lee
Özlem Uzuner
Peter Szolovits
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"De-identification of Patient Notes with Recurrent Neural Networks"
50 / 120 papers shown
Improving the Performance of Radiology Report De-identification with Large-Scale Training and Benchmarking Against Cloud Vendor Methods
Eva Prakash
Maayane Attias
Pierre J. Chambon
Justin Xu
Steven QH Truong
Jean-Benoit Delbrouck
Tessa S. Cook
C. Langlotz
MedIm
197
0
0
06 Nov 2025
Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration
Guanchen Wu
Zuhui Chen
Yuzhang Xie
Carl Yang
LLMAG
179
2
0
17 Oct 2025
JEDA: Query-Free Clinical Order Search from Ambient Dialogues
Praphul Singh
Corey D Barrett
Sumana Srivasta
Amitabh Saikia
Irfan Bulu
Sri Gadde
Krishnaram Kenthapadi
161
0
0
16 Oct 2025
Stronger Re-identification Attacks through Reasoning and Aggregation
Lucas Georges Gabriel Charpentier
Pierre Lison
136
0
0
10 Oct 2025
Protecting De-identified Documents from Search-based Linkage Attacks
Pierre Lison
Mark Anderson
136
0
0
07 Oct 2025
Scalable multilingual PII annotation for responsible AI in LLMs
Bharti Meena
Joanna Skubisz
Harshit Rajgarhia
Nand Dave
Kiran Ganesh
Shivali Dalmia
Abhishek Mukherji
Vasudevan Sundarababu
182
0
0
03 Oct 2025
Not What the Doctor Ordered: Surveying LLM-based De-identification and Quantifying Clinical Information Loss
Kiana Aghakasiri
Noopur Zambare
JoAnn Thai
Carrie Ye
Mayur Mehta
J. Ross Mitchell
Mohamed Abdalla
147
2
0
17 Sep 2025
PRvL: Quantifying the Capabilities and Risks of Large Language Models for PII Redaction
Leon Garza
Anantaa Kotal
Aritran Piplai
Lavanya Elluri
Prajit Das
Aman Chadha
208
5
0
07 Aug 2025
Cross-Domain Transfer and Few-Shot Learning for Personal Identifiable Information Recognition
Junhong Ye
Xu Yuan
Xinying Qiu
224
0
0
16 Jul 2025
Thunder-DeID: Accurate and Efficient De-identification Framework for Korean Court Judgments
Sungen Hahm
Heejin Kim
Gyuseong Lee
Hyunji Park
Jaejin Lee
237
2
0
18 Jun 2025
Synopsis: Secure and private trend inference from encrypted semantic embeddings
Madelyne Xiao
Palak Jain
Micha Gorelick
Sarah Scheffler
536
0
0
29 May 2025
Automated Privacy Information Annotation in Large Language Model Interactions
Hang Zeng
Xiangyu Liu
Yong Hu
Chaoyue Niu
Fan Wu
Shaojie Tang
Guihai Chen
370
4
0
27 May 2025
Re-identification of De-identified Documents with Autoregressive Infilling
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Lucas Georges Gabriel Charpentier
Pierre Lison
262
4
0
19 May 2025
Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes
Studies in Health Technology and Informatics (SHTI), 2025
Guanchen Wu
Linzhi Zheng
Han Xie
Zhen Xiang
Jiaying Lu
Darren Liu
Delgersuren Bold
Yue Liu
Xiao Hu
Carl Yang
233
2
0
22 Apr 2025
PII-Bench: Evaluating Query-Aware Privacy Protection Systems
Hao Shen
Zhouhong Gu
Haokai Hong
Weili Han
386
3
0
25 Feb 2025
The Empirical Impact of Data Sanitization on Language Models
Anwesan Pal
Radhika Bhargava
Kyle Hinsz
Jacques Esterhuizen
Sudipta Bhattacharya
266
7
0
08 Nov 2024
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts
Tianle Gu
Kexin Huang
Ruilin Luo
Yuanqi Yao
Yujiu Yang
Yan Teng
Yingchun Wang
MU
431
18
0
18 Sep 2024
Synthetic4Health: Generating Annotated Synthetic Clinical Letters
Frontiers in Digital Health (Front. Digit. Health), 2024
Libo Ren
Samuel Belkadi
Lifeng Han
Warren Del-Pinto
Goran Nenadic
SyDa
227
6
0
14 Sep 2024
WPN: An Unlearning Method Based on N-pair Contrastive Learning in Language Models
European Conference on Artificial Intelligence (ECAI), 2024
Guitao Chen
Yunshen Wang
Hongye Sun
Guang Chen
MU
230
3
0
18 Aug 2024
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models
Simha Sankar Baradwaj
Destiny Gilliland
Jack Rincon
Henning Hermjakob
Yu Yan
...
Dean Wang
Karol Watson
Alex Bui
Wei Wang
Peipei Ping
419
19
0
18 Jul 2024
Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs
Sanjeet Singh
Shreya Gupta
Niralee Gupta
Naimish Sharma
Lokesh Srivastava
Vibhu Agarwal
Ashutosh Modi
213
0
0
08 Jul 2024
Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks
Arij Riabi
Menel Mahamdi
Virginie Mouilleron
Djamé Seddah
247
3
0
25 Jun 2024
Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
Dohyun Lee
Daniel Rim
Minseok Choi
Jaegul Choo
PILM
MU
264
12
0
20 Jun 2024
AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework
Ya-Lun Li
275
0
0
20 Jun 2024
Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
David Pissarra
Isabel Curioso
João Alveira
Duarte Pereira
Bruno Ribeiro
Tomas Souper
Vasco Gomes
A. Carreiro
Vitor Rolla
337
15
0
29 May 2024
Learnable Privacy Neurons Localization in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ruizhe Chen
Tianxiang Hu
Yang Feng
Zuo-Qiang Liu
263
31
0
16 May 2024
Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification
Dimitri Staufer
Frank Pallas
Bettina Berendt
238
4
0
02 May 2024
From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility
Pravneet Kaur
Gautam Siddharth Kashyap
Ankit Kumar
Md. Tabrez Nafis
Sandeep Kumar
Vikrant Shokeen
LM&MA
307
79
0
25 Feb 2024
Large Language Models are Advanced Anonymizers
Robin Staab
Mark Vero
Mislav Balunović
Martin Vechev
472
27
0
21 Feb 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
396
107
0
11 Jan 2024
SAIC: Integration of Speech Anonymization and Identity Classification
Ming Cheng
Xingjian Diao
Shitong Cheng
Wenjun Liu
319
9
0
23 Dec 2023
De-identification of clinical free text using natural language processing: A systematic review of current approaches
Aleksandar Kovacevic
Bojana Bašaragin
Nikola Milosevic
Goran Nenadic
OOD
206
33
0
28 Nov 2023
Reducing Privacy Risks in Online Self-Disclosures with Language Models
Yao Dou
Isadora Krsek
Tarek Naous
Anubha Kabra
Sauvik Das
Alan Ritter
Wei Xu
478
59
0
16 Nov 2023
Neural Text Sanitization with Privacy Risk Indicators: An Empirical Analysis
Anthia Papadopoulou
Pierre Lison
Mark Anderson
Lilja Øvrelid
Ildikó Pilán
87
0
0
22 Oct 2023
Validating transformers for redaction of text from electronic health records in real-world healthcare
IEEE International Conference on Healthcare Informatics (ICHI), 2023
Z. Kraljevic
Anthony Shek
Joshua Au Yeung
E. Sheldon
Mohammad Al-Agil
...
Xi Bai
Kawsar Noor
Anoop D. Shah
Richard J. B. Dobson
James T. Teo
MedIm
277
9
0
05 Oct 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey
Victoria Smith
Ali Shahin Shamsabadi
Carolyn Ashurst
Adrian Weller
PILM
539
42
0
27 Sep 2023
Towards End-User Development for IoT: A Case Study on Semantic Parsing of Cooking Recipes for Programming Kitchen Devices
Filippos Ventirozos
Sarah Clinch
Riza Batista-Navarro
107
1
0
25 Sep 2023
Grandma Karl is 27 years old -- research agenda for pseudonymization of research data
International Conference on Big Data Computing Service and Applications (ICBDCSA), 2023
Elena Volodina
Simon Dobnik
Therese Lindström Tiedemann
Xuan-Son Vu
147
4
0
30 Aug 2023
Protecting User Privacy in Remote Conversational Systems: A Privacy-Preserving framework based on text sanitization
Zhigang Kan
Linbo Qiao
Hao Yu
Liwen Peng
Yifu Gao
Dongsheng Li
275
29
0
14 Jun 2023
Privacy- and Utility-Preserving NLP with Anonymized Data: A case study of Pseudonymization
Oleksandr Yermilov
Vipul Raheja
Artem Chernodub
206
32
0
08 Jun 2023
Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization
Aman Priyanshu
Supriti Vijay
Ayush Kumar
Rakshit Naidu
Fatemehsadat Mireshghallah
SILM
439
33
0
24 May 2023
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
International Conference on Learning Representations (ICLR), 2023
Xinlu Zhang
Shiyang Li
Xianjun Yang
Chenxin Tian
Yao Qin
Linda R. Petzold
314
13
0
22 May 2023
In the Name of Fairness: Assessing the Bias in Clinical Record De-identification
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Yuxin Xiao
S. Lim
Tom Pollard
Marzyeh Ghassemi
281
20
0
18 May 2023
Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse
Methods of Information in Medicine (MIM), 2023
X. Tannier
Perceval Wajsburt
Alice Calliger
Basile Dura
Alexandre Mouchet
M. Hilka
R. Bey
232
16
0
23 Mar 2023
Man vs the machine: The Struggle for Effective Text Anonymisation in the Age of Large Language Models
Constantinos Patsakis
Nikolaos Lykousas
228
11
0
22 Mar 2023
Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework
Junzhuo Li
Xinwei Wu
Weilong Dong
Shuangzhi Wu
Chao Bian
Deyi Xiong
425
5
0
16 Dec 2022
Unintended Memorization and Timing Attacks in Named Entity Recognition Models
Proceedings on Privacy Enhancing Technologies (PoPETs), 2022
Rana Salal Ali
Benjamin Zi Hao Zhao
Hassan Jameel Asghar
Tham Nguyen
Ian D. Wood
Dali Kaafar
AAML
194
3
0
04 Nov 2022
An Easy-to-use and Robust Approach for the Differentially Private De-Identification of Clinical Textual Documents
International Conference on Health Informatics (ICHI), 2022
Yakini Tchouka
Jean-François Couchot
David Laiymani
OOD
165
2
0
02 Nov 2022
BRATsynthetic: Text De-identification using a Markov Chain Replacement Strategy for Surrogate Personal Identifying Information
J. D. Osborne
Tobias O'Leary
A. Nadimpalli
S. Aly
Richard Kennedy
77
3
0
28 Oct 2022
Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Joel Jang
Dongkeun Yoon
Sohee Yang
Sungmin Cha
Moontae Lee
Lajanugen Logeswaran
Minjoon Seo
KELM
PILM
MU
565
409
0
04 Oct 2022
1
2
3
Next
Page 1 of 3