v1v2 (latest)

On the Impossible Safety of Large AI Models

30 September 2022

Papers citing "On the Impossible Safety of Large AI Models"

21 / 21 papers shown

Title
High-Probability Analysis of Online and Federated Zero-Order Optimisation Arya Akhavan David Janz El-Mahdi El-Mhamdi FedML 153 0 0 25 Sep 2025
What Does 'Human-Centred AI' Mean? Olivia Guest 119 1 0 26 Jul 2025
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects Reva Schwartz Rumman Chowdhury Akash Kundu Heather Frase Marzieh Fadaee ... Andrew Thompson Maya Carlyle Qinghua Lu Matthew Holmes Theodora Skeadas 246 7 0 24 May 2025
Approaching the Harm of Gradient Attacks While Only Flipping Labels Abdessamad El-Kabid El-Mahdi El-Mhamdi AAML 223 1 0 28 Feb 2025
On the Byzantine Fault Tolerance of signSGD with Majority Vote Emanuele Mengoli Luzius Moll Virgilio Strozzi El-Mahdi El-Mhamdi AAML FedML 279 1 0 26 Feb 2025
A Case for Specialisation in Non-Human Entities El-Mahdi El-Mhamdi Lê Nguyên Hoang Mariame Tighanimine 118 0 0 05 Feb 2025
A Survey on Offensive AI Within Cybersecurity Sahil Girhepuje Aviral Verma Gaurav Raina AAML 128 7 0 26 Sep 2024
The poison of dimensionality Lê-Nguyên Hoang 225 3 0 25 Sep 2024
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models Simha Sankar Baradwaj Destiny Gilliland Jack Rincon Henning Hermjakob Yu Yan ... Dean Wang Karol Watson Alex Bui Wei Wang Peipei Ping 243 15 0 18 Jul 2024
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards Zhimin Zhao A. A. Bangash F. Côgo Bram Adams Ahmed E. Hassan 658 2 0 04 Jul 2024
Data Quality in Edge Machine Learning: A State-of-the-Art Survey M. D. Belgoumri Mohamed Reda Bouadjenek Sunil Aryal Hakim Hacid 260 2 0 01 Jun 2024
Large Language Models for Cyber Security: A Systematic Literature Review HanXiang Xu Shenao Wang Ningke Li Kaidi Wang Yanjie Zhao Kai Chen Ting Yu Yang Liu Haoyu Wang 502 94 0 08 May 2024
A Comprehensive Study of Knowledge Editing for Large Language Models Ningyu Zhang Yunzhi Yao Bo Tian Peng Wang Shumin Deng ... Lei Liang Qing Cui Xiao-Jun Zhu Jun Zhou Huajun Chen KELM 365 122 0 02 Jan 2024
SoK: Memorization in General-Purpose Large Language Models Valentin Hartmann Anshuman Suri Vincent Bindschaedler David Evans Shruti Tople Robert West KELM LLMAG 264 34 0 24 Oct 2023
Can LLM-Generated Misinformation Be Detected?International Conference on Learning Representations (ICLR), 2023 Canyu Chen Kai Shu DeLMO 633 228 0 25 Sep 2023
Large Language Models for Software Engineering: A Systematic Literature ReviewACM Transactions on Software Engineering and Methodology (TOSEM), 2023 Xinying Hou Yanjie Zhao Yue Liu Zhou Yang Kailong Wang Li Li Xiapu Luo David Lo John C. Grundy Haoyu Wang 272 706 0 21 Aug 2023
LLM Censorship: A Machine Learning Challenge or a Computer Security Problem? David Glukhov Ilia Shumailov Y. Gal Nicolas Papernot Vardan Papyan AAML ELM 178 71 0 20 Jul 2023
Jailbroken: How Does LLM Safety Training Fail?Neural Information Processing Systems (NeurIPS), 2023 Alexander Wei Nika Haghtalab Jacob Steinhardt 557 1,335 0 05 Jul 2023
Citation: A Key to Building Responsible and Accountable Large Language Models Jie Huang Kevin Chen-Chuan Chang HILM 246 28 0 05 Jul 2023
Position: Considerations for Differentially Private Learning with Large-Scale Public PretrainingInternational Conference on Machine Learning (ICML), 2022 Florian Tramèr Gautam Kamath Nicholas Carlini SILM 313 94 0 13 Dec 2022
A Non-Expert's Introduction to Data Ethics for Mathematicians M. A. Porter FaML 229 3 0 18 Jan 2022