ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.15259
  4. Cited By
On the Impossible Safety of Large AI Models
v1v2 (latest)

On the Impossible Safety of Large AI Models

30 September 2022
El-Mahdi El-Mhamdi
Sadegh Farhadkhani
R. Guerraoui
Nirupam Gupta
L. Hoang
Rafael Pinot
Sébastien Rouault
John Stephan
ArXiv (abs)PDFHTML

Papers citing "On the Impossible Safety of Large AI Models"

21 / 21 papers shown
Title
High-Probability Analysis of Online and Federated Zero-Order Optimisation
High-Probability Analysis of Online and Federated Zero-Order Optimisation
Arya Akhavan
David Janz
El-Mahdi El-Mhamdi
FedML
153
0
0
25 Sep 2025
What Does 'Human-Centred AI' Mean?
What Does 'Human-Centred AI' Mean?
Olivia Guest
119
1
0
26 Jul 2025
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
Reva Schwartz
Rumman Chowdhury
Akash Kundu
Heather Frase
Marzieh Fadaee
...
Andrew Thompson
Maya Carlyle
Qinghua Lu
Matthew Holmes
Theodora Skeadas
246
7
0
24 May 2025
Approaching the Harm of Gradient Attacks While Only Flipping Labels
Approaching the Harm of Gradient Attacks While Only Flipping Labels
Abdessamad El-Kabid
El-Mahdi El-Mhamdi
AAML
223
1
0
28 Feb 2025
On the Byzantine Fault Tolerance of signSGD with Majority Vote
On the Byzantine Fault Tolerance of signSGD with Majority Vote
Emanuele Mengoli
Luzius Moll
Virgilio Strozzi
El-Mahdi El-Mhamdi
AAMLFedML
279
1
0
26 Feb 2025
A Case for Specialisation in Non-Human Entities
A Case for Specialisation in Non-Human Entities
El-Mahdi El-Mhamdi
Lê Nguyên Hoang
Mariame Tighanimine
118
0
0
05 Feb 2025
A Survey on Offensive AI Within Cybersecurity
A Survey on Offensive AI Within Cybersecurity
Sahil Girhepuje
Aviral Verma
Gaurav Raina
AAML
128
7
0
26 Sep 2024
The poison of dimensionality
The poison of dimensionality
Lê-Nguyên Hoang
225
3
0
25 Sep 2024
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the
  Translational and Clinical Integration of Foundational Models
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models
Simha Sankar Baradwaj
Destiny Gilliland
Jack Rincon
Henning Hermjakob
Yu Yan
...
Dean Wang
Karol Watson
Alex Bui
Wei Wang
Peipei Ping
243
15
0
18 Jul 2024
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards
Zhimin Zhao
A. A. Bangash
F. Côgo
Bram Adams
Ahmed E. Hassan
658
2
0
04 Jul 2024
Data Quality in Edge Machine Learning: A State-of-the-Art Survey
Data Quality in Edge Machine Learning: A State-of-the-Art Survey
M. D. Belgoumri
Mohamed Reda Bouadjenek
Sunil Aryal
Hakim Hacid
260
2
0
01 Jun 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
502
94
0
08 May 2024
A Comprehensive Study of Knowledge Editing for Large Language Models
A Comprehensive Study of Knowledge Editing for Large Language Models
Ningyu Zhang
Yunzhi Yao
Bo Tian
Peng Wang
Shumin Deng
...
Lei Liang
Qing Cui
Xiao-Jun Zhu
Jun Zhou
Huajun Chen
KELM
365
122
0
02 Jan 2024
SoK: Memorization in General-Purpose Large Language Models
SoK: Memorization in General-Purpose Large Language Models
Valentin Hartmann
Anshuman Suri
Vincent Bindschaedler
David Evans
Shruti Tople
Robert West
KELMLLMAG
264
34
0
24 Oct 2023
Can LLM-Generated Misinformation Be Detected?
Can LLM-Generated Misinformation Be Detected?International Conference on Learning Representations (ICLR), 2023
Canyu Chen
Kai Shu
DeLMO
633
228
0
25 Sep 2023
Large Language Models for Software Engineering: A Systematic Literature
  Review
Large Language Models for Software Engineering: A Systematic Literature ReviewACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
272
706
0
21 Aug 2023
LLM Censorship: A Machine Learning Challenge or a Computer Security
  Problem?
LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?
David Glukhov
Ilia Shumailov
Y. Gal
Nicolas Papernot
Vardan Papyan
AAMLELM
178
71
0
20 Jul 2023
Jailbroken: How Does LLM Safety Training Fail?
Jailbroken: How Does LLM Safety Training Fail?Neural Information Processing Systems (NeurIPS), 2023
Alexander Wei
Nika Haghtalab
Jacob Steinhardt
557
1,335
0
05 Jul 2023
Citation: A Key to Building Responsible and Accountable Large Language
  Models
Citation: A Key to Building Responsible and Accountable Large Language Models
Jie Huang
Kevin Chen-Chuan Chang
HILM
246
28
0
05 Jul 2023
Position: Considerations for Differentially Private Learning with
  Large-Scale Public Pretraining
Position: Considerations for Differentially Private Learning with Large-Scale Public PretrainingInternational Conference on Machine Learning (ICML), 2022
Florian Tramèr
Gautam Kamath
Nicholas Carlini
SILM
313
94
0
13 Dec 2022
A Non-Expert's Introduction to Data Ethics for Mathematicians
A Non-Expert's Introduction to Data Ethics for Mathematicians
M. A. Porter
FaML
229
3
0
18 Jan 2022
1