ResearchTrend.AI

Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks

23 August 2025
Jack Youstra
Mohammed Mahfoud
Yang Yan
Henry Sleight
Ethan Perez
Mrinank Sharma
    AAML
arXiv: 2508.17158

Papers citing "Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks"

2 / 2 papers shown
Detecting Adversarial Fine-tuning with Auditing Agents
Sarah Egler
John Schulman
Nicholas Carlini
AAML, MLAU
17 Oct 2025
All Code, No Thought: Current Language Models Struggle to Reason in Ciphered Language
Shiyuan Guo
Henry Sleight
Fabien Roger
ELM, LRM
10 Oct 2025