ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.16851
  4. Cited By
Towards LLM Guardrails via Sparse Representation Steering

Towards LLM Guardrails via Sparse Representation Steering

21 March 2025
Zeqing He
Zhibo Wang
Huiyu Xu
Kui Ren
    LLMSV
ArXivPDFHTML

Papers citing "Towards LLM Guardrails via Sparse Representation Steering"

1 / 1 papers shown
Title
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control
Hannah Cyberey
David E. Evans
LLMSV
72
0
0
23 Apr 2025
1