ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.01174
  4. Cited By
Towards Inference-time Category-wise Safety Steering for Large Language
  Models

Towards Inference-time Category-wise Safety Steering for Large Language Models

2 October 2024
Amrita Bhattacharjee
Shaona Ghosh
Traian Rebedea
Christopher Parisien
    LLMSV
ArXivPDFHTML

Papers citing "Towards Inference-time Category-wise Safety Steering for Large Language Models"

1 / 1 papers shown
Title
Focus On This, Not That! Steering LLMs With Adaptive Feature Specification
Focus On This, Not That! Steering LLMs With Adaptive Feature Specification
Tom A. Lamb
Adam Davies
Alasdair Paren
Philip H. S. Torr
Francesco Pinto
45
0
0
30 Oct 2024
1