ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.20809
  4. Cited By
Improved Representation Steering for Language Models

Improved Representation Steering for Language Models

27 May 2025
Zhengxuan Wu
Qinan Yu
Aryaman Arora
Christopher D. Manning
Christopher Potts
    LLMSV
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Improved Representation Steering for Language Models"

3 / 3 papers shown
Title
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
Ziwen Xu
Shuxun Wang
Kewei Xu
Haoming Xu
Mengru Wang
Xinle Deng
Yunzhi Yao
Guozhou Zheng
Ningyu Zhang
Xin Xu
KELMLLMSV
1.1K
6
0
21 Apr 2025
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Hritik Bansal
Ashima Suvarna
Gantavya Bhatt
Nanyun Peng
Kai-Wei Chang
Aditya Grover
ALM
382
15
0
31 Mar 2024
LEACE: Perfect linear concept erasure in closed form
LEACE: Perfect linear concept erasure in closed formNeural Information Processing Systems (NeurIPS), 2023
Nora Belrose
David Schneider-Joseph
Shauli Ravfogel
Robert Bamler
Edward Raff
Stella Biderman
KELMMU
761
164
0
06 Jun 2023
1