Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.03012
Cited By
Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment
6 January 2025
Pegah Khayatan
Mustafa Shukor
Jayneel Parekh
Matthieu Cord
LLMSV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment"
1 / 1 papers shown
Title
Robustly identifying concepts introduced during chat fine-tuning using crosscoders
Julian Minder
Clement Dumas
Caden Juang
Bilal Chugtai
Neel Nanda
21
0
0
03 Apr 2025
1