ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.07530
  4. Cited By
How Aligned are Different Alignment Metrics?

How Aligned are Different Alignment Metrics?

10 July 2024
Jannis Ahlert
Thomas Klein
Felix Wichmann
Robert Geirhos
ArXiv (abs)PDFHTML

Papers citing "How Aligned are Different Alignment Metrics?"

3 / 3 papers shown
Title
Representational Similarity via Interpretable Visual Concepts
Representational Similarity via Interpretable Visual ConceptsInternational Conference on Learning Representations (ICLR), 2025
Neehar Kondapaneni
Oisin Mac Aodha
Pietro Perona
DRL
923
3
0
19 Mar 2025
Alignment and Adversarial Robustness: Are More Human-Like Models More Secure?
Alignment and Adversarial Robustness: Are More Human-Like Models More Secure?
Blaine Hoak
Kunyang Li
Patrick McDaniel
AAML
131
0
0
17 Feb 2025
Connecting Concept Convexity and Human-Machine Alignment in Deep Neural
  Networks
Connecting Concept Convexity and Human-Machine Alignment in Deep Neural Networks
Teresa Dorszewski
Lenka Tětková
Lorenz Linhardt
Lars Kai Hansen
HAI
188
1
0
10 Sep 2024
1