Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.17567
Cited By
Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models
26 October 2023
Dingli Yu
Simran Kaur
Arushi Gupta
Jonah Brown-Cohen
Anirudh Goyal
Sanjeev Arora
ALM
LLMAG
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models"
6 / 6 papers shown
Title
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
Yutong Yin
Zhaoran Wang
LRM
ReLM
68
0
0
27 Jan 2025
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Florian E. Dorner
Vivian Y. Nastl
Moritz Hardt
ELM
ALM
35
5
0
17 Oct 2024
Can Models Learn Skill Composition from Examples?
Haoyu Zhao
Simran Kaur
Dingli Yu
Anirudh Goyal
Sanjeev Arora
CoGe
MoE
53
2
0
29 Sep 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
58
6
1
10 Jul 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
51
1
0
23 Jun 2024
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
22
6
0
21 Nov 2023
1