Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2305.16938
Cited By
v1
v2 (latest)
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation
26 May 2023
Marius Mosbach
Tiago Pimentel
Haiqin Yang
Dietrich Klakow
Yanai Elazar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation"
43 / 43 papers shown
Title
Supervised In-Context Fine-Tuning for Generative Sequence Labeling
David Dukić
Goran Glavaš
Jan Šnajder
BDL
52
0
0
31 Aug 2025
Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning
Baiyuan Chen
Shinji Ito
Masaaki Imaizumi
20
0
0
22 Aug 2025
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs
Mikhail Seleznyov
Mikhail Chaichuk
Gleb Ershov
Alexander Panchenko
Elena Tutubalina
Oleg Somov
19
1
0
15 Aug 2025
Agentic Design Review System
Sayan Nag
K. J. Joseph
Koustava Goswami
Vlad I. Morariu
Balaji Vasan Srinivasan
36
0
0
14 Aug 2025
Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models
Yao Xiao
H. Christensen
Stefan Goetze
129
0
0
11 Jun 2025
Mimicking or Reasoning: Rethinking Multi-Modal In-Context Learning in Vision-Language Models
Chengyue Huang
Yuchen Zhu
Sichen Zhu
Jingyun Xiao
Moises Andrade
Shivang Chopra
Z. Kira
ReLM
VLM
LRM
76
3
0
09 Jun 2025
LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles
Egil Rønningstad
Gaurav Negi
87
0
0
06 Jun 2025
Exploring In-context Example Generation for Machine Translation
Dohyun Lee
Seungil Lee
Chanwoo Yang
Yujin Baek
Jaegul Choo
84
0
0
31 May 2025
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning
Shaobo Wang
Xiangqi Jin
Ziming Wang
Jinqiao Wang
Jingyun Zhang
...
Zichen Wen
Zhong Li
Bin Wang
Xuming Hu
Linfeng Zhang
SyDa
217
4
0
18 May 2025
On the generalization of language models from in-context learning and finetuning: a controlled study
Andrew Kyle Lampinen
Arslan Chaudhry
Stephanie Chan
Cody Wild
Diane Wan
Alex Ku
Jorg Bornschein
Razvan Pascanu
Murray Shanahan
James L. McClelland
294
11
0
01 May 2025
In a Few Words: Comparing Weak Supervision and LLMs for Short Query Intent Classification
Daria Alexander
Arjen P. de Vries
124
0
0
30 Apr 2025
POPri: Private Federated Learning using Preference-Optimized Synthetic Data
Charlie Hou
Mei-Yu Wang
Yige Zhu
Daniel Lazar
Giulia Fanti
FedML
292
4
0
23 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
284
0
0
18 Apr 2025
Mimic In-Context Learning for Multimodal Tasks
Yuchu Jiang
Jiale Fu
Chenduo Hao
Xinting Hu
Yingzhe Peng
Xin Geng
Xu Yang
161
5
0
11 Apr 2025
Large Language Models are Unreliable for Cyber Threat Intelligence
Emanuele Mezzi
Fabio Massacci
Katja Tuma
168
3
0
29 Mar 2025
Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention
Emily Xiao
Chin-Jou Li
Yilin Zhang
Graham Neubig
Amanda Bertsch
BDL
185
1
0
11 Mar 2025
Uncovering inequalities in new knowledge learning by large language models across different languages
Chenglong Wang
Haoyu Tang
Xiyuan Yang
Yueqi Xie
Jina Suh
...
Junming Huang
Yu Xie
Zhaoya Gong
Xing Xie
Fangzhao Wu
156
0
0
06 Mar 2025
SAKE: Steering Activations for Knowledge Editing
Marco Scialanga
Thibault Laugel
Vincent Grari
Marcin Detyniecki
KELM
LLMSV
182
2
0
03 Mar 2025
Dynamic Gradient Sparsification Training for Few-Shot Fine-tuning of CT Lymph Node Segmentation Foundation Model
Zihao Luo
Zijun Gao
Wenjun Liao
Shichuan Zhang
Guotai Wang
Xiangde Luo
123
0
0
02 Mar 2025
Shh, don't say that! Domain Certification in LLMs
Cornelius Emde
Alasdair Paren
Preetham Arvind
Maxime Kayser
Tom Rainforth
Thomas Lukasiewicz
Guohao Li
Philip Torr
Adel Bibi
190
2
0
26 Feb 2025
TestNUC: Enhancing Test-Time Computing Approaches and Scaling through Neighboring Unlabeled Data Consistency
Henry Peng Zou
Zhengyao Gu
Yue Zhou
Yankai Chen
Weizhi Zhang
Liancheng Fang
Yibo Wang
Yangning Li
Kay Liu
Philip S. Yu
199
3
0
26 Feb 2025
Adaptive Prompting: Ad-hoc Prompt Composition for Social Bias Detection
Maximilian Spliethover
Tim Knebler
Fabian Fumagalli
Maximilian Muschalik
Barbara Hammer
Eyke Hüllermeier
Henning Wachsmuth
249
1
0
10 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Jing Yang
Max Glockner
Anderson de Rezende Rocha
Iryna Gurevych
LRM
205
1
0
07 Feb 2025
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
Moreno La Quatra
Valerio Mario Salerno
Yu Tsao
Sabato Marco Siniscalchi
223
3
0
22 Jan 2025
Efficient LLM Context Distillation
Rajesh Upadhayayaya
Zachary Smith
Chritopher Kottmyer
Manish Raj Osti
257
3
0
03 Sep 2024
LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations
Lei Shi
Zhimeng Liu
Yi Yang
Weize Wu
Yuyang Zhang
...
Zipeng Liu
Huobin Tan
Hongyi Gao
Yue Zhang
Ge Wang
149
3
0
06 Aug 2024
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding
Renato Vukovic
David Arps
Carel van Niekerk
Benjamin Matthias Ruppik
Hsien-chin Lin
Michael Heck
Milica Gašić
128
3
0
05 Aug 2024
MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction
Yuyan Liu
Sirui Ding
Sheng Zhou
Wenqi Fan
Qiaoyu Tan
119
20
0
18 Jun 2024
Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting
Krishna Prasad Varadarajan Srinivasan
Prasanth Gumpena
Madhusudhana Yattapu
Vishal H. Brahmbhatt
42
4
0
21 May 2024
In-Context Learning with Long-Context Models: An In-Depth Exploration
Amanda Bertsch
Maor Ivgi
Uri Alon
Jonathan Berant
Matthew R. Gormley
Matthew R. Gormley
Graham Neubig
ReLM
AIMat
287
95
0
30 Apr 2024
LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey
Ashok Urlana
Charaka Vinayak Kumar
Ajeet Kumar Singh
B. Garlapati
S. Chalamala
Rahul Mishra
172
11
0
22 Feb 2024
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems
Ivan Sekulić
Silvia Terragni
Victor Guimaraes
Nghia Khau
Bruna Guedes
Modestas Filipavicius
A. Manso
Roland Mathis
69
8
0
20 Feb 2024
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries
Seanie Lee
Jianpeng Cheng
Joris Driesen
Alexandru Coca
Anders Johannsen
RALM
144
3
0
20 Feb 2024
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance
Branislav Pecher
Ivan Srba
Maria Bielikova
ALM
166
10
0
20 Feb 2024
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs
Kenneth Li
Tianle Liu
Naomi Bashkansky
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
146
16
0
13 Feb 2024
NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning
Yufeng Zhao
Yoshihiro Sakai
Naoya Inoue
140
6
0
08 Feb 2024
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
136
32
0
20 Oct 2023
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams
Ethan Callanan
A. Mbakwe
Antony Papadimitriou
Yulong Pei
Mathieu Sibue
Xiaodan Zhu
Zhiqiang Ma
Xiaomo Liu
Sameena Shah
ELM
103
23
0
12 Oct 2023
Junk DNA Hypothesis: Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMs
Lu Yin
Ajay Jaiswal
Shiwei Liu
Souvik Kundu
Zhangyang Wang
129
12
0
29 Sep 2023
In-context Interference in Chat-based Large Language Models
Eric Nuertey Coleman
J. Hurtado
Vincenzo Lomonaco
KELM
88
1
0
22 Sep 2023
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
Jingqing Ruan
Yihong Chen
Bin Zhang
Zhiwei Xu
Tianpeng Bao
...
Shiwei Shi
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
LM&Ro
145
42
0
07 Aug 2023
Measuring the Robustness of NLP Models to Domain Shifts
Nitay Calderon
Naveh Porat
Eyal Ben-David
Alexander Chapanin
Zorik Gekhman
Nadav Oved
Vitaly Shalumov
Roi Reichart
259
8
0
31 May 2023
In-Context Probing: Toward Building Robust Classifiers via Probing Large Language Models
Afra Amini
Massimiliano Ciaramita
ReLM
79
1
0
23 May 2023
1