Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

22 February 2024

Yin Li

Papers citing "Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning"

Can Language Models Compose Skills In-Context?

300

27 Oct 2025

HuggingGraph: Understanding the Supply Chain of LLM Ecosystem

Mohammad Shahedur Rahman

R. Hu

Peng Gao

345

17 Jul 2025

Scaling Laws for Geospatial Foundation Models: A case study on PhilEO Bench

Alessandra Feliciotti

204

17 Jun 2025

Few-Shot Learning for Industrial Time Series: A Comparative Analysis Using the Example of Screw-Fastening Process Monitoring

279

16 Jun 2025

SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training

210

29 May 2025

Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse

Josh Alman

Zhao Song

371

22 May 2025

Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform

Josh Alman

Zhao Song

349

17 May 2025

HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning

Donggyun Kim

Chanwoo Kim

Seunghoon Hong

184

21 Apr 2025

Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency

947

18 Mar 2025

Learning to Inference Adaptively for Multimodal Large Language Models

432

13 Mar 2025

Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows

285

12 Mar 2025

Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation

514

01 Feb 2025

Out-of-distribution generalization via composition: a lens through induction heads in Transformers

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024

Jiajun Song

Zhuoyan Xu

Yiqiao Zhong

361

31 Dec 2024

RoPE Attention Can Be Trained in Almost Linear Time

354

23 Dec 2024

Bayesian-guided Label Mapping for Visual Reprogramming

Neural Information Processing Systems (NeurIPS), 2024

412

31 Oct 2024

Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent

International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

463

15 Oct 2024

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

International Conference on Learning Representations (ICLR), 2024

236

14 Oct 2024

MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data

Anthony Gruber

198

09 Oct 2024

Task Addition in Multi-Task Learning by Geometrical Alignment

Chanhui Lee

137

25 Sep 2024

Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability

384

22 Jul 2024

Why Larger Language Models Do In-context Learning Differently?

268

30 May 2024

Streaming Kernel PCA Algorithm With Small Space

344

08 Mar 2023