Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

22 February 2024
Zhuoyan Xu
Zhenmei Shi
Junyi Wei
Fangzhou Mu
Yin Li
Yingyu Liang
ArXiv (abs) | PDF | HTML | GitHub (12★)

Papers citing "Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning"

22 / 22 papers shown
Can Language Models Compose Skills In-Context?
Zidong Liu
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
ReLM, CoGe, LRM
300
0
0
27 Oct 2025
HuggingGraph: Understanding the Supply Chain of LLM Ecosystem
Mohammad Shahedur Rahman
R. Hu
Peng Gao
345
3
0
17 Jul 2025
Scaling Laws for Geospatial Foundation Models: A case study on PhilEO Bench
Nikolaos Dionelis
Jente Bosmans
Riccardo Musto
Giancarlo Paoletti
Simone Sarti
Giacomo Cascarano
Casper Fibaek
Luke Camilleri
B. L. Saux
Alessandra Feliciotti
203
0
0
17 Jun 2025
Few-Shot Learning for Industrial Time Series: A Comparative Analysis Using the Example of Screw-Fastening Process Monitoring
X. Tu
Haocheng Zhang
Tao Chengxu
Zuyi Chen
AI4TS
279
0
0
16 Jun 2025
SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training
Ildus Sadrtdinov
Ivan Klimov
E. Lobacheva
Dmitry Vetrov
210
1
0
29 May 2025
Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse
Josh Alman
Zhao Song
371
10
0
22 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
349
23
0
17 May 2025
HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning
Donggyun Kim
Chanwoo Kim
Seunghoon Hong
184
0
0
21 Apr 2025
Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency
Jiangxuan Long
Zhao Song
Chiwun Yang
AI4TS
946
2
0
18 Mar 2025
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu
Khoi Duc Nguyen
Preeti Mukherjee
Saurabh Bagchi
Somali Chaterji
Yingyu Liang
Yin Li
LRM
432
4
0
13 Mar 2025
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
Chengyue Gong
Xiaoyu Li
Yingyu Liang
Jiangxuan Long
Zhenmei Shi
Zhao Song
Yu Tian
285
9
0
12 Mar 2025
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
Yang Cao
Zhao Song
Chiwun Yang
VGen
514
11
0
01 Feb 2025
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
361
20
0
31 Dec 2024
RoPE Attention Can Be Trained in Almost Linear Time
Yifang Chen
Jiayan Huo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
354
19
0
23 Dec 2024
Bayesian-guided Label Mapping for Visual Reprogramming
Neural Information Processing Systems (NeurIPS), 2024
C. Cai
Zesheng Ye
Bingquan Shen
Jianzhong Qi
Feng Liu
412
8
0
31 Oct 2024
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Bo Chen
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
463
28
0
15 Oct 2024
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
International Conference on Learning Representations (ICLR), 2024
Jihan Yao
Wenxuan Ding
Shangbin Feng
Lucy Lu Wang
Yulia Tsvetkov
236
4
0
14 Oct 2024
MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data
Mingu Kang
Dongseok Lee
Woojin Cho
Jaehyeon Park
Kookjin Lee
Anthony Gruber
Youngjoon Hong
Noseong Park
DiffM, AI4CE
198
1
0
09 Oct 2024
Task Addition in Multi-Task Learning by Geometrical Alignment
Soorin Yim
Dae-Woong Jeong
Sung Moon Ko
Sumin Lee
Hyunseung Kim
Chanhui Lee
Sehui Han
137
2
0
25 Sep 2024
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGe, LRM
378
52
0
22 Jul 2024
Why Larger Language Models Do In-context Learning Differently?
Zhenmei Shi
Junyi Wei
Zhuoyan Xu
Yingyu Liang
268
45
0
30 May 2024
Streaming Kernel PCA Algorithm With Small Space
Yichuan Deng
Zhao Song
Zifan Wang
Hangke Zhang
344
5
0
08 Mar 2023