ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.02475
  4. Cited By
Meta-Learning Fast Weight Language Models

Meta-Learning Fast Weight Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
5 December 2022
Kevin Clark
Kelvin Guu
Ming-Wei Chang
Panupong Pasupat
Geoffrey E. Hinton
Mohammad Norouzi
    KELM
ArXiv (abs)PDFHTMLGithub

Papers citing "Meta-Learning Fast Weight Language Models"

12 / 12 papers shown
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
J. Oswald
Nino Scherrer
Seijin Kobayashi
Luca Versari
Songlin Yang
...
Guillaume Lajoie
Charlotte Frenkel
Razvan Pascanu
Blaise Agüera y Arcas
João Sacramento
406
22
0
05 Jun 2025
One-Minute Video Generation with Test-Time Training
One-Minute Video Generation with Test-Time TrainingComputer Vision and Pattern Recognition (CVPR), 2025
Karan Dalal
Daniel Koceja
Gashon Hussein
Jiarui Xu
Yue Zhao
...
Tatsunori Hashimoto
Sanmi Koyejo
Yejin Choi
Yu Sun
Xiaolong Wang
ViT
464
87
0
07 Apr 2025
Generative Adapter: Contextualizing Language Models in Parameters with A
  Single Forward Pass
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward PassInternational Conference on Learning Representations (ICLR), 2024
Tong Chen
Hao Fang
Patrick Xia
Xiaodong Liu
Benjamin Van Durme
Luke Zettlemoyer
Jianfeng Gao
Hao Cheng
KELM
393
12
0
08 Nov 2024
What is Wrong with Perplexity for Long-context Language Modeling?
What is Wrong with Perplexity for Long-context Language Modeling?International Conference on Learning Representations (ICLR), 2024
Lizhe Fang
Yifei Wang
Zhaoyang Liu
Chenheng Zhang
Stefanie Jegelka
Jinyang Gao
Bolin Ding
Yisen Wang
800
45
0
31 Oct 2024
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Yu Sun
Xinhao Li
Karan Dalal
Jiarui Xu
Arjun Vikram
...
Xinlei Chen
Xiaolong Wang
Sanmi Koyejo
Tatsunori Hashimoto
Carlos Guestrin
754
241
0
05 Jul 2024
Online Test-Time Adaptation of Spatial-Temporal Traffic Flow Forecasting
Online Test-Time Adaptation of Spatial-Temporal Traffic Flow Forecasting
Pengxin Guo
Pengrong Jin
Ziyue Li
Mengwei He
Yu Zhang
AI4TS
205
11
0
08 Jan 2024
Compressed Context Memory For Online Language Model Interaction
Compressed Context Memory For Online Language Model Interaction
Jang-Hyun Kim
Junyoung Yeom
Sangdoo Yun
Hyun Oh Song
KELM
369
33
1
06 Dec 2023
When Meta-Learning Meets Online and Continual Learning: A Survey
When Meta-Learning Meets Online and Continual Learning: A Survey
Jaehyeon Son
Soochan Lee
Gunhee Kim
OODCLL
420
25
0
09 Nov 2023
Learning to (Learn at Test Time)
Learning to (Learn at Test Time)
Yu Sun
Xinhao Li
Karan Dalal
Chloe Hsu
Oluwasanmi Koyejo
Carlos Guestrin
Xiaolong Wang
Tatsunori Hashimoto
Xinlei Chen
SSL
372
12
0
20 Oct 2023
Trainable Transformer in Transformer
Trainable Transformer in TransformerInternational Conference on Machine Learning (ICML), 2023
A. Panigrahi
Sadhika Malladi
Mengzhou Xia
Sanjeev Arora
VLM
422
16
0
03 Jul 2023
Meta-Learning Online Adaptation of Language Models
Meta-Learning Online Adaptation of Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nathan J. Hu
E. Mitchell
Christopher D. Manning
Chelsea Finn
KELM
379
47
0
24 May 2023
$k$NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
kkkNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Yangsibo Huang
Daogao Liu
Zexuan Zhong
Weijia Shi
Y. Lee
RALMALM
257
22
0
21 Feb 2023
1
Page 1 of 1