ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.03824
  4. Cited By
FNet: Mixing Tokens with Fourier Transforms

FNet: Mixing Tokens with Fourier Transforms

9 May 2021
James Lee-Thorp
Joshua Ainslie
Ilya Eckstein
Santiago Ontanon
ArXivPDFHTML

Papers citing "FNet: Mixing Tokens with Fourier Transforms"

50 / 251 papers shown
Title
ConvFormer: Revisiting Transformer for Sequential User Modeling
ConvFormer: Revisiting Transformer for Sequential User Modeling
Hao Wang
Jianxun Lian
Mingyang Wu
Haoxuan Li
Jiajun Fan
Wanyue Xu
Chaozhuo Li
Xing Xie
19
3
0
05 Aug 2023
DRAW: Defending Camera-shooted RAW against Image Manipulation
DRAW: Defending Camera-shooted RAW against Image Manipulation
Xiaoxiao Hu
Qichao Ying
Zhenxing Qian
Sheng Li
Xinpeng Zhang
AAML
42
7
0
31 Jul 2023
Improving BERT with Hybrid Pooling Network and Drop Mask
Improving BERT with Hybrid Pooling Network and Drop Mask
Qian Chen
Wen Wang
Qinglin Zhang
Chong Deng
Ma Yukun
Siqi Zheng
17
0
0
14 Jul 2023
FITS: Modeling Time Series with $10k$ Parameters
FITS: Modeling Time Series with 10k10k10k Parameters
Zhijian Xu
Ailing Zeng
Qiang Xu
AI4TS
22
84
0
06 Jul 2023
Fourier-Mixed Window Attention: Accelerating Informer for Long Sequence
  Time-Series Forecasting
Fourier-Mixed Window Attention: Accelerating Informer for Long Sequence Time-Series Forecasting
Nhat Tran
Jack Xin
AI4TS
31
6
0
02 Jul 2023
WaveMixSR: A Resource-efficient Neural Network for Image
  Super-resolution
WaveMixSR: A Resource-efficient Neural Network for Image Super-resolution
Pranav Jeevan
Akella Srinidhi
Pasunuri Prathiba
A. Sethi
SupR
31
9
0
01 Jul 2023
MultiWave: Multiresolution Deep Architectures through Wavelet
  Decomposition for Multivariate Time Series Prediction
MultiWave: Multiresolution Deep Architectures through Wavelet Decomposition for Multivariate Time Series Prediction
I. Deznabi
M. Fiterau
AI4TS
35
5
0
16 Jun 2023
Recasting Self-Attention with Holographic Reduced Representations
Recasting Self-Attention with Holographic Reduced Representations
Mohammad Mahmudul Alam
Edward Raff
Stella Biderman
Tim Oates
James Holt
8
8
0
31 May 2023
A Quantitative Review on Language Model Efficiency Research
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
25
0
0
28 May 2023
A Joint Time-frequency Domain Transformer for Multivariate Time Series
  Forecasting
A Joint Time-frequency Domain Transformer for Multivariate Time Series Forecasting
Yushu Chen
Shengzhuo Liu
Jinzhe Yang
Hao Jing
Wenlai Zhao
Guang-Wu Yang
AI4TS
24
15
0
24 May 2023
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Ziwei He
Meng-Da Yang
Minwei Feng
Jingcheng Yin
Xinbing Wang
Jingwen Leng
Zhouhan Lin
ViT
35
13
0
24 May 2023
Recyclable Tuning for Continual Pre-training
Recyclable Tuning for Continual Pre-training
Yujia Qin
Cheng Qian
Xu Han
Yankai Lin
Huadong Wang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
CLL
26
11
0
15 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health
  Management: A Survey and Roadmaps
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
29
46
0
10 May 2023
Toeplitz Neural Network for Sequence Modeling
Toeplitz Neural Network for Sequence Modeling
Zhen Qin
Xiaodong Han
Weixuan Sun
Bowen He
Dong Li
Dongxu Li
Yuchao Dai
Lingpeng Kong
Yiran Zhong
AI4TS
ViT
32
40
0
08 May 2023
Deep Learning-Based Multiband Signal Fusion for 3-D SAR Super-Resolution
Deep Learning-Based Multiband Signal Fusion for 3-D SAR Super-Resolution
Josiah W. Smith
Murat Torlak
29
3
0
03 May 2023
Sequence Modeling with Multiresolution Convolutional Memory
Sequence Modeling with Multiresolution Convolutional Memory
Jiaxin Shi
Ke Alexander Wang
E. Fox
42
13
0
02 May 2023
Unlimiformer: Long-Range Transformers with Unlimited Length Input
Unlimiformer: Long-Range Transformers with Unlimited Length Input
Amanda Bertsch
Uri Alon
Graham Neubig
Matthew R. Gormley
RALM
99
122
0
02 May 2023
SpectFormer: Frequency and Attention is what you need in a Vision
  Transformer
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
35
47
0
13 Apr 2023
Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution
Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution
Haoming Chen
Yu-Syuan Xu
Minui Hong
Yi-Min Tsai
Hsien-Kai Kuo
Chun-Yi Lee
OffRL
39
46
0
29 Mar 2023
TransCODE: Co-design of Transformers and Accelerators for Efficient
  Training and Inference
TransCODE: Co-design of Transformers and Accelerators for Efficient Training and Inference
Shikhar Tuli
N. Jha
38
5
0
27 Mar 2023
EdgeTran: Co-designing Transformers for Efficient Inference on Mobile
  Edge Platforms
EdgeTran: Co-designing Transformers for Efficient Inference on Mobile Edge Platforms
Shikhar Tuli
N. Jha
36
3
0
24 Mar 2023
Multiscale Attention via Wavelet Neural Operators for Vision
  Transformers
Multiscale Attention via Wavelet Neural Operators for Vision Transformers
Anahita Nekoozadeh
M. Ahmadzadeh
Zahra Mardani
ViT
38
2
0
22 Mar 2023
TemporalMaxer: Maximize Temporal Context with only Max Pooling for
  Temporal Action Localization
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
Tuan N. Tang
Kwonyoung Kim
Kwanghoon Sohn
24
29
0
16 Mar 2023
Resurrecting Recurrent Neural Networks for Long Sequences
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
88
268
0
11 Mar 2023
FFT-based Dynamic Token Mixer for Vision
FFT-based Dynamic Token Mixer for Vision
Yuki Tatsunami
Masato Taki
45
20
0
07 Mar 2023
MITFAS: Mutual Information based Temporal Feature Alignment and Sampling
  for Aerial Video Action Recognition
MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition
Ruiqi Xian
Xijun Wang
Tianyi Zhou
24
10
0
05 Mar 2023
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal
  Reasoning
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning
Xijun Wang
Ruiqi Xian
Tianrui Guan
Celso M. de Melo
Stephen M. Nogar
Aniket Bera
Tianyi Zhou
16
11
0
02 Mar 2023
Diagonal State Space Augmented Transformers for Speech Recognition
Diagonal State Space Augmented Transformers for Speech Recognition
G. Saon
Ankit Gupta
Xiaodong Cui
AI4TS
35
26
0
27 Feb 2023
Magnification Invariant Medical Image Analysis: A Comparison of
  Convolutional Networks, Vision Transformers, and Token Mixers
Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers
Pranav Jeevan
Nikhil Cherian Kurian
A. Sethi
MedIm
32
1
0
22 Feb 2023
A Federated Approach for Hate Speech Detection
A Federated Approach for Hate Speech Detection
Jay Gala
Deep Gandhi
Jash Mehta
Zeerak Talat
21
4
0
18 Feb 2023
Efficiency 360: Efficient Vision Transformers
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Daniel Y. Fu
Elliot L. Epstein
Eric N. D. Nguyen
A. Thomas
Michael Zhang
Tri Dao
Atri Rudra
Christopher Ré
16
52
0
13 Feb 2023
Fourier-RNNs for Modelling Noisy Physics Data
Fourier-RNNs for Modelling Noisy Physics Data
Vignesh Gopakumar
Stanislas Pamela
L. Zanisi
AI4TS
18
2
0
13 Feb 2023
A Unified View of Long-Sequence Models towards Modeling Million-Scale
  Dependencies
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Hongyu Hè
Marko Kabić
25
2
0
13 Feb 2023
The Framework Tax: Disparities Between Inference Efficiency in NLP
  Research and Deployment
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
Jared Fernandez
Jacob Kahn
Clara Na
Yonatan Bisk
Emma Strubell
FedML
33
10
0
13 Feb 2023
DNArch: Learning Convolutional Neural Architectures by Backpropagation
DNArch: Learning Convolutional Neural Architectures by Backpropagation
David W. Romero
Neil Zeghidour
AI4CE
24
4
0
10 Feb 2023
MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal
  and Channel Mixing
MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal and Channel Mixing
Zhe Li
Zhongwen Rao
Lujia Pan
Zenglin Xu
AI4TS
33
60
0
09 Feb 2023
Efficient Joint Learning for Clinical Named Entity Recognition and
  Relation Extraction Using Fourier Networks: A Use Case in Adverse Drug Events
Efficient Joint Learning for Clinical Named Entity Recognition and Relation Extraction Using Fourier Networks: A Use Case in Adverse Drug Events
A. Yazdani
D. Proios
H. Rouhizadeh
Douglas Teodoro
21
7
0
08 Feb 2023
Scaling Up Computer Vision Neural Networks Using Fast Fourier Transform
Scaling Up Computer Vision Neural Networks Using Fast Fourier Transform
Siddharth Agrawal
15
0
0
02 Feb 2023
Out-of-distributional risk bounds for neural operators with applications
  to the Helmholtz equation
Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation
Jose Antonio Lara Benitez
Takashi Furuya
F. Faucher
Anastasis Kratsios
X. Tricoche
Maarten V. de Hoop
39
16
0
27 Jan 2023
WLD-Reg: A Data-dependent Within-layer Diversity Regularizer
WLD-Reg: A Data-dependent Within-layer Diversity Regularizer
Firas Laakom
Jenni Raitoharju
Alexandros Iosifidis
Moncef Gabbouj
AI4CE
29
7
0
03 Jan 2023
Cramming: Training a Language Model on a Single GPU in One Day
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
30
85
0
28 Dec 2022
Pretraining Without Attention
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
27
48
0
20 Dec 2022
First De-Trend then Attend: Rethinking Attention for Time-Series
  Forecasting
First De-Trend then Attend: Rethinking Attention for Time-Series Forecasting
Xiyuan Zhang
Xiaoyong Jin
Karthick Gopalswamy
Gaurav Gupta
Youngsuk Park
Xingjian Shi
Hongya Wang
Danielle C. Maddix
Yuyang Wang
AI4TS
27
19
0
15 Dec 2022
Efficient Long Sequence Modeling via State Space Augmented Transformer
Efficient Long Sequence Modeling via State Space Augmented Transformer
Simiao Zuo
Xiaodong Liu
Jian Jiao
Denis Xavier Charles
Eren Manavoglu
Tuo Zhao
Jianfeng Gao
125
36
0
15 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and
  Methods
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
32
25
0
13 Dec 2022
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models
  of Different Modalities
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Zhe Zhao
Yudong Li
Cheng-An Hou
Jing-xin Zhao
Rong Tian
...
Xingwu Sun
Zhanhui Kang
Xiaoyong Du
Linlin Shen
Kimmo Yan
VLM
41
23
0
13 Dec 2022
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model
  From Scratch?
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch?
Joel Niklaus
Daniele Giofré
33
11
0
30 Nov 2022
Improving Commonsense in Vision-Language Models via Knowledge Graph
  Riddles
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Shuquan Ye
Yujia Xie
Dongdong Chen
Yichong Xu
Lu Yuan
Chenguang Zhu
Jing Liao
VLM
24
11
0
29 Nov 2022
Lightweight Structure-Aware Attention for Visual Understanding
Lightweight Structure-Aware Attention for Visual Understanding
Heeseung Kwon
F. M. Castro
M. Marín-Jiménez
N. Guil
Alahari Karteek
28
2
0
29 Nov 2022
Previous
123456
Next