1612.08083
Cited By
Language Modeling with Gated Convolutional Networks
23 December 2016
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
Papers citing
"Language Modeling with Gated Convolutional Networks"
50 / 915 papers shown
Title
Black-box language model explanation by context length probing
Ondřej Cífka
Antoine Liutkus
MILM
LRM
24
6
0
30 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
32
86
0
28 Dec 2022
Structural State Translation: Condition Transfer between Civil Structures Using Domain-Generalization for Structural Health Monitoring
Furkan Luleci
F. Catbas
27
2
0
28 Dec 2022
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
27
48
0
20 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
46
6
0
19 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition
Qin Li
Xu Yang
Yong Wang
Yuankai Wu
Deqiang He
46
10
0
12 Dec 2022
HieNet: Bidirectional Hierarchy Framework for Automated ICD Coding
Shi Wang
Daniel Tang
Luchen Zhang
Huilin Li
Ding-Jung Han
24
10
0
09 Dec 2022
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Ahmed Mustafa
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
30
4
0
08 Dec 2022
COmic: Convolutional Kernel Networks for Interpretable End-to-End Learning on (Multi-)Omics Data
Jonas C. Ditz
Bernhard Reuter
Nícolas Pfeifer
29
1
0
02 Dec 2022
Simplifying and Understanding State Space Models with Diagonal Linear RNNs
Ankit Gupta
Harsh Mehta
Jonathan Berant
29
21
0
01 Dec 2022
Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses
Yang Ai
Zhenhua Ling
21
24
0
29 Nov 2022
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition
Jiacheng Zhang
Wenyi Yan
Ye Zhang
23
2
0
24 Nov 2022
CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds
Jesse C. Cresswell
Brendan Leigh Ross
G. Loaiza-Ganem
H. Reyes-González
Marco Letizia
Anthony L. Caterini
24
36
0
23 Nov 2022
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation
S. Tomar
Maitreya Suin
A. N. Rajagopalan
ViT
MDE
29
4
0
20 Nov 2022
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Hyeong-Seok Choi
Jinhyeok Yang
Juheon Lee
Hyeongju Kim
20
46
0
17 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
32
6
0
14 Nov 2022
Online Phase Reconstruction via DNN-based Phase Differences Estimation
Yoshiki Masuyama
Kohei Yatabe
Kento Nagatomo
Yasuhiro Oikawa
3DV
16
7
0
12 Nov 2022
A Comparative Study of Data Augmentation Techniques for Deep Learning Based Emotion Recognition
Ravi Shankar
Abdouh Harouna Kenfack
Arjun Somayazulu
A. Venkataraman
14
3
0
09 Nov 2022
Egocentric Audio-Visual Noise Suppression
Roshan S. Sharma
Weipeng He
Ju Lin
Egor Lakomkin
Yang Liu
Kaustubh Kalgaonkar
EgoV
24
1
0
07 Nov 2022
SCA: Streaming Cross-attention Alignment for Echo Cancellation
Yang Liu
Yangyang Shi
Yun Li
Kaustubh Kalgaonkar
Sriram Srinivasan
X. Lei
40
8
0
01 Nov 2022
Structured State Space Decoder for Speech Recognition and Synthesis
Koichi Miyazaki
Masato Murata
Tomoki Koriyama
34
12
0
31 Oct 2022
Efficient Speech Translation with Dynamic Latent Perceivers
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
25
2
0
28 Oct 2022
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
Reo Yoneyama
Ryuichi Yamamoto
Kentaro Tachibana
26
4
0
28 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE
AI4CE
232
105
0
27 Oct 2022
N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language Model
Huayang Li
Deng Cai
J. Xu
Taro Watanabe
VLM
37
1
0
26 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Junfeng Fang
Miguel P. Eckstein
William Yang Wang
82
37
0
07 Oct 2022
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval
Andrew Rouditchenko
Yung-Sung Chuang
Nina Shvetsova
Samuel Thomas
Rogerio Feris
Brian Kingsbury
Leonid Karlinsky
David Harwath
Hilde Kuehne
James R. Glass
VLM
39
4
0
07 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
Andreas Triantafyllopoulos
Björn W. Schuller
Gökçe İymen
M. Sezgin
Xiangheng He
...
Shuo Liu
Silvan Mertes
Elisabeth André
Ruibo Fu
Jianhua Tao
20
53
0
06 Oct 2022
Temporal Spatial Decomposition and Fusion Network for Time Series Forecasting
Liwang Zhou
Jing Gao
AI4TS
14
1
0
06 Oct 2022
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Zhenhailong Wang
Xiaoman Pan
Dian Yu
Dong Yu
Jianshu Chen
Heng Ji
VLM
46
9
0
01 Oct 2022
Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders
Oskar Kviman
Ricky Molén
A. Hotti
Semih Kurt
Victor Elvira
J. Lagergren
32
11
0
30 Sep 2022
Music Source Separation with Band-split RNN
Yi Luo
Jianwei Yu
60
107
0
30 Sep 2022
Multi-scale Attention Network for Single Image Super-Resolution
Yan Wang
Yusen Li
Gang Wang
Xiaoguang Liu
SupR
39
39
0
28 Sep 2022
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
30
7
0
27 Sep 2022
Multi-encoder attention-based architectures for sound recognition with partial visual assistance
Wim Boes
Hugo Van hamme
14
1
0
26 Sep 2022
Rethinking Performance Gains in Image Dehazing Networks
Yuda Song
Yang Zhou
Hui Qian
Xin Du
SSeg
36
48
0
23 Sep 2022
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Ye Bai
Jie Li
W. Han
Hao Ni
Kaituo Xu
Zhuo Zhang
Cheng Yi
Xiaorui Wang
MoE
31
1
0
17 Sep 2022
Lightweight Long-Range Generative Adversarial Networks
Bowen Li
Thomas Lukasiewicz
GAN
42
3
0
08 Sep 2022
Spach Transformer: Spatial and Channel-wise Transformer Based on Local and Global Self-attentions for PET Image Denoising
Se-In Jang
T. Pan
Ye Li
P. Heidari
Junyu Chen
Quanzheng Li
Kuang Gong
ViT
MedIm
36
27
0
07 Sep 2022
SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
VLM
3DV
35
5
0
01 Sep 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
25
7
0
30 Aug 2022
Decoding speech perception from non-invasive brain recordings
Alexandre Défossez
Charlotte Caucheteux
Jérémy Rapin
Ori Kabeli
J. King
46
118
0
25 Aug 2022
Dance Style Transfer with Cross-modal Transformer
Wenjie Yin
Hang Yin
Kim Baraka
Danica Kragic
Mårten Björkman
50
23
0
19 Aug 2022
Uncertainty Quantification for Traffic Forecasting: A Unified Approach
Weizhu Qian
Dalin Zhang
Yan Zhao
Kai Zheng
James J. Q. Yu
BDL
AI4TS
32
22
0
11 Aug 2022
See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity
Zesheng Ye
Lina Yao
Yu Zhang
Silvia Gustin
29
6
0
07 Aug 2022
Detecting Algorithmically Generated Domains Using a GCNN-LSTM Hybrid Neural Network
Zhilin Wang
25
0
0
06 Aug 2022
Model Blending for Text Classification
Ramit Pahwa
26
0
0
05 Aug 2022
Towards Understanding Mixture of Experts in Deep Learning
Zixiang Chen
Yihe Deng
Yue-bo Wu
Quanquan Gu
Yuan-Fang Li
MLT
MoE
42
53
0
04 Aug 2022
Conv-NILM-Net, a causal and multi-appliance model for energy source separation
Mohamed Alami Chehboune
Jérémie Decock
Rim Kaddah
Jesse Read
27
1
0
03 Aug 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Yi Tay
Mostafa Dehghani
Samira Abnar
Hyung Won Chung
W. Fedus
J. Rao
Sharan Narang
Vinh Q. Tran
Dani Yogatama
Donald Metzler
AI4CE
34
100
0
21 Jul 2022