Language Modeling with Gated Convolutional Networks

23 December 2016

Angela Fan

Papers citing "Language Modeling with Gated Convolutional Networks"

50 / 915 papers shown

Title
Black-box language model explanation by context length probing Ondřej Cífka Antoine Liutkus MILM LRM 24 6 0 30 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day Jonas Geiping Tom Goldstein MoE 32 86 0 28 Dec 2022
Structural State Translation: Condition Transfer between Civil Structures Using Domain-Generalization for Structural Health Monitoring Furkan Luleci F. Catbas 27 2 0 28 Dec 2022
Pretraining Without Attention Junxiong Wang J. Yan Albert Gu Alexander M. Rush 27 48 0 20 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations Ioannis Tsiamas José A. R. Fonollosa Marta R. Costa-jussá 46 6 0 19 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition Qin Li Xu Yang Yong Wang Yuankai Wu Deqiang He 46 10 0 12 Dec 2022
HieNet: Bidirectional Hierarchy Framework for Automated ICD Coding Shi Wang Daniel Tang Luchen Zhang Huilin Li Ding-Jung Han 24 10 0 09 Dec 2022
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity Ahmed Mustafa J. Valin Jan Büthe Paris Smaragdis Mike Goodwin 30 4 0 08 Dec 2022
COmic: Convolutional Kernel Networks for Interpretable End-to-End Learning on (Multi-)Omics Data Jonas C. Ditz Bernhard Reuter Nícolas Pfeifer 29 1 0 02 Dec 2022
Simplifying and Understanding State Space Models with Diagonal Linear RNNs Ankit Gupta Harsh Mehta Jonathan Berant 29 21 0 01 Dec 2022
Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses Yang Ai Zhenhua Ling 21 24 0 29 Nov 2022
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition Jiacheng Zhang Wenyi Yan Ye Zhang 23 2 0 24 Nov 2022
CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds Jesse C. Cresswell Brendan Leigh Ross G. Loaiza-Ganem H. Reyes-González Marco Letizia Anthony L. Caterini 24 36 0 23 Nov 2022
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation S. Tomar Maitreya Suin A. N. Rajagopalan ViT MDE 29 4 0 20 Nov 2022
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis Hyeong-Seok Choi Jinhyeok Yang Juheon Lee Hyeongju Kim 20 46 0 17 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention Ruihan Xu Haokui Zhang Wenze Hu Shiliang Zhang Xiaoyu Wang ViT 32 6 0 14 Nov 2022
Online Phase Reconstruction via DNN-based Phase Differences Estimation Yoshiki Masuyama Kohei Yatabe Kento Nagatomo Yasuhiro Oikawa 3DV 16 7 0 12 Nov 2022
A Comparative Study of Data Augmentation Techniques for Deep Learning Based Emotion Recognition Ravi Shankar Abdouh Harouna Kenfack Arjun Somayazulu A. Venkataraman 14 3 0 09 Nov 2022
Egocentric Audio-Visual Noise Suppression Roshan S. Sharma Weipeng He Ju Lin Egor Lakomkin Yang Liu Kaustubh Kalgaonkar EgoV 24 1 0 07 Nov 2022
SCA: Streaming Cross-attention Alignment for Echo Cancellation Yang Liu Yangyang Shi Yun Li Kaustubh Kalgaonkar Sriram Srinivasan X. Lei 40 8 0 01 Nov 2022
Structured State Space Decoder for Speech Recognition and Synthesis Koichi Miyazaki Masato Murata Tomoki Koriyama 34 12 0 31 Oct 2022
Efficient Speech Translation with Dynamic Latent Perceivers Ioannis Tsiamas Gerard I. Gállego José A. R. Fonollosa Marta R. Costa-jussá 25 2 0 28 Oct 2022
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs Reo Yoneyama Ryuichi Yamamoto Kentaro Tachibana 26 4 0 28 Oct 2022
What Language Model to Train if You Have One Million GPU Hours? Teven Le Scao Thomas Wang Daniel Hesslow Lucile Saulnier Stas Bekman ... Lintang Sutawika Jaesung Tae Zheng-Xin Yong Julien Launay Iz Beltagy MoE AI4CE 232 105 0 27 Oct 2022
$N$ -gram Is Back: Residual Learning of Neural Text Generation with $n$ -gram Language Model Huayang Li Deng Cai J. Xu Taro Watanabe VLM 37 1 0 26 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation Wanrong Zhu An Yan Yujie Lu Wenda Xu Junfeng Fang Miguel P. Eckstein William Yang Wang 82 37 0 07 Oct 2022
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval Andrew Rouditchenko Yung-Sung Chuang Nina Shvetsova Samuel Thomas Rogerio Feris Brian Kingsbury Leonid Karlinsky David Harwath Hilde Kuehne James R. Glass VLM 39 4 0 07 Oct 2022
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era Andreas Triantafyllopoulos Björn W. Schuller Gokcce .Iymen M. Sezgin Xiangheng He ... Shuo Liu Silvan Mertes Elisabeth André Ruibo Fu Jianhua Tao 20 53 0 06 Oct 2022
Temporal Spatial Decomposition and Fusion Network for Time Series Forecasting Liwang Zhou Jing Gao AI4TS 14 1 0 06 Oct 2022
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks Zhenhailong Wang Xiaoman Pan Dian Yu Dong Yu Jianshu Chen Heng Ji VLM 46 9 0 01 Oct 2022
Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders Oskar Kviman Ricky Molén A. Hotti Semih Kurt Victor Elvira J. Lagergren 32 11 0 30 Sep 2022
Music Source Separation with Band-split RNN Yi Luo Jianwei Yu 60 107 0 30 Sep 2022
Multi-scale Attention Network for Single Image Super-Resolution Yan Wang Yusen Li Gang Wang Xiaoguang Liu SupR 39 39 0 28 Sep 2022
Searching a High-Performance Feature Extractor for Text Recognition Network Hui Zhang Quanming Yao James T. Kwok X. Bai 30 7 0 27 Sep 2022
Multi-encoder attention-based architectures for sound recognition with partial visual assistance Wim Boes Hugo Van hamme 14 1 0 26 Sep 2022
Rethinking Performance Gains in Image Dehazing Networks Yuda Song Yang Zhou Hui Qian Xin Du SSeg 36 48 0 23 Sep 2022
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition Ye Bai Jie Li W. Han Hao Ni Kaituo Xu Zhuo Zhang Cheng Yi Xiaorui Wang MoE 31 1 0 17 Sep 2022
Lightweight Long-Range Generative Adversarial Networks Bowen Li Thomas Lukasiewicz GAN 42 3 0 08 Sep 2022
Spach Transformer: Spatial and Channel-wise Transformer Based on Local and Global Self-attentions for PET Image Denoising Se-In Jang T. Pan Ye Li P. Heidari Junyu Chen Quanzheng Li Kuang Gong ViT MedIm 36 27 0 07 Sep 2022
SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion J. Lagos Esa Rahtu VLM 3DV 35 5 0 01 Sep 2022
Transformers with Learnable Activation Functions Haishuo Fang Ji-Ung Lee N. Moosavi Iryna Gurevych AI4CE 25 7 0 30 Aug 2022
Decoding speech perception from non-invasive brain recordings Alexandre Défossez Charlotte Caucheteux Jérémy Rapin Ori Kabeli J. King 46 118 0 25 Aug 2022
Dance Style Transfer with Cross-modal Transformer Wenjie Yin Hang Yin Kim Baraka Danica Kragic Mårten Björkman 50 23 0 19 Aug 2022
Uncertainty Quantification for Traffic Forecasting: A Unified Approach Weizhu Qian Dalin Zhang Yan Zhao Kai Zheng James J. Q. Yu BDL AI4TS 32 22 0 11 Aug 2022
See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity Zesheng Ye Lina Yao Yu Zhang Silvia Gustin 29 6 0 07 Aug 2022
Detecting Algorithmically Generated Domains Using a GCNN-LSTM Hybrid Neural Network Zhilin Wang 25 0 0 06 Aug 2022
Model Blending for Text Classification Ramit Pahwa 26 0 0 05 Aug 2022
Towards Understanding Mixture of Experts in Deep Learning Zixiang Chen Yihe Deng Yue-bo Wu Quanquan Gu Yuan-Fang Li MLT MoE 42 53 0 04 Aug 2022
Conv-NILM-Net, a causal and multi-appliance model for energy source separation Mohamed Alami Chehboune Jérémie Decock Rim Kaddah Jesse Read 27 1 0 03 Aug 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling? Yi Tay Mostafa Dehghani Samira Abnar Hyung Won Chung W. Fedus J. Rao Sharan Narang Vinh Q. Tran Dani Yogatama Donald Metzler AI4CE 34 100 0 21 Jul 2022