Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.08083
Cited By
Language Modeling with Gated Convolutional Networks
23 December 2016
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Language Modeling with Gated Convolutional Networks"
50 / 915 papers shown
Title
A Survey on Semantic Processing Techniques
Rui Mao
Kai He
Xulang Zhang
Guanyi Chen
Jinjie Ni
Zonglin Yang
Min Zhang
23
34
0
22 Oct 2023
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation
Kun Wei
Bei Li
Hang Lv
Quan Lu
Ning Jiang
Lei Xie
44
3
0
22 Oct 2023
Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
Luoming Zhang
Wen Fei
Weijia Wu
Yefei He
Zhenyu Lou
Hong Zhou
MQ
25
5
0
07 Oct 2023
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models
Iman Mirzadeh
Keivan Alizadeh-Vahid
Sachin Mehta
C. C. D. Mundo
Oncel Tuzel
Golnoosh Samei
Mohammad Rastegari
Mehrdad Farajtabar
126
61
0
06 Oct 2023
PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels
Praneeth Kacham
Vahab Mirrokni
Peilin Zhong
44
8
0
02 Oct 2023
Generalized Activation via Multivariate Projection
Jiayun Li
Yuxiao Cheng
Yiwen Lu
Zhuofan Xia
Yilin Mo
Gao Huang
LLMSV
25
0
0
29 Sep 2023
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
73
1,622
0
28 Sep 2023
Parallelizing non-linear sequential models over the sequence length
Yi Heng Lim
Qi Zhu
Joshua Selfridge
M. F. Kasim
30
13
0
21 Sep 2023
Directional Source Separation for Robust Speech Recognition on Smart Glasses
Tiantian Feng
Ju Lin
Yiteng Huang
Weipeng He
Kaustubh Kalgaonkar
Niko Moritz
Liting Wan
Xin Lei
Ming Sun
Frank Seide
18
4
0
20 Sep 2023
SlimPajama-DC: Understanding Data Combinations for LLM Training
Zhiqiang Shen
Tianhua Tao
Liqun Ma
Willie Neiswanger
Zhengzhong Liu
...
Bowen Tan
Joel Hestness
Natalia Vassilieva
Daria Soboleva
Eric Xing
30
45
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Zenan Zhou
Zhiying Wu
ELM
LRM
77
710
0
19 Sep 2023
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints
Hao Yen
Sabato Marco Siniscalchi
Chin-Hui Lee
42
1
0
16 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
LRM
24
36
0
13 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
E. Sciascio
ALM
48
27
0
07 Sep 2023
Music Source Separation with Band-Split RoPE Transformer
Wei-Tsung Lu
Ju-Chiang Wang
Qiuqiang Kong
Yun-Ning Hung
27
20
0
05 Sep 2023
Gated recurrent neural networks discover attention
Nicolas Zucchet
Seijin Kobayashi
Yassir Akram
J. Oswald
Maxime Larcher
Angelika Steger
João Sacramento
36
8
0
04 Sep 2023
An Ensemble Approach to Personalized Real Time Predictive Writing for Experts
Sourav Prosad
Viswa Datha Polavarapu
Shrutendra Harsola
19
0
0
25 Aug 2023
NimbRo wins ANA Avatar XPRIZE Immersive Telepresence Competition: Human-Centric Evaluation and Lessons Learned
Christian Lenz
Max Schwarz
Andre Rochow
Bastian Patzold
Raphael Memmesheimer
M. Schreiber
Sven Behnke
22
13
0
23 Aug 2023
Stabilizing RNN Gradients through Pre-training
Luca Herranz-Celotti
Jean Rouat
32
0
0
23 Aug 2023
Deformable Mixer Transformer with Gating for Multi-Task Learning of Dense Prediction
Yangyang Xu
Yibo Yang
Bernard Ghanemm
Lefei Zhang
Du Bo
Dacheng Tao
21
1
0
10 Aug 2023
Multi-graph Spatio-temporal Graph Convolutional Network for Traffic Flow Prediction
Weilong Ding
Tianpu Zhang
Jianwu Wang
Zhuofeng Zhao
AI4TS
26
0
0
10 Aug 2023
A Novel Convolutional Neural Network Architecture with a Continuous Symmetry
Y. Liu
Han-Juan Shao
Bing Bai
AI4CE
26
2
0
03 Aug 2023
A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil
Yongkang He
Siyuan Peng
Mingjin Chen
Zhi-Yi Yang
Yuan-Wei Chen
28
4
0
02 Aug 2023
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Zhizhen Zhang
Xiao-Zhu Xie
Meng Yang
Ye Tian
Yong-jia Jiang
Yong Cui
29
5
0
28 Jul 2023
On the unreasonable vulnerability of transformers for image restoration -- and an easy fix
Shashank Agnihotri
Kanchana Vaishnavi Gandikota
Julia Grabinski
Paramanand Chandramouli
M. Keuper
34
9
0
25 Jul 2023
Hybrid-CSR: Coupling Explicit and Implicit Shape Representation for Cortical Surface Reconstruction
Shanlin Sun
Thanh-Tung Le
Chenyu You
Hao Tang
Kun Han
Haoyu Ma
Deying Kong
Xiangyi Yan
Xiaohui Xie
3DV
33
2
0
23 Jul 2023
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Mian
OffRL
70
538
0
12 Jul 2023
Deep learning for dynamic graphs: models and benchmarks
Alessio Gravina
D. Bacciu
GNN
AI4CE
45
11
0
12 Jul 2023
Encoder-Decoder Networks for Self-Supervised Pretraining and Downstream Signal Bandwidth Regression on Digital Antenna Arrays
R. Bhattacharjea
Nathan E. West
SSL
23
1
0
06 Jul 2023
Structured State Space Models for Multiple Instance Learning in Digital Pathology
Leo Fillioux
Joseph Boyd
Maria Vakalopoulou
P. Cournède
Stergios Christodoulidis
13
21
0
27 Jun 2023
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome
Zhihan Zhou
Yanrong Ji
Weijian Li
Pratik Dutta
R. Davuluri
Han Liu
24
174
0
26 Jun 2023
Intensity-free Convolutional Temporal Point Process: Incorporating Local and Global Event Contexts
Wangtao Zhou
Zhao Kang
Ling Tian
Yimu Su
38
11
0
24 Jun 2023
A Survey on Graph Neural Network Acceleration: Algorithms, Systems, and Customized Hardware
Shichang Zhang
Atefeh Sohrabizadeh
Cheng Wan
Zijie Huang
Ziniu Hu
Yewen Wang
Yingyan Lin
Lin
Jason Cong
Yizhou Sun
GNN
AI4CE
42
23
0
24 Jun 2023
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
Mingyu Cui
Jiawen Kang
Jiajun Deng
Xiaoyue Yin
Yutao Xie
Xie Chen
Xunying Liu
35
8
0
23 Jun 2023
Towards Stability of Autoregressive Neural Operators
Michael McCabe
P. Harrington
Shashank Subramanian
Jed Brown
AI4CE
44
17
0
18 Jun 2023
MCPI: Integrating Multimodal Data for Enhanced Prediction of Compound Protein Interactions
Li Zhang
Wenhao Li
Hao Guan
Zhiquan He
Min-Cai Cheng
Han Wang
24
0
0
15 Jun 2023
Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure
Weidong Ji
Shijie Zan
Guohui Zhou
Xu Wang
SyDa
27
1
0
14 Jun 2023
Top-Down Framework for Weakly-supervised Grounded Image Captioning
Chen Cai
Suchen Wang
Kim-Hui Yap
Yi Wang
ObjD
23
3
0
13 Jun 2023
Improving Knee Joint Angle Prediction through Dynamic Contextual Focus and Gated Linear Units
L. Saad Saoud
Humaid Ibrahim
Ahmad Aljarah
Irfan Hussain
AI4CE
29
1
0
12 Jun 2023
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
Dominik Wagner
Ilja Baumann
Tobias Bocklet
53
1
0
10 Jun 2023
A Dynamic Feature Interaction Framework for Multi-task Visual Perception
Yuling Xi
Hao Chen
Ning Wang
Peng Wang
Yanning Zhang
Chunhua Shen
Yifan Liu
41
4
0
08 Jun 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
32
7
0
02 Jun 2023
Graph Neural Network for spatiotemporal data: methods and applications
Yun Li
Dazhou Yu
Zhenke Liu
Minxing Zhang
Xi Gong
Liang Zhao
AI4TS
AI4CE
37
10
0
30 May 2023
Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs
Jacob Lindbäck
Zesen Wang
Mikael Johansson
OT
45
1
0
29 May 2023
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
33
0
0
28 May 2023
Flow Matching for Scalable Simulation-Based Inference
Maximilian Dax
J. Wildberger
Simon Buchholz
Stephen R. Green
Jakob H. Macke
Bernhard Schölkopf
34
49
0
26 May 2023
Neural Machine Translation for Mathematical Formulae
Felix Petersen
M. Schubotz
André Greiner-Petter
Bela Gipp
26
7
0
25 May 2023
Online learning of long-range dependencies
Nicolas Zucchet
Robert Meier
Simon Schug
Asier Mujika
João Sacramento
CLL
52
18
0
25 May 2023
Attention to Mean-Fields for Particle Cloud Generation
Benno Kach
I. Melzer-Pellmann
DiffM
24
15
0
24 May 2023
GNCformer Enhanced Self-attention for Automatic Speech Recognition
Jianxin Li
Z. Duan
S. Li
X. Yu
G. Yang
20
1
0
22 May 2023
Previous
1
2
3
4
5
6
...
17
18
19
Next