ResearchTrend.AI
arXiv:1612.08083
Language Modeling with Gated Convolutional Networks

23 December 2016
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier

Papers citing "Language Modeling with Gated Convolutional Networks"

50 / 915 papers shown
Simplicity is Key: An Unsupervised Pretraining Approach for Sparse Radio Channels
Jonathan Ott
Maximilian Stahlke
Tobias Feigl
Bjoern M. Eskofier
Christopher Mutschler
2
0
0
19 May 2025
Qwen3 Technical Report
An Yang
A. Li
Baosong Yang
Beichen Zhang
Binyuan Hui
...
Zekun Wang
Zeyu Cui
Zhenru Zhang
Zhenhong Zhou
Zihan Qiu
LLMAG
OSLM
LRM
50
0
0
14 May 2025
Large Language Models for Computer-Aided Design: A Survey
Licheng Zhang
Bach Le
Naveed Akhtar
Siew-Kei Lam
Tuan Ngo
3DV
AI4CE
40
0
0
13 May 2025
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
Xiaomi LLM-Core Team
Bingquan Xia
B. S.
Cici
Dawei Zhu
...
Yishuo Wang
Yue Yu
Zhenru Lin
Zhichao Song
Zihao Yue
MoE
ReLM
LRM
AI4CE
48
1
0
12 May 2025
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization
Detao Bai
Zhiheng Ma
Xihan Wei
Liefeng Bo
171
0
0
06 May 2025
Faster MoE LLM Inference for Extremely Large Models
Haoqi Yang
Luohe Shi
Qiwei Li
Zuchao Li
Ping Wang
Bo Du
Mengjia Shen
Hai Zhao
MoE
68
0
0
06 May 2025
Bielik v3 Small: Technical Report
Krzysztof Ociepa
Łukasz Flis
Remigiusz Kinas
Krzysztof Wróbel
Adrian Gwoździej
29
0
0
05 May 2025
Bielik 11B v2 Technical Report
Krzysztof Ociepa
Łukasz Flis
Krzysztof Wróbel
Adrian Gwoździej
Remigiusz Kinas
34
0
0
05 May 2025
Voice Cloning: Comprehensive Survey
Hussam Azzuni
Abdulmotaleb El Saddik
VLM
44
0
0
01 May 2025
TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution
Yue Li
Wei Liu
Dongdong Lin
44
0
0
29 Apr 2025
Revisiting Reset Mechanisms in Spiking Neural Networks for Sequential Modeling: Specialized Discretization for Binary Activated RNN
Enqi Zhang
MQ
208
0
0
24 Apr 2025
Compass-V2 Technical Report
Sophia Maria
MoE
LRM
41
0
0
22 Apr 2025
Multi-Scale Tensorial Summation and Dimensional Reduction Guided Neural Network for Edge Detection
Lei Xu
Mehmet Yamaç
Mete Ahishali
Moncef Gabbouj
32
0
0
22 Apr 2025
An Efficient Aerial Image Detection with Variable Receptive Fields
Liu Wenbin
37
0
0
21 Apr 2025
Protecting Your Voice: Temporal-aware Robust Watermarking
Yue Li
Weizhi Liu
Dongdong Lin
37
0
0
21 Apr 2025
Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing
Mehmet Yamaç
Muhammad Numan Yousaf
S. Kiranyaz
Moncef Gabbouj
28
1
0
17 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
AMNet: An Acoustic Model Network for Enhanced Mandarin Speech Synthesis
Yubing Cao
Yinfeng Yu
Yongming Li
Liejun Wang
29
0
0
12 Apr 2025
Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers
Jiawei Wu
Zhifei Yang
Zihan Wang
Zhi Jin
29
0
0
12 Apr 2025
Bidirectional Linear Recurrent Models for Sequence-Level Multisource Fusion
Qisai Liu
Zhanhong Jiang
Joshua R. Waite
Chao Liu
Aditya Balu
S. Sarkar
AI4TS
29
0
0
11 Apr 2025
STEI-PCN: an efficient pure convolutional network for traffic prediction via spatial-temporal encoding and inferring
Kai Hu
Zhidan Zhao
Zhifeng Hao
GNN
48
0
0
10 Apr 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
47
0
0
28 Mar 2025
IgCraft: A versatile sequence generation framework for antibody discovery and engineering
Matthew Greenig
Haowen Zhao
Vladimir Radenkovic
Aubin Ramon
Pietro Sormanni
49
0
0
25 Mar 2025
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Henrique Morimitsu
Xiaobin Zhu
Roberto M. Cesar Jr.
Xiangyang Ji
Xu-Cheng Yin
MDE
55
0
0
19 Mar 2025
MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance
Zihan Cao
Yu Zhong
Zihan Wang
Liang-Jian Deng
62
0
0
19 Mar 2025
DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal
Vaibhav Aggarwal
Ojasv Kamal
Abhinav Japesh
Zhijing Jin
Bernhard Schölkopf
52
1
0
18 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
59
0
0
18 Mar 2025
Accurate INT8 Training Through Dynamic Block-Level Fallback
Pengle Zhang
Jia Wei
Jintao Zhang
Jun-Jie Zhu
Jianfei Chen
MQ
82
4
0
13 Mar 2025
VWAP Execution with Signature-Enhanced Transformers: A Multi-Asset Learning Approach
Remi Genet
70
0
0
04 Mar 2025
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures
Hui Liu
Chen Jia
Fan Shi
Xu Cheng
Shengyong Chen
Mamba
47
0
0
03 Mar 2025
Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights
Haicheng Liao
Chengyue Wang
Kaiqun Zhu
Yilong Ren
Bolin Gao
Shengbo Eben Li
Chengzhong Xu
Zehan Li
72
2
0
27 Feb 2025
Similarity-Distance-Magnitude Universal Verification
Allen Schmaltz
UQCV
AAML
200
0
0
27 Feb 2025
Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations
Lucy Farnik
Tim Lawson
Conor Houghton
Laurence Aitchison
61
0
0
25 Feb 2025
Encryption-Friendly LLM Architecture
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
57
2
0
24 Feb 2025
ChordFormer: A Conformer-Based Architecture for Large-Vocabulary Audio Chord Recognition
Muhammad Waseem Akram
Stefano Dettori
V. Colla
Giorgio Buttazzo
57
0
0
17 Feb 2025
Is Long Range Sequential Modeling Necessary For Colorectal Tumor Segmentation?
Abhishek Srivastava
Koushik Biswas
Gorkem Durak
Gulsah Ozden
Mustafa Adli
Ulas Bagci
Mamba
3DV
39
0
0
10 Feb 2025
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro
Steven Abreu
Jonathan Timcheck
Philipp Stratmann
Andreas Wild
S. Shrestha
75
0
0
03 Feb 2025
Qwen2.5-1M Technical Report
An Yang
Bowen Yu
Chong Li
Dayiheng Liu
Fei Huang
...
Xingzhang Ren
Xinlong Yang
You Li
Zhiying Xu
Zizhuo Zhang
71
12
0
28 Jan 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Duc Hau Nguyen
Pascale Sébillot
57
5
0
23 Jan 2025
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
Riccardo Simionato
Stefano Fasciani
78
1
0
17 Jan 2025
CURing Large Models: Compression via CUR Decomposition
Sanghyeon Park
Soo-Mook Moon
41
0
0
08 Jan 2025
STContext: A Multifaceted Dataset for Developing Context-aware Spatio-temporal Crowd Mobility Prediction Models
Liyue Chen
Jiangyi Fang
Tengfei Liu
Fangyuan Gao
Leye Wang
AI4TS
36
0
0
08 Jan 2025
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation
Ji-Hoon Kim
Hong-Sun Yang
Yoon-Cheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
BDL
54
0
0
31 Dec 2024
BrainMAP: Learning Multiple Activation Pathways in Brain Networks
Song Wang
Zhenyu Lei
Zhen Tan
Jiaqi Ding
Xinyu Zhao
...
Guorong Wu
Tianlong Chen
Chen Chen
Aiying Zhang
Jundong Li
61
0
0
23 Dec 2024
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Benjamin Warner
Antoine Chaffin
Benjamin Clavié
Orion Weller
Oskar Hallström
...
Tom Aarsen
Nathan Cooper
Griffin Adams
Jeremy Howard
Iacopo Poli
93
82
0
18 Dec 2024
Code LLMs: A Taxonomy-based Survey
Nishat Raihan
Christian D. Newman
Marcos Zampieri
99
1
0
11 Dec 2024
Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking
Marco Federici
Davide Belli
M. V. Baalen
Amir Jalalirad
Andrii Skliar
Bence Major
Markus Nagel
Paul N. Whatmough
76
0
0
02 Dec 2024
End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning
Mahmoud M. Kishky
Hesham M. Eraqi
Khaled F. Elsayed
66
1
0
25 Nov 2024
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
Chunhui Zhang
Li Liu
Hao-Kai Wen
Xi Zhou
Yufei Wang
Mamba
108
2
0
24 Nov 2024
Selective Attention: Enhancing Transformer through Principled Context Control
Xuechen Zhang
Xiangyu Chang
Mingchen Li
A. Roy-Chowdhury
Jiacheng Chen
Samet Oymak
78
3
0
19 Nov 2024