ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.08083
  4. Cited By
Language Modeling with Gated Convolutional Networks
v1v2v3 (latest)

Language Modeling with Gated Convolutional Networks

International Conference on Machine Learning (ICML), 2016
23 December 2016
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
ArXiv (abs)PDFHTML

Papers citing "Language Modeling with Gated Convolutional Networks"

50 / 990 papers shown
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Yifan Zhou
Takehiko Ohkawa
Guwenxiao Zhou
Kanoko Goto
Takumi Hirose
Yusuke Sekikawa
Nakamasa Inoue
3DHMamba
496
0
0
02 Dec 2025
Flexible Gravitational-Wave Parameter Estimation with Transformers
Flexible Gravitational-Wave Parameter Estimation with Transformers
Annalena Kofler
Maximilian Dax
Stephen R. Green
J. Wildberger
N. Gupte
Jakob H. Macke
J. Gair
A. Buonanno
Bernhard Scholkopf
106
1
0
02 Dec 2025
Rectifying LLM Thought from Lens of Optimization
Rectifying LLM Thought from Lens of Optimization
J. Liu
Hongwei Liu
Songyang Zhang
Kai Chen
LRM
177
2
0
01 Dec 2025
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
Ruosen Zhao
Zhikang Zhang
Jialei Xu
Jiahao Chang
Dong Chen
Lingyun Li
Weijian Sun
Zizhuang Wei
VLMLRM
284
4
0
28 Nov 2025
Ovis-Image Technical Report
Ovis-Image Technical Report
Guo-Hua Wang
Liangfu Cao
Tianyu Cui
Minghao Fu
Xiaohao Chen
...
Jianshan Zhao
Lan Li
Bowen Fu
Jiaqi Liu
Qing-Guo Chen
VLM
629
6
0
28 Nov 2025
Estimating the Event-Related Potential from Few EEG Trials
Estimating the Event-Related Potential from Few EEG Trials
Anders Vestergaard Nørskov
Kasper Jørgensen
Alexander Neergaard Zahid
Morten Mørup
127
0
0
28 Nov 2025
AdaCap: An Adaptive Contrastive Approach for Small-Data Neural Networks
AdaCap: An Adaptive Contrastive Approach for Small-Data Neural Networks
Bruno Belucci
Karim Lounici
Katia Méziani
139
0
0
25 Nov 2025
PAST: A Primary-Auxiliary Spatio-Temporal Network for Traffic Time Series Imputation
PAST: A Primary-Auxiliary Spatio-Temporal Network for Traffic Time Series Imputation
Hanwen Hu
Zimo Wen
Shiyou Qian
Jian Co
AI4TS
169
0
0
17 Nov 2025
EmbryoDiff: A Conditional Diffusion Framework with Multi-Focal Feature Fusion for Fine-Grained Embryo Developmental Stage Recognition
EmbryoDiff: A Conditional Diffusion Framework with Multi-Focal Feature Fusion for Fine-Grained Embryo Developmental Stage RecognitionIEEE International Joint Conference on Neural Network (IJCNN), 2025
Yong Sun
Zhengjie Zhang
Junyu Shi
Zhiyuan Zhang
Lijiang Liu
Qiang Nie
245
0
0
14 Nov 2025
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
Sanaz Saki Norouzi
Mohammad Masjedi
Pascal Hitzler
167
0
0
09 Nov 2025
Cambrian-S: Towards Spatial Supersensing in Video
Cambrian-S: Towards Spatial Supersensing in Video
Shusheng Yang
J. Yang
Pinzhi Huang
Ellis L Brown
Zihao Yang
...
Daohan Lu
Rob Fergus
Yann LeCun
Li Fei-Fei
Saining Xie
216
43
0
06 Nov 2025
Multimodal Reasoning via Latent Refocusing
Multimodal Reasoning via Latent Refocusing
Jizheng Ma
Xiaofei Zhou
Yanlong Song
Han Yan
Han Yan
VLMLRM
265
1
0
04 Nov 2025
Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning
Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning
Farhad Rezazadeh
Hatim Chergui
Mérouane Debbah
Houbing Song
Dusit Niyato
Lingjia Liu
204
2
0
04 Nov 2025
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An
Kristen Grauman
Mamba
295
0
0
27 Oct 2025
Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
Marianne Arriola
Yair Schiff
Hao Phung
Aaron Gokaslan
Volodymyr Kuleshov
196
10
0
26 Oct 2025
Unified token representations for sequential decision models
Unified token representations for sequential decision models
Zhuojing Tian
Yushu Chen
OffRL
146
0
0
24 Oct 2025
A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment
A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment
Huatian Gong
Jiuh-Biing Sheu
Zheng Wang
Xiaoguang Yang
Ran Yan
AI4CE
264
0
0
24 Oct 2025
GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data
GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data
Yudong Li
Hao Li
Xianxu Hou
Linlin Shen
200
0
0
21 Oct 2025
Finding Manifolds With Bilinear Autoencoders
Finding Manifolds With Bilinear Autoencoders
Thomas Dooms
Ward Gauderis
161
2
0
19 Oct 2025
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
Haiwen Diao
Mingxuan Li
Silei Wu
Linjun Dai
Xiaohua Wang
Hanming Deng
Lewei Lu
Dahua Lin
Ziwei Liu
VLM
215
4
0
16 Oct 2025
CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations
CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations
Caner Korkmaz
Brighton Nuwagira
Barış Coşkunuzer
Tolga Birdal
187
4
0
14 Oct 2025
Simple Projection Variants Improve ColBERT Performance
Simple Projection Variants Improve ColBERT Performance
Benjamin Clavié
Sean Lee
Rikiya Takehi
Aamir Shakir
Makoto P. Kato
186
2
0
14 Oct 2025
BioOSS: A Bio-Inspired Oscillatory State System with Spatio-Temporal Dynamics
BioOSS: A Bio-Inspired Oscillatory State System with Spatio-Temporal Dynamics
Zhongju Yuan
Geraint Wiggins
Dick Botteldooren
143
0
0
12 Oct 2025
Recurrence-Complete Frame-based Action Models
Recurrence-Complete Frame-based Action Models
Michael Keiblinger
169
2
0
08 Oct 2025
Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
Peter Ochieng
111
2
0
07 Oct 2025
Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models
Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models
JingChuan Guan
T. Kubota
Yasuo Kuniyoshi
Kohei Nakajima
111
0
0
01 Oct 2025
ASTROCO: Self-Supervised Conformer-Style Transformers for Light-Curve Embeddings
ASTROCO: Self-Supervised Conformer-Style Transformers for Light-Curve Embeddings
Antony Tan
P. Protopapas
Martina Cádiz-Leyton
Guillermo Cabrera-Vives
Cristobal Donoso-Oliva
I. Becker
121
0
0
29 Sep 2025
U-MAN: U-Net with Multi-scale Adaptive KAN Network for Medical Image Segmentation
U-MAN: U-Net with Multi-scale Adaptive KAN Network for Medical Image Segmentation
Bohan Huang
Qianyun Bao
Haoyuan Ma
SSeg
398
2
0
26 Sep 2025
Multilingual Vision-Language Models, A Survey
Multilingual Vision-Language Models, A Survey
Andrei-Alexandru Manea
Jindřich Libovický
VLM
206
1
0
26 Sep 2025
LEMs: A Primer On Large Execution Models
LEMs: A Primer On Large Execution Models
Remi Genet
Hugo Inzirillo
150
1
0
21 Sep 2025
Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies
Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies
Yanan Niu
D. Psaltis
C. Moser
Luisa Lambertini
135
0
0
18 Sep 2025
Curriculum Learning for Mesh-based simulations
Curriculum Learning for Mesh-based simulations
Paul Garnier
Vincent Lannelongue
E. Hachem
AI4CE
124
1
0
16 Sep 2025
Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens
Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens
Sohee Kim
Soohyun Ryu
Joonhyung Park
Eunho Yang
177
0
0
03 Sep 2025
Preserving Bilinear Weight Spectra with a Signed and Shrunk Quadratic Activation Function
Preserving Bilinear Weight Spectra with a Signed and Shrunk Quadratic Activation Function
Jason Abohwo
Thomas Mosen
87
0
0
02 Sep 2025
Provable Benefits of In-Tool Learning for Large Language Models
Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston
Ambroise Odonnat
Charles Arnal
Vivien A. Cabannes
RALM
174
2
0
28 Aug 2025
Automated discovery of finite volume schemes using Graph Neural Networks
Automated discovery of finite volume schemes using Graph Neural Networks
Paul Garnier
J. Viquerat
E. Hachem
160
0
0
26 Aug 2025
Vocoder-Projected Feature Discriminator
Vocoder-Projected Feature Discriminator
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
250
1
0
25 Aug 2025
FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation
FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
124
1
0
25 Aug 2025
Training Transformers for Mesh-Based Simulations
Training Transformers for Mesh-Based Simulations
Paul Garnier
Vincent Lannelongue
J. Viquerat
E. Hachem
AI4CE
139
2
0
25 Aug 2025
Borrowing From the Future: Enhancing Early Risk Assessment through Contrastive Learning
Borrowing From the Future: Enhancing Early Risk Assessment through Contrastive Learning
Minghui Sun
Matthew M. Engelhard
Benjamin A. Goldstein
161
0
0
15 Aug 2025
Facilitating Personalized TTS for Dysarthric Speakers Using Knowledge Anchoring and Curriculum Learning
Facilitating Personalized TTS for Dysarthric Speakers Using Knowledge Anchoring and Curriculum Learning
Yejin Jeon
Solee Im
Youngjae Kim
G. G. Lee
194
2
0
14 Aug 2025
G-IFT: A Gated Linear Unit adapter with Iterative Fine-Tuning for Low-Resource Children's Speaker Verification
G-IFT: A Gated Linear Unit adapter with Iterative Fine-Tuning for Low-Resource Children's Speaker VerificationWorkshop on Child, Computer and Interaction (CCI), 2025
Vishwas M. Shetty
Jiusi Zheng
Abeer Alwan
157
0
0
11 Aug 2025
Quantum Temporal Fusion Transformer
Quantum Temporal Fusion Transformer
Krishnakanta Barik
Goutam Paul
AI4TS
274
0
0
06 Aug 2025
Parameter-Efficient Routed Fine-Tuning: Mixture-of-Experts Demands Mixture of Adaptation Modules
Parameter-Efficient Routed Fine-Tuning: Mixture-of-Experts Demands Mixture of Adaptation Modules
Yilun Liu
Yunpu Ma
Yuetian Lu
Shuo Chen
Zifeng Ding
Volker Tresp
MoE
153
0
0
04 Aug 2025
RelMap: Reliable Spatiotemporal Sensor Data Visualization via Imputative Spatial Interpolation
RelMap: Reliable Spatiotemporal Sensor Data Visualization via Imputative Spatial Interpolation
Juntong Chen
Huayuan Ye
He Zhu
Siwei Fu
Changbo Wang
Chenhui Li
271
1
0
02 Aug 2025
LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection
Jing Ren
Suyu Ma
Hong Jia
Xiwei Xu
Ivan Lee
Haytham Fayek
Xiaodong Li
Feng Xia
300
2
0
29 Jul 2025
Innovator: Scientific Continued Pretraining with Fine-grained MoE Upcycling
Innovator: Scientific Continued Pretraining with Fine-grained MoE Upcycling
Ning Liao
Xiaoxing Wang
Peng Liu
Weiyang Guo
Feng Hong
...
Junchi Yan
Zhiyu Li
Feiyu Xiong
Yanfeng Wang
Linfeng Zhang
CLL
279
2
0
24 Jul 2025
Adaptive Neural Quantum States: A Recurrent Neural Network Perspective
Adaptive Neural Quantum States: A Recurrent Neural Network Perspective
Jake McNaughton
Mohamed Hibat-Allah
106
0
0
24 Jul 2025
The Early Bird Identifies the Worm: You Can't Beat a Head Start in Long-Term Body Re-ID (ECHO-BID)
The Early Bird Identifies the Worm: You Can't Beat a Head Start in Long-Term Body Re-ID (ECHO-BID)
Thomas M. Metz
Matthew Q. Hill
A. O’toole
328
1
0
23 Jul 2025
Supernova: Achieving More with Less in Transformer Architectures
Supernova: Achieving More with Less in Transformer Architectures
Andrei-Valentin Tanase
Elena Pelican
201
0
0
21 Jul 2025
1234...181920
Next
Page 1 of 20
Pageof 20