ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.03059
  4. Cited By
Depthwise Separable Convolutions for Neural Machine Translation

Depthwise Separable Convolutions for Neural Machine Translation

9 June 2017
Lukasz Kaiser
Aidan N. Gomez
François Chollet
ArXivPDFHTML

Papers citing "Depthwise Separable Convolutions for Neural Machine Translation"

27 / 27 papers shown
Title
Pluggable Style Representation Learning for Multi-Style Transfer
Pluggable Style Representation Learning for Multi-Style Transfer
Hongda Liu
Longguang Wang
Weijun Guan
Ye Zhang
Yulan Guo
69
1
0
26 Mar 2025
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech
  Enhancement
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement
Keying Zuo
Qingtian Xu
Jie Zhang
Zhenhua Ling
36
0
0
19 Sep 2024
COVID-Net Assistant: A Deep Learning-Driven Virtual Assistant for
  COVID-19 Symptom Prediction and Recommendation
COVID-Net Assistant: A Deep Learning-Driven Virtual Assistant for COVID-19 Symptom Prediction and Recommendation
Peng Shi
Yuetong Wang
S. Abbasi
Alexander Wong
21
0
0
22 Nov 2022
Learning on tree architectures outperforms a convolutional feedforward
  network
Learning on tree architectures outperforms a convolutional feedforward network
Yuval Meir
Itamar Ben-Noam
Yarden Tzach
Shiri Hodassman
Ido Kanter
AI4CE
11
6
0
21 Nov 2022
DeepLSS: breaking parameter degeneracies in large scale structure with
  deep learning analysis of combined probes
DeepLSS: breaking parameter degeneracies in large scale structure with deep learning analysis of combined probes
T. Kacprzak
J. Fluri
11
12
0
17 Mar 2022
SmartSplit: Latency-Energy-Memory Optimisation for CNN Splitting on
  Smartphone Environment
SmartSplit: Latency-Energy-Memory Optimisation for CNN Splitting on Smartphone Environment
I. Prakash
Aniruddh Bansal
Rohit Verma
R. Shorey
15
8
0
01 Nov 2021
Rethinking Token-Mixing MLP for MLP-based Vision Backbone
Rethinking Token-Mixing MLP for MLP-based Vision Backbone
Tan Yu
Xu Li
Yunfeng Cai
Mingming Sun
Ping Li
45
26
0
28 Jun 2021
Scalable Transformers for Neural Machine Translation
Scalable Transformers for Neural Machine Translation
Peng Gao
Shijie Geng
Yu Qiao
Xiaogang Wang
Jifeng Dai
Hongsheng Li
31
13
0
04 Jun 2021
Security Vulnerability Detection Using Deep Learning Natural Language
  Processing
Security Vulnerability Detection Using Deep Learning Natural Language Processing
Noah Ziems
Shaoen Wu
17
55
0
06 May 2021
BeamLearning: an end-to-end Deep Learning approach for the angular
  localization of sound sources using raw multichannel acoustic pressure data
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
44
22
0
27 Apr 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
15
105
0
31 Dec 2020
Speech Command Recognition in Computationally Constrained Environments
  with a Quadratic Self-organized Operational Layer
Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
M. Soltanian
Junaid Malik
Jenni Raitoharju
Alexandros Iosifidis
S. Kiranyaz
Denmark
16
11
0
23 Nov 2020
Efficient Transformers: A Survey
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
82
1,101
0
14 Sep 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
32
156
0
06 Aug 2020
Improving Robustness using Joint Attention Network For Detecting Retinal
  Degeneration From Optical Coherence Tomography Images
Improving Robustness using Joint Attention Network For Detecting Retinal Degeneration From Optical Coherence Tomography Images
Sharif Amit Kamran
Alireza Tavakkoli
S. Zuckerbrod
13
25
0
16 May 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
X. Chu
VLM
AI4TS
AI4CE
19
138
0
18 Feb 2020
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
17
19
0
30 Sep 2019
On NMT Search Errors and Model Errors: Cat Got Your Tongue?
On NMT Search Errors and Model Errors: Cat Got Your Tongue?
Felix Stahlberg
Bill Byrne
LRM
11
152
0
27 Aug 2019
Deep Learning Based Chatbot Models
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
31
227
0
25 Apr 2019
C3: Concentrated-Comprehensive Convolution and its application to
  semantic segmentation
C3: Concentrated-Comprehensive Convolution and its application to semantic segmentation
Hyojin Park
Y. Yoo
Geonseok Seo
Dongyoon Han
Sangdoo Yun
Nojun Kwak
SSeg
24
9
0
12 Dec 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
16
1,746
0
20 Sep 2018
Universal Transformers
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
28
742
0
10 Jul 2018
Tensor2Tensor for Neural Machine Translation
Tensor2Tensor for Neural Machine Translation
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan N. Gomez
...
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
34
526
0
16 Mar 2018
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling
  Approaches for Large-scale Video Classification
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification
Yunlong Bian
Chuang Gan
Xiao-Chang Liu
Fu Li
Xiang Long
Yandong Li
Heng Qi
Jie Zhou
Shilei Wen
Yuanqing Lin
18
48
0
12 Aug 2017
One Model To Learn Them All
One Model To Learn Them All
Lukasz Kaiser
Aidan N. Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
VLM
ViT
17
333
0
16 Jun 2017
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1