ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1701.06548
  4. Cited By
Regularizing Neural Networks by Penalizing Confident Output
  Distributions

Regularizing Neural Networks by Penalizing Confident Output Distributions

23 January 2017
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
    NoLa
ArXivPDFHTML

Papers citing "Regularizing Neural Networks by Penalizing Confident Output Distributions"

23 / 173 papers shown
Title
Sequence to Sequence Mixture Model for Diverse Machine Translation
Sequence to Sequence Mixture Model for Diverse Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
12
57
0
17 Oct 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
6
333
0
22 Sep 2018
Maximum-Entropy Fine-Grained Classification
Maximum-Entropy Fine-Grained Classification
Abhimanyu Dubey
O. Gupta
Ramesh Raskar
Nikhil Naik
6
156
0
16 Sep 2018
Distilled Wasserstein Learning for Word Embedding and Topic Modeling
Distilled Wasserstein Learning for Word Embedding and Topic Modeling
Hongteng Xu
Wenlin Wang
W. Liu
Lawrence Carin
MedIm
FedML
27
84
0
12 Sep 2018
Weakly-Supervised Convolutional Neural Networks for Multimodal Image
  Registration
Weakly-Supervised Convolutional Neural Networks for Multimodal Image Registration
Yipeng Hu
Marc Modat
Eli Gibson
Wenqi Li
N. Ghavami
...
M. Emberton
Sébastien Ourselin
J. A. Noble
D. Barratt
Tom Kamiel Magda Vercauteren
20
381
0
09 Jul 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech
  Recognition in Mandarin
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
Linhao Dong
Shiyu Zhou
Wei Chen
Bo Xu
19
22
0
17 Jun 2018
Spreading vectors for similarity search
Spreading vectors for similarity search
Alexandre Sablayrolles
Matthijs Douze
Cordelia Schmid
Hervé Jégou
MQ
19
114
0
08 Jun 2018
Scaling Neural Machine Translation
Scaling Neural Machine Translation
Myle Ott
Sergey Edunov
David Grangier
Michael Auli
AIMat
19
610
0
01 Jun 2018
Measuring and regularizing networks in function space
Measuring and regularizing networks in function space
Ari S. Benjamin
David Rolnick
Konrad Paul Kording
19
137
0
21 May 2018
Knowledge Distillation in Generations: More Tolerant Teachers Educate
  Better Students
Knowledge Distillation in Generations: More Tolerant Teachers Educate Better Students
Chenglin Yang
Lingxi Xie
Siyuan Qiao
Alan Yuille
25
135
0
15 May 2018
SHADE: Information Based Regularization for Deep Learning
SHADE: Information Based Regularization for Deep Learning
Michael Blot
Thomas Robert
Nicolas Thome
Matthieu Cord
10
12
0
29 Apr 2018
ESPnet: End-to-End Speech Processing Toolkit
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Matthew Wiesner
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
6
1,477
0
30 Mar 2018
Learning Representations for Neural Network-Based Classification Using
  the Information Bottleneck Principle
Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle
Rana Ali Amjad
Bernhard C. Geiger
23
195
0
27 Feb 2018
MINE: Mutual Information Neural Estimation
MINE: Mutual Information Neural Estimation
Mohamed Ishmael Belghazi
A. Baratin
Sai Rajeswar
Sherjil Ozair
Yoshua Bengio
Aaron Courville
R. Devon Hjelm
DRL
18
1,248
0
12 Jan 2018
Gradient Regularization Improves Accuracy of Discriminative Models
Gradient Regularization Improves Accuracy of Discriminative Models
D. Varga
Adrián Csiszárik
Zsolt Zombori
18
53
0
28 Dec 2017
Automatic segmentation method of pelvic floor levator hiatus in
  ultrasound using a self-normalising neural network
Automatic segmentation method of pelvic floor levator hiatus in ultrasound using a self-normalising neural network
E. Bonmati
Yipeng Hu
Nikhil Sindhwani
H. Dietz
Jan D'hooge
D. Barratt
Jan Deprest
Tom Kamiel Magda Vercauteren
21
30
0
18 Dec 2017
Training Confidence-calibrated Classifiers for Detecting
  Out-of-Distribution Samples
Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples
Kimin Lee
Honglak Lee
Kibok Lee
Jinwoo Shin
OODD
23
870
0
26 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
48
185
0
14 Nov 2017
Intriguing Properties of Adversarial Examples
Intriguing Properties of Adversarial Examples
E. D. Cubuk
Barret Zoph
S. Schoenholz
Quoc V. Le
AAML
23
84
0
08 Nov 2017
mixup: Beyond Empirical Risk Minimization
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
43
9,587
0
25 Oct 2017
Towards better decoding and language model integration in sequence to
  sequence models
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
6
368
0
08 Dec 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,923
0
17 Aug 2015
Previous
1234