All Papers
Title |
|---|
Hierarchical Transformers for Long Document Classification. Automatic Speech Recognition & Understanding (ASRU), 2019 |
Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Deja-vu: Double Feature Presentation and Iterated Loss in Deep Transformer Networks. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Improving the Gating Mechanism of Recurrent Neural Networks. International Conference on Machine Learning (ICML), 2019 |
Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
A Mutual Information Maximization Perspective of Language Representation Learning. International Conference on Learning Representations (ICLR), 2019 |
BIG MOOD: Relating Transformers to Explicit Commonsense Knowledge. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
Transformer ASR with Contextual Block Processing. Automatic Speech Recognition & Understanding (ASRU), 2019 |
A multi-label, dual-output deep neural network for automated bug triaging. International Conference on Machine Learning and Applications (ICMLA), 2019 |
Stabilizing Transformers for Reinforcement Learning. International Conference on Machine Learning (ICML), 2019 |
Deep Transfer Learning for Source Code Modeling. International Journal of Software Engineering and Knowledge Engineering (IJSEKE), 2019 |
Structured Pruning of Large Language Models. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models. Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2019 |
Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization. Conference on Machine Learning and Systems (MLSys), 2019 |
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions. Automatic Speech Recognition & Understanding (ASRU), 2019 |
A Constructive Prediction of the Generalization Error Across Scales. International Conference on Learning Representations (ICLR), 2019 |
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. International Conference on Learning Representations (ICLR), 2019. H. F. Song, A. Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, ... Dhruva Tirumala, N. Heess, Dan Belov, Martin Riedmiller, M. Botvinick |
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations. International Conference on Learning Representations (ICLR), 2019 |
Reducing Transformer Depth on Demand with Structured Dropout. International Conference on Learning Representations (ICLR), 2019 |
Gap Aware Mitigation of Gradient Staleness. International Conference on Learning Representations (ICLR), 2019 |
Knowledge-Enriched Transformer for Emotion Detection in Textual Conversations. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
A Random Gossip BMUF Process for Neural Language Modeling. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 |
Span-based Joint Entity and Relation Extraction with Transformer Pre-training. European Conference on Artificial Intelligence (ECAI), 2019 |
K-BERT: Enabling Language Representation with Knowledge Graph. AAAI Conference on Artificial Intelligence (AAAI), 2019 |
Ouroboros: On Accelerating Training of Transformer-Based Language Models. Neural Information Processing Systems (NeurIPS), 2019 |
How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations. International Conference on Information and Knowledge Management (CIKM), 2019 |
Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings. Computer Speech and Language (CSL), 2019 |
A Quantum Search Decoder for Natural Language Processing. Quantum Machine Intelligence (QMI), 2019 |
Forecaster: A Graph Transformer for Forecasting Spatial and Time-Dependent Data. European Conference on Artificial Intelligence (ECAI), 2019 |
PaLM: A Hybrid Parser and Language Model. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
Deep Equilibrium Models. Neural Information Processing Systems (NeurIPS), 2019 |
Language Models as Knowledge Bases? Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |
Subword Language Model for Query Auto-Completion. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 |