Generating abstractive summaries of Lithuanian news articles using a
transformer modelInternational Conference on Information and Software Technologies (ICIST), 2021 |
The Power of Scale for Parameter-Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Comparison of Grammatical Error Correction Using Back-Translation ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 |
Planning with Learned Entity Prompts for Abstractive SummarizationTransactions of the Association for Computational Linguistics (TACL), 2021 |
Pushing the Limits of Non-Autoregressive Speech RecognitionInterspeech (Interspeech), 2021 |
Efficient Attentions for Long Document SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 |
Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in
LanguageConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Do Transformer Modifications Transfer Across Implementations and
Applications?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Civil Rephrases Of Toxic Texts With Self-Supervised TransformersConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |
Analyzing Commonsense Emergence in Few-shot Knowledge ModelsConference on Automated Knowledge Base Construction (AKBC), 2021 |
Promoting Graph Awareness in Linearized Graph-to-Text GenerationFindings (Findings), 2020 |
AraGPT2: Pre-Trained Transformer for Arabic Language GenerationWorkshop on Arabic Natural Language Processing (WANLP), 2020 |
Few-Shot Text Generation with Pattern-Exploiting TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
Contrastive Learning with Adversarial Perturbations for Conditional Text
GenerationInternational Conference on Learning Representations (ICLR), 2020 |
Collaborative Storytelling with Large-scale Neural Language ModelsMotion in Games (MIG), 2020 |
Whale: Efficient Giant Model Training over Heterogeneous GPUsUSENIX Annual Technical Conference (USENIX ATC), 2020 |
Stochastic Optimization with Laggard Data PipelinesNeural Information Processing Systems (NeurIPS), 2020 |
GO FIGURE: A Meta Evaluation of Factuality in SummarizationFindings (Findings), 2020 |
Effects of Parameter Norm Growth During Transformer Training: Inductive
Bias from Gradient DescentConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
Seq2Edits: Sequence Transduction Using Span-level Edit OperationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
Data Weighted Training Strategies for Grammatical Error CorrectionTransactions of the Association for Computational Linguistics (TACL), 2020 |
A Comparison of Optimization Algorithms for Deep LearningInternational journal of pattern recognition and artificial intelligence (IJPRAI), 2020 |
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine LearningAAAI Conference on Artificial Intelligence (AAAI), 2020 |
Recipes for building an open-domain chatbotConference of the European Chapter of the Association for Computational Linguistics (EACL), 2020 |
Efficient Content-Based Sparse Attention with Routing TransformersTransactions of the Association for Computational Linguistics (TACL), 2020 |