Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

12 May 2018

Dan Jurafsky

Papers citing "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"

44 / 44 papers shown

Title
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare Pardis Sadat Zahraei Zahra Shakeri LM&MA 26 0 0 09 Oct 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary Takashi Morita 16 3 0 31 Jan 2024
Revisiting Topic-Guided Language Models Carolina Zheng Keyon Vafa David M. Blei BDL 27 1 0 04 Dec 2023
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior? Ari Holtzman Peter West Luke Zettlemoyer AI4CE 30 14 0 31 Jul 2023
Lost in the Middle: How Language Models Use Long Contexts Nelson F. Liu Kevin Lin John Hewitt Ashwin Paranjape Michele Bevilacqua Fabio Petroni Percy Liang RALM 38 1,403 0 06 Jul 2023
Revisiting Entropy Rate Constancy in Text Vivek Verma Nicholas Tomlin Dan Klein 16 4 0 20 May 2023
HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time Series Forecasting Ryo Umagami Yusuke Ono Yusuke Mukuta Tatsuya Harada AI4TS 37 3 0 14 May 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT Yihan Cao Siyu Li Yixin Liu Zhiling Yan Yutong Dai Philip S. Yu Lichao Sun 29 506 0 07 Mar 2023
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection Weijia Xu Sweta Agrawal Eleftheria Briakou Marianna J. Martindale Marine Carpuat HILM 24 46 0 18 Jan 2023
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation Tianxing He Jingyu Zhang Tianle Wang Sachin Kumar Kyunghyun Cho James R. Glass Yulia Tsvetkov 40 44 0 20 Dec 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems Songbo Hu Ivan Vulić Fangyu Liu Anna Korhonen 34 0 0 07 Nov 2022
Circling Back to Recurrent Models of Language Gábor Melis 32 0 0 03 Nov 2022
On the Explainability of Natural Language Processing Deep Models Julia El Zini M. Awad 27 82 0 13 Oct 2022
Dynamic Global Memory for Document-level Argument Extraction Xinya Du Sha Li Heng Ji 13 37 0 18 Sep 2022
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot Learning for Human-AI Interaction in Creative Applications of Generative Models Hai Dang Lukas Mecke Florian Lehmann Sven Goller Daniel Buschek 20 97 0 03 Sep 2022
AA-Forecast: Anomaly-Aware Forecast for Extreme Events Ashkan Farhangi Jiang Bian Arthur Huang Haoyi Xiong Jun Wang Zhi-guo Guo AI4TS 26 4 0 21 Aug 2022
RankGen: Improving Text Generation with Large Ranking Models Kalpesh Krishna Yapei Chang John Wieting Mohit Iyyer AIMat 21 68 0 19 May 2022
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue Generation Yu Cao Wei Bi Meng Fang Shuming Shi Dacheng Tao 24 48 0 21 Apr 2022
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models Sunit Bhattacharya Rishu Kumar Ondrej Bojar 13 2 0 11 Apr 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models Ryokan Ri Yoshimasa Tsuruoka 29 25 0 19 Mar 2022
Block-Recurrent Transformers DeLesley S. Hutchins Imanol Schlag Yuhuai Wu Ethan Dyer Behnam Neyshabur 18 94 0 11 Mar 2022
Effective Cross-Utterance Language Modeling for Conversational Speech Recognition Bi-Cheng Yan Hsin-Wei Wang Shih-Hsuan Chiu Hsuan-Sheng Chiu Berlin Chen 15 1 0 05 Nov 2021
Coherence boosting: When your pretrained language model is not paying enough attention Nikolay Malkin Zhen Wang Nebojsa Jojic RALM 19 35 0 15 Oct 2021
A surprisal--duration trade-off across and within the world's languages Tiago Pimentel Clara Meister Elizabeth Salesky Simone Teufel Damián E. Blasi Ryan Cotterell LRM 109 29 0 30 Sep 2021
Do Long-Range Language Models Actually Use Long-Range Context? Simeng Sun Kalpesh Krishna Andrew Mattarella-Micke Mohit Iyyer RALM 25 80 0 19 Sep 2021
Studying word order through iterative shuffling Nikolay Malkin Sameera Lanka Pranav Goel Nebojsa Jojic 31 14 0 10 Sep 2021
What Context Features Can Transformer Language Models Use? J. O'Connor Jacob Andreas KELM 21 75 0 15 Jun 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition Shih-Hsuan Chiu Tien-Hong Lo Fu-An Chao Berlin Chen BDL 30 10 0 13 Jun 2021
Adapting Long Context NLM for ASR Rescoring in Conversational Agents Ashish Shenoy S. Bodapati Monica Sunkara S. Ronanki Katrin Kirchhoff 21 21 0 21 Apr 2021
Context Dependent Semantic Parsing: A Survey Zhuang Li Lizhen Qu Gholamreza Haffari 16 19 0 02 Nov 2020
Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling Wenxuan Zhou Kevin Huang Tengyu Ma Jing Huang 18 273 0 21 Oct 2020
Multi-timescale Representation Learning in LSTM Language Models Shivangi Mahto Vy A. Vo Javier S. Turek Alexander G. Huth 15 29 0 27 Sep 2020
Towards Full-line Code Completion with Neural Language Models Wenhan Wang Sijie Shen Ge Li Zhi Jin 16 16 0 18 Sep 2020
Recurrent Quantum Neural Networks Johannes Bausch 21 151 0 25 Jun 2020
Phonotactic Complexity and its Trade-offs Tiago Pimentel Brian Roark Ryan Cotterell 17 37 0 07 May 2020
Temporal Convolutional Attention-based Network For Sequence Modeling Hongyan Hao Yan Wang Siqiao Xue Yudi Xia Jian Zhao S. Furao 16 41 0 28 Feb 2020
Lattice Transformer for Speech Translation Pei Zhang Boxing Chen Niyu Ge Kai Fan 34 48 0 13 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention Kevin Clark Urvashi Khandelwal Omer Levy Christopher D. Manning MILM 45 1,580 0 11 Jun 2019
Guiding Extractive Summarization with Question-Answering Rewards Kristjan Arumae Fei Liu 31 33 0 04 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations Nelson F. Liu Matt Gardner Yonatan Belinkov Matthew E. Peters Noah A. Smith 40 717 0 21 Mar 2019
Trellis Networks for Sequence Modeling Shaojie Bai J. Zico Kolter V. Koltun 17 145 0 15 Oct 2018
Information-Weighted Neural Cache Language Models for ASR Lyan Verwimp J. Pelemans Hugo Van hamme P. Wambacq KELM RALM 9 2 0 24 Sep 2018
Neural Approaches to Conversational AI Jianfeng Gao Michel Galley Lihong Li 37 668 0 21 Sep 2018
On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition Hao Tang James R. Glass 10 19 0 09 Jul 2018