Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.04623
Cited By
Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context
12 May 2018
Urvashi Khandelwal
He He
Peng Qi
Dan Jurafsky
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"
44 / 44 papers shown
Title
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare
Pardis Sadat Zahraei
Zahra Shakeri
LM&MA
26
0
0
09 Oct 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary
Takashi Morita
16
3
0
31 Jan 2024
Revisiting Topic-Guided Language Models
Carolina Zheng
Keyon Vafa
David M. Blei
BDL
27
1
0
04 Dec 2023
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?
Ari Holtzman
Peter West
Luke Zettlemoyer
AI4CE
30
14
0
31 Jul 2023
Lost in the Middle: How Language Models Use Long Contexts
Nelson F. Liu
Kevin Lin
John Hewitt
Ashwin Paranjape
Michele Bevilacqua
Fabio Petroni
Percy Liang
RALM
38
1,403
0
06 Jul 2023
Revisiting Entropy Rate Constancy in Text
Vivek Verma
Nicholas Tomlin
Dan Klein
16
4
0
20 May 2023
HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time Series Forecasting
Ryo Umagami
Yusuke Ono
Yusuke Mukuta
Tatsuya Harada
AI4TS
37
3
0
14 May 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
29
506
0
07 Mar 2023
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Weijia Xu
Sweta Agrawal
Eleftheria Briakou
Marianna J. Martindale
Marine Carpuat
HILM
24
46
0
18 Jan 2023
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
40
44
0
20 Dec 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
Songbo Hu
Ivan Vulić
Fangyu Liu
Anna Korhonen
34
0
0
07 Nov 2022
Circling Back to Recurrent Models of Language
Gábor Melis
32
0
0
03 Nov 2022
On the Explainability of Natural Language Processing Deep Models
Julia El Zini
M. Awad
27
82
0
13 Oct 2022
Dynamic Global Memory for Document-level Argument Extraction
Xinya Du
Sha Li
Heng Ji
13
37
0
18 Sep 2022
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot Learning for Human-AI Interaction in Creative Applications of Generative Models
Hai Dang
Lukas Mecke
Florian Lehmann
Sven Goller
Daniel Buschek
20
97
0
03 Sep 2022
AA-Forecast: Anomaly-Aware Forecast for Extreme Events
Ashkan Farhangi
Jiang Bian
Arthur Huang
Haoyi Xiong
Jun Wang
Zhi-guo Guo
AI4TS
26
4
0
21 Aug 2022
RankGen: Improving Text Generation with Large Ranking Models
Kalpesh Krishna
Yapei Chang
John Wieting
Mohit Iyyer
AIMat
21
68
0
19 May 2022
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue Generation
Yu Cao
Wei Bi
Meng Fang
Shuming Shi
Dacheng Tao
24
48
0
21 Apr 2022
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models
Sunit Bhattacharya
Rishu Kumar
Ondrej Bojar
13
2
0
11 Apr 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Ryokan Ri
Yoshimasa Tsuruoka
29
25
0
19 Mar 2022
Block-Recurrent Transformers
DeLesley S. Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
18
94
0
11 Mar 2022
Effective Cross-Utterance Language Modeling for Conversational Speech Recognition
Bi-Cheng Yan
Hsin-Wei Wang
Shih-Hsuan Chiu
Hsuan-Sheng Chiu
Berlin Chen
15
1
0
05 Nov 2021
Coherence boosting: When your pretrained language model is not paying enough attention
Nikolay Malkin
Zhen Wang
Nebojsa Jojic
RALM
19
35
0
15 Oct 2021
A surprisal--duration trade-off across and within the world's languages
Tiago Pimentel
Clara Meister
Elizabeth Salesky
Simone Teufel
Damián E. Blasi
Ryan Cotterell
LRM
109
29
0
30 Sep 2021
Do Long-Range Language Models Actually Use Long-Range Context?
Simeng Sun
Kalpesh Krishna
Andrew Mattarella-Micke
Mohit Iyyer
RALM
25
80
0
19 Sep 2021
Studying word order through iterative shuffling
Nikolay Malkin
Sameera Lanka
Pranav Goel
Nebojsa Jojic
31
14
0
10 Sep 2021
What Context Features Can Transformer Language Models Use?
J. O'Connor
Jacob Andreas
KELM
21
75
0
15 Jun 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
30
10
0
13 Jun 2021
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Ashish Shenoy
S. Bodapati
Monica Sunkara
S. Ronanki
Katrin Kirchhoff
21
21
0
21 Apr 2021
Context Dependent Semantic Parsing: A Survey
Zhuang Li
Lizhen Qu
Gholamreza Haffari
16
19
0
02 Nov 2020
Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling
Wenxuan Zhou
Kevin Huang
Tengyu Ma
Jing Huang
18
273
0
21 Oct 2020
Multi-timescale Representation Learning in LSTM Language Models
Shivangi Mahto
Vy A. Vo
Javier S. Turek
Alexander G. Huth
15
29
0
27 Sep 2020
Towards Full-line Code Completion with Neural Language Models
Wenhan Wang
Sijie Shen
Ge Li
Zhi Jin
16
16
0
18 Sep 2020
Recurrent Quantum Neural Networks
Johannes Bausch
21
151
0
25 Jun 2020
Phonotactic Complexity and its Trade-offs
Tiago Pimentel
Brian Roark
Ryan Cotterell
17
37
0
07 May 2020
Temporal Convolutional Attention-based Network For Sequence Modeling
Hongyan Hao
Yan Wang
Siqiao Xue
Yudi Xia
Jian Zhao
S. Furao
16
41
0
28 Feb 2020
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
34
48
0
13 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
45
1,580
0
11 Jun 2019
Guiding Extractive Summarization with Question-Answering Rewards
Kristjan Arumae
Fei Liu
31
33
0
04 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
40
717
0
21 Mar 2019
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
17
145
0
15 Oct 2018
Information-Weighted Neural Cache Language Models for ASR
Lyan Verwimp
J. Pelemans
Hugo Van hamme
P. Wambacq
KELM
RALM
9
2
0
24 Sep 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
37
668
0
21 Sep 2018
On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition
Hao Tang
James R. Glass
10
19
0
09 Jul 2018
1