ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.04623
  4. Cited By
Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

12 May 2018
Urvashi Khandelwal
He He
Peng Qi
Dan Jurafsky
    RALM
ArXivPDFHTML

Papers citing "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"

44 / 44 papers shown
Title
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language
  Models for Healthcare
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare
Pardis Sadat Zahraei
Zahra Shakeri
LM&MA
26
0
0
09 Oct 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large
  Vocabulary
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary
Takashi Morita
16
3
0
31 Jan 2024
Revisiting Topic-Guided Language Models
Revisiting Topic-Guided Language Models
Carolina Zheng
Keyon Vafa
David M. Blei
BDL
27
1
0
04 Dec 2023
Generative Models as a Complex Systems Science: How can we make sense of
  large language model behavior?
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?
Ari Holtzman
Peter West
Luke Zettlemoyer
AI4CE
30
14
0
31 Jul 2023
Lost in the Middle: How Language Models Use Long Contexts
Lost in the Middle: How Language Models Use Long Contexts
Nelson F. Liu
Kevin Lin
John Hewitt
Ashwin Paranjape
Michele Bevilacqua
Fabio Petroni
Percy Liang
RALM
38
1,403
0
06 Jul 2023
Revisiting Entropy Rate Constancy in Text
Revisiting Entropy Rate Constancy in Text
Vivek Verma
Nicholas Tomlin
Dan Klein
16
4
0
20 May 2023
HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time
  Series Forecasting
HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time Series Forecasting
Ryo Umagami
Yusuke Ono
Yusuke Mukuta
Tatsuya Harada
AI4TS
37
3
0
14 May 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
29
506
0
07 Mar 2023
Understanding and Detecting Hallucinations in Neural Machine Translation
  via Model Introspection
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Weijia Xu
Sweta Agrawal
Eleftheria Briakou
Marianna J. Martindale
Marine Carpuat
HILM
24
46
0
18 Jan 2023
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
40
44
0
20 Dec 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue
  Systems
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
Songbo Hu
Ivan Vulić
Fangyu Liu
Anna Korhonen
34
0
0
07 Nov 2022
Circling Back to Recurrent Models of Language
Circling Back to Recurrent Models of Language
Gábor Melis
32
0
0
03 Nov 2022
On the Explainability of Natural Language Processing Deep Models
On the Explainability of Natural Language Processing Deep Models
Julia El Zini
M. Awad
27
82
0
13 Oct 2022
Dynamic Global Memory for Document-level Argument Extraction
Dynamic Global Memory for Document-level Argument Extraction
Xinya Du
Sha Li
Heng Ji
13
37
0
18 Sep 2022
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot
  Learning for Human-AI Interaction in Creative Applications of Generative
  Models
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot Learning for Human-AI Interaction in Creative Applications of Generative Models
Hai Dang
Lukas Mecke
Florian Lehmann
Sven Goller
Daniel Buschek
20
97
0
03 Sep 2022
AA-Forecast: Anomaly-Aware Forecast for Extreme Events
AA-Forecast: Anomaly-Aware Forecast for Extreme Events
Ashkan Farhangi
Jiang Bian
Arthur Huang
Haoyi Xiong
Jun Wang
Zhi-guo Guo
AI4TS
26
4
0
21 Aug 2022
RankGen: Improving Text Generation with Large Ranking Models
RankGen: Improving Text Generation with Large Ranking Models
Kalpesh Krishna
Yapei Chang
John Wieting
Mohit Iyyer
AIMat
21
68
0
19 May 2022
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue
  Generation
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue Generation
Yu Cao
Wei Bi
Meng Fang
Shuming Shi
Dacheng Tao
24
48
0
21 Apr 2022
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe
  for predicting Eye-Tracking features using Pretrained Language Models
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models
Sunit Bhattacharya
Rishu Kumar
Ondrej Bojar
13
2
0
11 Apr 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in
  Language Models
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Ryokan Ri
Yoshimasa Tsuruoka
29
25
0
19 Mar 2022
Block-Recurrent Transformers
Block-Recurrent Transformers
DeLesley S. Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
18
94
0
11 Mar 2022
Effective Cross-Utterance Language Modeling for Conversational Speech
  Recognition
Effective Cross-Utterance Language Modeling for Conversational Speech Recognition
Bi-Cheng Yan
Hsin-Wei Wang
Shih-Hsuan Chiu
Hsuan-Sheng Chiu
Berlin Chen
15
1
0
05 Nov 2021
Coherence boosting: When your pretrained language model is not paying
  enough attention
Coherence boosting: When your pretrained language model is not paying enough attention
Nikolay Malkin
Zhen Wang
Nebojsa Jojic
RALM
19
35
0
15 Oct 2021
A surprisal--duration trade-off across and within the world's languages
A surprisal--duration trade-off across and within the world's languages
Tiago Pimentel
Clara Meister
Elizabeth Salesky
Simone Teufel
Damián E. Blasi
Ryan Cotterell
LRM
109
29
0
30 Sep 2021
Do Long-Range Language Models Actually Use Long-Range Context?
Do Long-Range Language Models Actually Use Long-Range Context?
Simeng Sun
Kalpesh Krishna
Andrew Mattarella-Micke
Mohit Iyyer
RALM
25
80
0
19 Sep 2021
Studying word order through iterative shuffling
Studying word order through iterative shuffling
Nikolay Malkin
Sameera Lanka
Pranav Goel
Nebojsa Jojic
31
14
0
10 Sep 2021
What Context Features Can Transformer Language Models Use?
What Context Features Can Transformer Language Models Use?
J. O'Connor
Jacob Andreas
KELM
21
75
0
15 Jun 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional
  Networks for Conversational Speech Recognition
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
30
10
0
13 Jun 2021
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Ashish Shenoy
S. Bodapati
Monica Sunkara
S. Ronanki
Katrin Kirchhoff
21
21
0
21 Apr 2021
Context Dependent Semantic Parsing: A Survey
Context Dependent Semantic Parsing: A Survey
Zhuang Li
Lizhen Qu
Gholamreza Haffari
16
19
0
02 Nov 2020
Document-Level Relation Extraction with Adaptive Thresholding and
  Localized Context Pooling
Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling
Wenxuan Zhou
Kevin Huang
Tengyu Ma
Jing Huang
18
273
0
21 Oct 2020
Multi-timescale Representation Learning in LSTM Language Models
Multi-timescale Representation Learning in LSTM Language Models
Shivangi Mahto
Vy A. Vo
Javier S. Turek
Alexander G. Huth
15
29
0
27 Sep 2020
Towards Full-line Code Completion with Neural Language Models
Towards Full-line Code Completion with Neural Language Models
Wenhan Wang
Sijie Shen
Ge Li
Zhi Jin
16
16
0
18 Sep 2020
Recurrent Quantum Neural Networks
Recurrent Quantum Neural Networks
Johannes Bausch
21
151
0
25 Jun 2020
Phonotactic Complexity and its Trade-offs
Phonotactic Complexity and its Trade-offs
Tiago Pimentel
Brian Roark
Ryan Cotterell
17
37
0
07 May 2020
Temporal Convolutional Attention-based Network For Sequence Modeling
Temporal Convolutional Attention-based Network For Sequence Modeling
Hongyan Hao
Yan Wang
Siqiao Xue
Yudi Xia
Jian Zhao
S. Furao
16
41
0
28 Feb 2020
Lattice Transformer for Speech Translation
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
34
48
0
13 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
45
1,580
0
11 Jun 2019
Guiding Extractive Summarization with Question-Answering Rewards
Guiding Extractive Summarization with Question-Answering Rewards
Kristjan Arumae
Fei Liu
31
33
0
04 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
40
717
0
21 Mar 2019
Trellis Networks for Sequence Modeling
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
17
145
0
15 Oct 2018
Information-Weighted Neural Cache Language Models for ASR
Information-Weighted Neural Cache Language Models for ASR
Lyan Verwimp
J. Pelemans
Hugo Van hamme
P. Wambacq
KELM
RALM
9
2
0
24 Sep 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
37
668
0
21 Sep 2018
On Training Recurrent Networks with Truncated Backpropagation Through
  Time in Speech Recognition
On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition
Hao Tang
James R. Glass
10
19
0
09 Jul 2018
1