The NarrativeQA Reading Comprehension Challenge

Transactions of the Association for Computational Linguistics (TACL), 2017
19 December 2017
Tomás Kociský
Jonathan Richard Schwarz
Phil Blunsom
Chris Dyer
Karl Moritz Hermann
Gábor Melis
Edward Grefenstette

Papers citing "The NarrativeQA Reading Comprehension Challenge"

50 / 546 papers shown
Reasoning Models are Test Exploiters: Rethinking Multiple-Choice
Narun K. Raman
Taylor Lundy
Kevin Leyton-Brown
ELM, LRM
214
3
0
21 Jul 2025
FlexOlmo: Open Language Models for Flexible Data Use
Weijia Shi
Akshita Bhagia
Kevin Farhat
Niklas Muennighoff
Pete Walsh
...
Luke Zettlemoyer
Pang Wei Koh
Hannaneh Hajishirzi
Ali Farhadi
Sewon Min
MoE
399
4
0
09 Jul 2025
RAT: Bridging RNN Efficiency and Attention Accuracy via Chunk-based Sequence Modeling
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Xiuying Wei
Anunay Yadav
Razvan Pascanu
Çağlar Gülçehre
AI4TS
262
0
0
06 Jul 2025
Language Models Might Not Understand You: Evaluating Theory of Mind via Story Prompting
Nathaniel Getachew
Abulhair Saparov
LRM
163
0
0
23 Jun 2025
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Adithya Bhaskar
Alexander Wettig
Tianyu Gao
Yihe Dong
Danqi Chen
178
5
0
20 Jun 2025
EvolvTrip: Enhancing Literary Character Understanding with Temporal Theory-of-Mind Graphs
Bohao Yang
Hainiu Xu
Jinhua Du
Ze Li
Petr Slovak
Chenghua Lin
158
0
0
16 Jun 2025
AbsenceBench: Language Models Can't Tell What's Missing
Harvey Yiyun Fu
Aryan Shrivastava
Jared Moore
Peter West
Chenhao Tan
Ari Holtzman
RALM
215
4
0
13 Jun 2025
Brevity is the soul of sustainability: Characterizing LLM response lengths
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
S. Poddar
Paramita Koley
Janardan Misra
Sanjay Podder
Navveen Balani
Niloy Ganguly
Saptarshi Ghosh
244
5
0
10 Jun 2025
Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation
Giacomo Baldan
Qiang Liu
Alberto Guardone
Nils Thuerey
AI4CE
205
6
0
10 Jun 2025
Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
Haoyu Wang
Peihao Wang
Mufei Li
Shikun Liu
Siqi Miao
Zinan Lin
P. Li
200
3
0
09 Jun 2025
Advancing Question Generation with Joint Narrative and Difficulty Control
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Bernardo Leite
Henrique Lopes Cardoso
142
0
0
07 Jun 2025
Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey
Jiachen Zhu
Menghui Zhu
Renting Rui
Rong Shan
Congmin Zheng
...
Jianghao Lin
Weiwen Liu
Ruiming Tang
Yong Yu
Weinan Zhang
LLMAG, ELM
297
7
0
06 Jun 2025
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
Alex Laitenberger
Christopher D. Manning
Nelson F. Liu
RALM
226
3
0
04 Jun 2025
TracLLM: A Generic Framework for Attributing Long Context LLMs
Yanting Wang
Wei Zou
Runpeng Geng
Jinyuan Jia
LLMAG
514
4
0
04 Jun 2025
Adaptive Two Sided Laplace Transforms: A Learnable, Interpretable, and Scalable Replacement for Self-Attention
Andrew Kiruluta
154
0
0
01 Jun 2025
Dynamic Chunking and Selection for Reading Comprehension of Ultra-Long Context in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Boheng Sheng
Jiacheng Yao
Meicong Zhang
Guoxiu He
RALM
234
5
0
01 Jun 2025
Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
Max Conti
Manuel Faysse
Gautier Viaud
Antoine Bosselut
Céline Hudelot
Pierre Colombo
289
6
0
30 May 2025
What Has Been Lost with Synthetic Evaluation?
Alexander Gill
Abhilasha Ravichander
Ana Marasović
ELM
362
0
0
28 May 2025
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Sibo Xiao
Zixin Lin
Wenyang Gao
Hui Chen
Yue Zhang
LLMAG
367
4
0
27 May 2025
ReadBench: Measuring the Dense Text Visual Reading Ability of Vision-Language Models
Benjamin Clavié
Florian Brand
VLM, CoGe
233
1
0
25 May 2025
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Fanqi Wan
Weizhou Shen
Shengyi Liao
Yingcheng Shi
Chenliang Li
Ziyi Yang
Ji Zhang
Fei Huang
Jingren Zhou
Ming Yan
OffRL, LLMAG, ReLM, LRM
425
13
0
23 May 2025
PaTH Attention: Position Encoding via Accumulating Householder Transformations
Songlin Yang
Yikang Shen
Kaiyue Wen
Shawn Tan
Mayank Mishra
Liliang Ren
Rameswar Panda
Yoon Kim
887
12
0
22 May 2025
NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts
Abhay Gupta
Michael Lu
Kevin Zhu
Sean O'Brien
LRM
313
0
0
20 May 2025
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice
Information Fusion (Inf. Fusion), 2025
Zhi Liu
Tao Yang
Jing Wang
Yexin Chen
Zhan Gao
...
Xiaochen Li
Changyong Luo
Yan Li
Xiaohong Gu
Peng Cao
LM&MA
107
2
0
19 May 2025
SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization
Huashan Sun
Shengyi Liao
Yansen Han
Yu Bai
Yang Gao
...
Weizhou Shen
Fanqi Wan
Ming Yan
J.N. Zhang
Fei Huang
586
3
0
16 May 2025
Sparse Attention Remapping with Clustering for Efficient LLM Decoding on PIM
Zehao Fan
Garrett Gagnon
Zhenyu Liu
Liu Liu
284
0
0
09 May 2025
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection
Deanna Emery
Michael Goitia
Freddie Vargus
Iulia Neagu
HILM, VLM
325
2
0
01 May 2025
Rethinking Memory in LLM based Agents: Representations, Operations, and Emerging Topics
Yiming Du
Wenyu Huang
Danna Zheng
Zhaowei Wang
Sébastien Montella
Mirella Lapata
Kam-Fai Wong
Jeff Z. Pan
KELM, MU
671
17
0
01 May 2025
EnronQA: Towards Personalized RAG over Private Documents
Michael J. Ryan
Danmei Xu
Chris Nivera
Daniel Campos
SILM
353
6
0
01 May 2025
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams
Yongxuan Wu
Runyu Chen
Peiyu Liu
Hongjin Qian
RALM
373
1
0
24 Apr 2025
Long Context In-Context Compression by Getting to the Gist of Gisting
Aleksandar Petrov
Mark Sandler
A. Zhmoginov
Nolan Miller
Max Vladymyrov
315
3
0
11 Apr 2025
Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Xinyu Wang
Linrui Ma
Jerry Huang
Peng Lu
Prasanna Parthasarathi
Xiao-Wen Chang
Boxing Chen
Yufei Cui
KELM
441
3
0
28 Mar 2025
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAG, ELM
509
84
0
20 Mar 2025
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Han Wang
Kai Hu
Liangcai Gao
635
2
0
20 Mar 2025
Tuning LLMs by RAG Principles: Towards LLM-native Memory
Jiale Wei
Shuchi Wu
Ruochen Liu
Xiang Ying
Jingbo Shang
Fangbo Tao
RALM
241
1
0
20 Mar 2025
GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments
M. Vu
Gerald Ebmer
Alexander Watcher
Marc-Philip Ecker
Giang Nguyen
Tobias Glueck
280
5
0
18 Mar 2025
A Survey on Transformer Context Extension: Approaches and Evaluation
Yijun Liu
Jinzheng Yu
Yang Xu
Zhongyang Li
Qingfu Zhu
LLMAG
520
12
0
17 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
378
5
0
14 Mar 2025
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning
International Conference on Learning Representations (ICLR), 2025
Hao Cui
Zahra Shamsi
Gowoon Cheon
Xuejian Ma
Shutong Li
...
Eun-Ah Kim
M. Brenner
Viren Jain
Sameera Ponda
Subhashini Venugopalan
ELM, LRM
478
26
0
14 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Tong Xu
3DV
374
39
0
11 Mar 2025
Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
R. Xu
Mingyu Wang
Xintao Wang
Dakuan Lu
Jue Chen
Wei Chu
Yinghui Xu
LRM, LLMAG
365
6
0
11 Mar 2025
Training Plug-n-Play Knowledge Modules with Deep Context Distillation
Lucas Caccia
Alan Ansell
Edoardo Ponti
Ivan Vulić
Alessandro Sordoni
SyDa
1.1K
4
0
11 Mar 2025
DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation
Ming Wang
Fang Wang
Minghao Hu
Li He
Haiyang Wang
...
Li Li
Zhunchen Luo
Wei Luo
Xiaoying Bai
Guotong Geng
315
1
0
10 Mar 2025
MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark
Shengkun Ma
Hao Peng
Lei Hou
Juanzi Li
ELM
249
1
0
10 Mar 2025
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jianghao Chen
Junhong Wu
Yangyifan Xu
J.N. Zhang
359
9
0
04 Mar 2025
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants
Franck Cappello
Sandeep Madireddy
Robert Underwood
N. Getty
Nicholas Chia
...
M. Rafique
Eliu A. Huerta
Yangqiu Song
Ian Foster
Rick L. Stevens
352
3
0
27 Feb 2025
Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Shuliang Liu
Xinze Li
Zhenghao Liu
Shi Yu
Cheng Yang
Zheni Zeng
Zhiyuan Liu
Maosong Sun
Ge Yu
RALM
470
5
0
26 Feb 2025
Towards Threshold-Free KV Cache Pruning
Xuanfan Ni
Liyan Xu
Chenyang Lyu
Longyue Wang
Mo Yu
Lemao Liu
Fandong Meng
Jie Zhou
Piji Li
352
0
0
24 Feb 2025
Self-Taught Agentic Long Context Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yufan Zhuang
Xiaodong Yu
Jialian Wu
Xingwu Sun
Zihan Wang
Jiang Liu
Yusheng Su
Jingbo Shang
Zicheng Liu
Emad Barsoum
LRM
340
2
0
21 Feb 2025
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Abdelrahman Abdallah
Bhawna Piryani
Jamshid Mozafari
Mohammed Ali
Adam Jatowt
951
5
0
21 Feb 2025