arXiv:1909.03065
"Going on a vacation" takes longer than "Going for a walk": A Study of Temporal Commonsense Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
6 September 2019
Ben Zhou
Daniel Khashabi
Qiang Ning
Dan Roth
AIMat
Papers citing "'Going on a vacation' takes longer than 'Going for a walk': A Study of Temporal Commonsense Understanding"
50 / 134 papers shown
Structured yet Bounded Temporal Understanding in Large Language Models
Damin Zhang
Julia Taylor Rayz
217
0
0
19 Oct 2025
Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling
Federico Tiblias
Irina Bigoulaeva
Jingcheng Niu
Simone Balloccu
Iryna Gurevych
LRM
211
1
0
01 Oct 2025
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
Lisa Alazraki
Lihu Chen
Ana Brassard
Joe Stacey
Hossein A. Rahmani
Marek Rei
CoGe
LRM
204
1
0
27 Aug 2025
TComQA: Extracting Temporal Commonsense from Text
Lekshmi R Nair
Arun Sankar
Koninika Pal
RALM
135
0
0
21 Aug 2025
MobQA: A Benchmark Dataset for Semantic Understanding of Human Mobility Data through Question Answering
Hikaru Asano
Hiroki Ouchi
Akira Kasuga
Ryo Yonetani
213
2
0
15 Aug 2025
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
Maggie Huan
Yuetai Li
Tuney Zheng
Xiaoyu Xu
Seungone Kim
Minxin Du
Radha Poovendran
Graham Neubig
Xiang Yue
LRM
ELM
254
74
0
01 Jul 2025
Chaining Event Spans for Temporal Relation Grounding
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2025
Jongho Kim
Dohyeon Lee
Minsoo Kim
Seung-won Hwang
212
0
0
17 Jun 2025
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents
Seongbo Jang
Minjin Jeon
Jaehoon Lee
Seonghyeon Lee
Dongha Lee
Hwanjo Yu
409
0
0
17 Jun 2025
LexTime: A Benchmark for Temporal Ordering of Legal Events
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Claire Barale
Leslie Barrett
Vikram Sunil Bajaj
Michael Rovatsos
AILaw
392
4
0
04 Jun 2025
Around the World in 24 Hours: Probing LLM Knowledge of Time and Place
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Carolin Holtermann
Paul Röttger
Anne Lauscher
LRM
341
6
0
04 Jun 2025
CrossICL: Cross-Task In-Context Learning via Unsupervised Demonstration Transfer
Jinglong Gao
Xiao Ding
Lingxiao Zou
Bing Qin
Ting Liu
272
0
0
30 May 2025
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling
Silvia Cappelletti
Tobia Poppi
Samuele Poppi
Zheng-Xin Yong
Diego Garcia-Olano
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
KELM
LRM
246
1
0
21 May 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
Xuzhao Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Qi Zhang
Tat-Seng Chua
Tianwei Zhang
ALM
ELM
595
30
0
26 Apr 2025
Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Adrián Bazaga
Rexhina Blloshmi
Bill Byrne
Adria de Gispert
ReLM
LRM
506
8
0
07 Apr 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
393
2
0
31 Mar 2025
Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sharan Maiya
Yinhong Liu
Ramit Debnath
Anna Korhonen
423
4
0
22 Mar 2025
A Study into Investigating Temporal Robustness of LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jonas Wallat
Abdelrahman Abdallah
Adam Jatowt
Avishek Anand
312
12
0
21 Mar 2025
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni
Joshua Engels
Senthooran Rajamanoharan
Max Tegmark
Neel Nanda
472
74
0
23 Feb 2025
Counterfactual-Consistency Prompting for Relative Temporal Understanding in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jongho Kim
Seung-won Hwang
LRM
AI4CE
565
3
0
17 Feb 2025
TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yubin Ge
Salvatore Romeo
Jason (Jinglun) Cai
Raphael Shu
Monica Sunkara
Yassine Benajiba
Yi Zhang
LLMAG
819
18
0
03 Feb 2025
Weak-to-Strong Generalization Through the Data-Centric Lens
International Conference on Learning Representations (ICLR), 2024
Changho Shin
John Cooper
Frederic Sala
581
14
0
05 Dec 2024
ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
LM&Ro
267
2
0
17 Oct 2024
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models
ACM Multimedia (MM), 2024
Haoxuan Li
Zhengmao Yang
Yunshan Ma
Yi Bin
Yang Yang
Tat-Seng Chua
273
8
0
08 Aug 2024
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
He Chang
Chenchen Ye
Zhulin Tao
Jie Wu
Zhengmao Yang
Yunshan Ma
Xianglin Huang
Tat-Seng Chua
AI4TS
372
12
0
16 Jul 2024
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization
Md Nayem Uddin
Amir Saeidi
Divij Handa
Agastya Seth
Tran Cao Son
Eduardo Blanco
Steven Corman
Chitta Baral
574
17
0
03 Jul 2024
Timo: Towards Better Temporal Reasoning for Language Models
Zhaochen Su
Jun Zhang
Tong Zhu
Xiaoye Qu
Juntao Li
Min Zhang
Yu Cheng
LRM
333
30
0
20 Jun 2024
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox
Yijun Liu
Yuan Meng
Fang Wu
Shenhao Peng
Hang Yao
Chaoyu Guan
Chen Tang
Cheng Wang
Zhi Wang
Wenwu Zhu
MQ
392
9
0
15 Jun 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su
Juntao Li
Jun Zhang
Tong Zhu
Xiaoye Qu
Pan Zhou
Yan Bowen
Yu Cheng
Min Zhang
LRM
303
32
0
13 Jun 2024
Scaling and evaluating sparse autoencoders
Leo Gao
Tom Dupré la Tour
Henk Tillman
Gabriel Goh
Rajan Troll
Alec Radford
Ilya Sutskever
Jan Leike
Jeffrey Wu
326
387
0
06 Jun 2024
A Comprehensive Evaluation on Event Reasoning of Large Language Models
Zhengwei Tao
Zhi Jin
Yifan Zhang
Xiancai Chen
Xiaoying Bai
Yue Fang
Haiyan Zhao
Jia Li
Chongyang Tao
LRM
279
8
0
26 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL
KELM
LRM
475
230
0
25 Apr 2024
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Mihir Parmar
Nisarg Patel
Neeraj Varshney
Mutsumi Nakamura
Man Luo
Santosh Mashetty
Arindam Mitra
Chitta Baral
LRM
ReLM
ELM
628
75
0
23 Apr 2024
EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Zhengwei Tao
Xiancai Chen
Zhi Jin
Xiaoying Bai
Haiyan Zhao
Yiwei Lou
310
7
0
18 Apr 2024
AcTED: Automatic Acquisition of Typical Event Duration for Semi-supervised Temporal Commonsense QA
Felix Giovanni Virgo
Fei Cheng
L. Pereira
Masayuki Asahara
Ichiro Kobayashi
Sadao Kurohashi
181
0
0
27 Mar 2024
Formulation Comparison for Timeline Construction using LLMs
Kimihiro Hasegawa
Nikhil Kandukuri
Susan Holm
Yukari Yamakawa
Teruko Mitamura
341
2
0
01 Mar 2024
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models
Yinghui Li
Qingyu Zhou
Yuanzhen Luo
Shirong Ma
Yangning Li
Hai-Tao Zheng
Xuming Hu
Philip S. Yu
LRM
379
28
0
16 Feb 2024
Large Language Models Can Learn Temporal Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Siheng Xiong
Ali Payani
Ramana Rao Kompella
Faramarz Fekri
LRM
636
170
0
12 Jan 2024
Temporal Validity Change Prediction
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Georg Wenzel
Adam Jatowt
312
1
0
01 Jan 2024
CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models
Dan Shi
Chaobin You
Jian-Tao Huang
Taihao Li
Deyi Xiong
LRM
251
2
0
20 Dec 2023
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld
Anas Awadalla
Iz Beltagy
Akshita Bhagia
Ian H. Magnusson
Hao Peng
Oyvind Tafjord
Pete Walsh
Kyle Richardson
Jesse Dodge
291
2
0
15 Dec 2023
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
International Conference on Machine Learning (ICML), 2023
Collin Burns
Pavel Izmailov
Jan Hendrik Kirchner
Bowen Baker
Leo Gao
...
Adrien Ecoffet
Manas Joglekar
Jan Leike
Ilya Sutskever
Jeff Wu
ELM
524
434
0
14 Dec 2023
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Haotian Wang
Ming Liu
Bing Qin
LRM
ELM
495
34
0
29 Nov 2023
Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning
Qingyu Tan
Hwee Tou Ng
Lidong Bing
267
20
0
16 Nov 2023
Are Large Language Models Temporally Grounded?
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yifu Qiu
Zheng Zhao
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
LRM
394
25
0
14 Nov 2023
MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over Time-Involved Document
Zheng Chu
Zekun Wang
Jiafeng Liang
Ming Liu
Bing Qin
277
2
0
08 Nov 2023
Mind the Gap Between Conversations for Improved Long-Term Dialogue Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
182
11
0
24 Oct 2023
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mete Ismayilzada
Debjit Paul
Syrielle Montariol
Mor Geva
Antoine Bosselut
LRM
343
8
0
23 Oct 2023
How Much Consistency Is Your Accuracy Worth?
Jacob K. Johnson
Ana Marasović
170
1
0
20 Oct 2023
Instructive Dialogue Summarization with Query Aggregations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bin Wang
Zhengyuan Liu
Nancy F. Chen
439
7
0
17 Oct 2023
TRAM: Benchmarking Temporal Reasoning for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuqing Wang
Yun Zhao
LRM
346
28
0
02 Oct 2023