"Going on a vacation" takes longer than "Going for a walk": A Study of Temporal Commonsense Understanding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
6 September 2019
Ben Zhou
Daniel Khashabi
Qiang Ning
Dan Roth
    AIMat

Papers citing ""Going on a vacation" takes longer than "Going for a walk": A Study of Temporal Commonsense Understanding"

50 / 134 papers shown
Structured yet Bounded Temporal Understanding in Large Language Models
Damin Zhang
Julia Taylor Rayz
217
0
0
19 Oct 2025
Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling
Federico Tiblias
Irina Bigoulaeva
Jingcheng Niu
Simone Balloccu
Iryna Gurevych
LRM
211
1
0
01 Oct 2025
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
Lisa Alazraki
Lihu Chen
Ana Brassard
Joe Stacey
Hossein A. Rahmani
Marek Rei
CoGe LRM
204
1
0
27 Aug 2025
TComQA: Extracting Temporal Commonsense from Text
Lekshmi R Nair
Arun Sankar
Koninika Pal
RALM
135
0
0
21 Aug 2025
MobQA: A Benchmark Dataset for Semantic Understanding of Human Mobility Data through Question Answering
Hikaru Asano
Hiroki Ouchi
Akira Kasuga
Ryo Yonetani
213
2
0
15 Aug 2025
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
Maggie Huan
Yuetai Li
Tuney Zheng
Xiaoyu Xu
Seungone Kim
Minxin Du
Radha Poovendran
Graham Neubig
Xiang Yue
LRM ELM
254
74
0
01 Jul 2025
Chaining Event Spans for Temporal Relation Grounding
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2025
Jongho Kim
Dohyeon Lee
Minsoo Kim
Seung-won Hwang
212
0
0
17 Jun 2025
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents
Seongbo Jang
Minjin Jeon
Jaehoon Lee
Seonghyeon Lee
Dongha Lee
Hwanjo Yu
409
0
0
17 Jun 2025
LexTime: A Benchmark for Temporal Ordering of Legal Events
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Claire Barale
Leslie Barrett
Vikram Sunil Bajaj
Michael Rovatsos
AILaw
392
4
0
04 Jun 2025
Around the World in 24 Hours: Probing LLM Knowledge of Time and Place
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Carolin Holtermann
Paul Röttger
Anne Lauscher
LRM
341
6
0
04 Jun 2025
CrossICL: Cross-Task In-Context Learning via Unsupervised Demonstration Transfer
Jinglong Gao
Xiao Ding
Lingxiao Zou
Bing Qin
Ting Liu
272
0
0
30 May 2025
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling
Silvia Cappelletti
Tobia Poppi
Samuele Poppi
Zheng-Xin Yong
Diego Garcia-Olano
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
KELM LRM
246
1
0
21 May 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
Xuzhao Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Qi Zhang
Tat-Seng Chua
Tianwei Zhang
ALM ELM
595
30
0
26 Apr 2025
Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Adrián Bazaga
Rexhina Blloshmi
Bill Byrne
Adria de Gispert
ReLM LRM
506
8
0
07 Apr 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
393
2
0
31 Mar 2025
Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sharan Maiya
Yinhong Liu
Ramit Debnath
Anna Korhonen
423
4
0
22 Mar 2025
A Study into Investigating Temporal Robustness of LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jonas Wallat
Abdelrahman Abdallah
Adam Jatowt
Avishek Anand
312
12
0
21 Mar 2025
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni
Joshua Engels
Senthooran Rajamanoharan
Max Tegmark
Neel Nanda
472
74
0
23 Feb 2025
Counterfactual-Consistency Prompting for Relative Temporal Understanding in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jongho Kim
Seung-won Hwang
LRM AI4CE
565
3
0
17 Feb 2025
TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yubin Ge
Salvatore Romeo
Jason (Jinglun) Cai
Raphael Shu
Monica Sunkara
Yassine Benajiba
Yi Zhang
LLMAG
819
18
0
03 Feb 2025
Weak-to-Strong Generalization Through the Data-Centric Lens
International Conference on Learning Representations (ICLR), 2024
Changho Shin
John Cooper
Frederic Sala
581
14
0
05 Dec 2024
ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
LM&Ro
267
2
0
17 Oct 2024
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models
ACM Multimedia (MM), 2024
Haoxuan Li
Zhengmao Yang
Yunshan Ma
Yi Bin
Yang Yang
Tat-Seng Chua
273
8
0
08 Aug 2024
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
He Chang
Chenchen Ye
Zhulin Tao
Jie Wu
Zhengmao Yang
Yunshan Ma
Xianglin Huang
Tat-Seng Chua
AI4TS
372
12
0
16 Jul 2024
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization
Md Nayem Uddin
Amir Saeidi
Divij Handa
Agastya Seth
Tran Cao Son
Eduardo Blanco
Steven Corman
Chitta Baral
574
17
0
03 Jul 2024
Timo: Towards Better Temporal Reasoning for Language Models
Zhaochen Su
Jun Zhang
Tong Zhu
Xiaoye Qu
Juntao Li
Min Zhang
Yu Cheng
LRM
333
30
0
20 Jun 2024
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox
Yijun Liu
Yuan Meng
Fang Wu
Shenhao Peng
Hang Yao
Chaoyu Guan
Chen Tang
Cheng Wang
Zhi Wang
Wenwu Zhu
MQ
392
9
0
15 Jun 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su
Juntao Li
Jun Zhang
Tong Zhu
Xiaoye Qu
Pan Zhou
Yan Bowen
Yu Cheng
Min Zhang
LRM
303
32
0
13 Jun 2024
Scaling and evaluating sparse autoencoders
Leo Gao
Tom Dupré la Tour
Henk Tillman
Gabriel Goh
Rajan Troll
Alec Radford
Ilya Sutskever
Jan Leike
Jeffrey Wu
326
387
0
06 Jun 2024
A Comprehensive Evaluation on Event Reasoning of Large Language Models
Zhengwei Tao
Zhi Jin
Yifan Zhang
Xiancai Chen
Xiaoying Bai
Yue Fang
Haiyan Zhao
Jia Li
Chongyang Tao
LRM
279
8
0
26 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL KELM LRM
475
230
0
25 Apr 2024
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Mihir Parmar
Nisarg Patel
Neeraj Varshney
Mutsumi Nakamura
Man Luo
Santosh Mashetty
Arindam Mitra
Chitta Baral
LRM ReLM ELM
628
75
0
23 Apr 2024
EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Zhengwei Tao
Xiancai Chen
Zhi Jin
Xiaoying Bai
Haiyan Zhao
Yiwei Lou
310
7
0
18 Apr 2024
AcTED: Automatic Acquisition of Typical Event Duration for Semi-supervised Temporal Commonsense QA
Felix Giovanni Virgo
Fei Cheng
L. Pereira
Masayuki Asahara
Ichiro Kobayashi
Sadao Kurohashi
181
0
0
27 Mar 2024
Formulation Comparison for Timeline Construction using LLMs
Kimihiro Hasegawa
Nikhil Kandukuri
Susan Holm
Yukari Yamakawa
Teruko Mitamura
341
2
0
01 Mar 2024
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models
Yinghui Li
Qingyu Zhou
Yuanzhen Luo
Shirong Ma
Yangning Li
Hai-Tao Zheng
Xuming Hu
Philip S. Yu
LRM
379
28
0
16 Feb 2024
Large Language Models Can Learn Temporal Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Siheng Xiong
Ali Payani
Ramana Rao Kompella
Faramarz Fekri
LRM
636
170
0
12 Jan 2024
Temporal Validity Change Prediction
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Georg Wenzel
Adam Jatowt
312
1
0
01 Jan 2024
CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models
Dan Shi
Chaobin You
Jian-Tao Huang
Taihao Li
Deyi Xiong
LRM
251
2
0
20 Dec 2023
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld
Anas Awadalla
Iz Beltagy
Akshita Bhagia
Ian H. Magnusson
Hao Peng
Oyvind Tafjord
Pete Walsh
Kyle Richardson
Jesse Dodge
291
2
0
15 Dec 2023
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
International Conference on Machine Learning (ICML), 2023
Collin Burns
Pavel Izmailov
Jan Hendrik Kirchner
Bowen Baker
Leo Gao
...
Adrien Ecoffet
Manas Joglekar
Jan Leike
Ilya Sutskever
Jeff Wu
ELM
524
434
0
14 Dec 2023
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Haotian Wang
Ming Liu
Bing Qin
LRM ELM
495
34
0
29 Nov 2023
Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning
Qingyu Tan
Hwee Tou Ng
Lidong Bing
267
20
0
16 Nov 2023
Are Large Language Models Temporally Grounded?
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yifu Qiu
Zheng Zhao
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
LRM
394
25
0
14 Nov 2023
MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over Time-Involved Document
Zheng Chu
Zekun Wang
Jiafeng Liang
Ming Liu
Bing Qin
277
2
0
08 Nov 2023
Mind the Gap Between Conversations for Improved Long-Term Dialogue Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
182
11
0
24 Oct 2023
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mete Ismayilzada
Debjit Paul
Syrielle Montariol
Mor Geva
Antoine Bosselut
LRM
343
8
0
23 Oct 2023
How Much Consistency Is Your Accuracy Worth?
Jacob K. Johnson
Ana Marasović
170
1
0
20 Oct 2023
Instructive Dialogue Summarization with Query Aggregations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bin Wang
Zhengyuan Liu
Nancy F. Chen
439
7
0
17 Oct 2023
TRAM: Benchmarking Temporal Reasoning for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuqing Wang
Yun Zhao
LRM
346
28
0
02 Oct 2023