Towards Ecologically Valid Research on Language User Interfaces

28 July 2020
Harm de Vries
Dzmitry Bahdanau
Christopher D. Manning

Papers citing "Towards Ecologically Valid Research on Language User Interfaces"

39 / 39 papers shown
The Collaboration Gap
Tim R. Davidson
Adam Fourney
Saleema Amershi
Robert West
Eric Horvitz
Ece Kamar
155
3
0
04 Nov 2025
Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos
Eda B. Özyiğit
ObjD
501
4
0
12 Sep 2025
MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
Tomer Wolfson
H. Trivedi
Mor Geva
Yoav Goldberg
Dan Roth
Tushar Khot
Ashish Sabharwal
Reut Tsarfaty
RALM
LRM
348
16
0
15 Aug 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
Siddartha Devic
Tejas Srinivasan
Jesse Thomason
Willie Neiswanger
237
13
0
09 Jun 2025
Societal Impacts Research Requires Benchmarks for Creative Composition Tasks
Judy Hanwen Shen
Carlos Guestrin
729
3
0
09 Apr 2025
Browsing Lost Unformed Recollections: A Benchmark for Tip-of-the-Tongue Search and Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sky CH-Wang
Darshan Deshpande
Smaranda Muresan
Anand Kannappan
Rebecca Qian
396
7
0
24 Mar 2025
Toward an Evaluation Science for Generative AI Systems
Laura Weidinger
Deb Raji
Hanna M. Wallach
Margaret Mitchell
Angelina Wang
Olawale Salaudeen
Rishi Bommasani
Sayash Kapoor
Deep Ganguli
Sanmi Koyejo
EGVM
ELM
453
37
0
07 Mar 2025
Do Text-to-Vis Benchmarks Test Real Use of Visualisations?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hy Nguyen
Xuefei He
Andrew Reeson
Cecile Paris
Josiah Poon
Jonathan K. Kummerfeld
263
0
0
29 Jul 2024
Benchmarks as Microscopes: A Call for Model Metrology
Michael Stephen Saxon
Ari Holtzman
Peter West
William Y. Wang
Naomi Saphra
362
32
0
22 Jul 2024
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Jinyang Li
Nan Huo
Yan Gao
Jiayi Shi
Yingxiu Zhao
Ge Qu
Yurong Wu
Chenhao Ma
Jian-Guang Lou
Reynold Cheng
LLMAG
241
13
0
08 Mar 2024
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning
Tejas Srinivasan
Jack Hessel
Tanmay Gupta
Bill Yuchen Lin
Yejin Choi
Jesse Thomason
Khyathi Chandu
452
20
0
23 Feb 2024
KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh
Winnie Xu
Niklas Muennighoff
Dan Jurafsky
Douwe Kiela
1.2K
917
0
02 Feb 2024
Do Androids Know They're Only Dreaming of Electric Sheep?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sky CH-Wang
Benjamin Van Durme
Jason Eisner
Chris Kedzie
HILM
314
67
0
28 Dec 2023
FinanceBench: A New Benchmark for Financial Question Answering
Pranab Islam
Anand Kannappan
Douwe Kiela
Rebecca Qian
Nino Scherrer
Bertie Vidgen
RALM
396
181
0
20 Nov 2023
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Georgios Pantazopoulos
Malvina Nikandrou
Amit Parekh
Bhathiya Hemanthage
Arash Eshghi
Ioannis Konstas
Verena Rieser
Oliver Lemon
Alessandro Suglia
LM&Ro
259
10
0
07 Nov 2023
On Degrees of Freedom in Defining and Testing Natural Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Saku Sugawara
S. Tsugita
ELM
379
2
0
24 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Anuj Kumar
Ziyu Yao
346
23
0
14 May 2023
The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Xing Han Lù
Siva Reddy
Harm de Vries
LMTD
284
7
0
03 Apr 2023
JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ruth-Ann Armstrong
John Hewitt
Christopher D. Manning
334
17
0
07 Dec 2022
Can In-context Learners Learn a Reasoning Concept from Demonstrations?
Michal Štefánik
Marek Kadlčík
LRM
403
7
0
03 Dec 2022
Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Daniel Fried
Nicholas Tomlin
Jennifer Hu
Roma Patel
Aida Nematzadeh
290
11
0
15 Nov 2022
Going for GOAL: A Resource for Grounded Football Commentaries
Alessandro Suglia
José Lopes
E. Bastianelli
Andrea Vanzo
Shubham Agarwal
Malvina Nikandrou
Lu Yu
Ioannis Konstas
Verena Rieser
179
9
0
08 Nov 2022
Fighting FIRe with FIRE: Assessing the Validity of Text-to-Video Retrieval Benchmarks
Findings (Findings), 2022
Pedro Rodriguez
Mahmoud Azab
Becka Silvert
Renato Sanchez
Linzy Labson
Hardik Shah
Seungwhan Moon
273
2
0
10 Oct 2022
Don't Copy the Teacher: Data and Model Challenges in Embodied Dialogue
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
So Yeon Min
Hao Zhu
Ruslan Salakhutdinov
Yonatan Bisk
LM&Ro
419
14
0
10 Oct 2022
Evaluation Gaps in Machine Learning Practice
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Ben Hutchinson
Negar Rostamzadeh
Christina Greer
Katherine A. Heller
Vinodkumar Prabhakaran
ELM
409
82
0
11 May 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianbao Xie
Chen Henry Wu
Peng Shi
Ruiqi Zhong
Torsten Scholak
...
Lingpeng Kong
Rui Zhang
Noah A. Smith
Luke Zettlemoyer
Tao Yu
LMTD
429
351
0
16 Jan 2022
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
354
49
0
18 Oct 2021
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Chia-Hsuan Lee
Oleksandr Polozov
Matthew Richardson
LMTD
RALM
329
141
0
22 Jun 2021
Targeted Data Acquisition for Evolving Negotiation Agents
International Conference on Machine Learning (ICML), 2021
Minae Kwon
Siddharth Karamcheti
Mariano-Florentino Cuéllar
Dorsa Sadigh
365
7
0
14 Jun 2021
Maintaining Common Ground in Dynamic Environments
Transactions of the Association for Computational Linguistics (TACL), 2021
Takuma Udagawa
Akiko Aizawa
191
16
0
29 May 2021
Conversational AI Systems for Social Good: Opportunities and Challenges
Peng Qi
Jing Huang
Youzheng Wu
Xiaodong He
Bowen Zhou
288
5
0
13 May 2021
Dynabench: Rethinking Benchmarking in NLP
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
444
501
0
07 Apr 2021
DynaSent: A Dynamic Benchmark for Sentiment Analysis
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Christopher Potts
Zhengxuan Wu
Atticus Geiger
Douwe Kiela
573
86
0
30 Dec 2020
Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL
Yusen Zhang
Xiangyu Dong
Shuaichen Chang
Tao Yu
Peng Shi
Rui Zhang
OOD
229
23
0
23 Oct 2020
STAR: A Schema-Guided Dialog Dataset for Transfer Learning
Johannes E. M. Mosig
Shikib Mehri
Thomas Kober
331
49
0
22 Oct 2020
Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI
Alon Jacovi
Ana Marasović
Tim Miller
Yoav Goldberg
767
591
0
15 Oct 2020
Deploying Lifelong Open-Domain Dialogue Learning
Kurt Shuster
Jack Urbanek
Emily Dinan
Arthur Szlam
Jason Weston
263
24
0
18 Aug 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
662
422
0
21 Apr 2020
The Transformative Potential of Artificial Intelligence
Ross Gruetzemacher
Jess Whittlestone
309
171
0
27 Nov 2019