WildChat: 1M ChatGPT Interaction Logs in the Wild

International Conference on Learning Representations (ICLR), 2024

2 May 2024

Wenting Zhao

Xiang Ren

Jack Hessel

Claire Cardie

Yejin Choi

Yuntian Deng

ArXiv (abs)PDF HTML HuggingFace (63 upvotes)

Papers citing "WildChat: 1M ChatGPT Interaction Logs in the Wild"

50 / 235 papers shown

HealthBench: Evaluating Large Language Models Towards Improved Human Health

Joaquin Quiñonero Candela

...

296

127

13 May 2025

Defending against Indirect Prompt Injection by Instruction Detection

325

08 May 2025

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Tianjian Li

Daniel Khashabi

333

05 May 2025

Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs

Elisa Forcada Rodríguez

Olatz Perez-de-Viñaspre

Jon Ander Campos

Dietrich Klakow

Vagrant Gautam

413

05 May 2025

Real-World Gaps in AI Governance Research

609

30 Apr 2025

CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks

365

29 Apr 2025

JailbreaksOverTime: Detecting Jailbreak Attacks Under Distribution Shift

318

28 Apr 2025

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

Rui Xin

Niloofar Mireshghallah

437

28 Apr 2025

Anyprefer: An Agentic Framework for Preference Data SynthesisInternational Conference on Learning Representations (ICLR), 2025

...

445

27 Apr 2025

Instruction-Tuning Data Synthesis from Scratch via Web ReconstructionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

...

531

22 Apr 2025

What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token PatternsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

337

22 Apr 2025

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

315

22 Apr 2025

Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions

289

21 Apr 2025

RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability

276

14 Apr 2025

DICE: A Framework for Dimensional and Contextual Evaluation of Language Models

Aryan Shrivastava

Paula Akemi Aoyagui

303

14 Apr 2025

Societal Impacts Research Requires Benchmarks for Creative Composition Tasks

Judy Hanwen Shen

Carlos Guestrin

614

09 Apr 2025

NoveltyBench: Evaluating Language Models for Humanlike Diversity

560

07 Apr 2025

PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages

358

06 Apr 2025

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

...

320

04 Apr 2025

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

...

441

31 Mar 2025

Learning to Reason for Long-Form Story Generation

Alexander Gurung

Mirella Lapata

ReLM OffRL LRM

360

28 Mar 2025

REALM: A Dataset of Real-World LLM Use CasesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

180

24 Mar 2025

ChatBench: From Static Benchmarks to Human-AI EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

369

22 Mar 2025

Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

824

21 Mar 2025

Conversational User-AI Intervention: A Study on Prompt Rewriting for Improved LLM Response Generation

346

21 Mar 2025

How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities

Aly M. Kassem

Bernhard Schölkopf

Zhijing Jin

165

20 Mar 2025

Navigating Rifts in Human-LLM Grounding: Study and BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

340

18 Mar 2025

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts

350

17 Mar 2025

The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation

OpenLLM-France community

928

15 Mar 2025

How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation

Ruohao Guo

Wei Xu

Alan Ritter

374

12 Mar 2025

Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations

234

11 Mar 2025

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

408

11 Mar 2025

LLMs syntactically adapt their language use to their conversational partnerAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Florian Kandra

Vera Demberg

Alexander Koller

264

10 Mar 2025

Shifting Long-Context LLMs Research from Input to Output

337

06 Mar 2025

LLM-Safety Evaluations Lack Robustness

993

04 Mar 2025

Large-Scale Data Selection for Instruction Tuning

370

03 Mar 2025

CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom

268

03 Mar 2025

Rethinking LLM Bias Probing Using Lessons from the Social Sciences

Kirsten N. Morehouse

S. Swaroop

Weiwei Pan

380

28 Feb 2025

Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction

381

24 Feb 2025

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale

990

23 Feb 2025

Synthesizing Post-Training Data for LLMs through Multi-Agent SimulationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

519

21 Feb 2025

Retrieval-augmented systems can be dangerous medical communicators

324

18 Feb 2025

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models

415

17 Feb 2025

Idiosyncrasies in Large Language Models

357

17 Feb 2025

Presumed Cultural Identity: How Names Shape LLM Responses

430

17 Feb 2025

SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast AsiaNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

553

10 Feb 2025

DeepThink: Aligning Language Models with Domain-Specific User Intents

313

08 Feb 2025

The Best Instruction-Tuning Data are Those That Fit

572

06 Feb 2025

STAIR: Improving Safety Alignment with Introspective Reasoning

405

04 Feb 2025

Diverse Preference Optimization

736

30 Jan 2025