Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.08666
Cited By
Revealing Trends in Datasets from the 2022 ACL and EMNLP Conferences
31 March 2024
Jesse Atuhurra
Hidetaka Kamigaito
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revealing Trends in Datasets from the 2022 ACL and EMNLP Conferences"
14 / 14 papers shown
Title
BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets
Minju Kim
Chaehyeong Kim
Yongho Song
Seung-won Hwang
Jinyoung Yeo
31
13
0
23 Oct 2022
ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts
Rajdeep Mukherjee
Abhinav Bohra
Akash Banerjee
Soumya Sharma
Manjunath Hegde
...
Shivani Shrivastava
Koustuv Dasgupta
Niloy Ganguly
Saptarshi Ghosh
Pawan Goyal
RALM
38
44
0
22 Oct 2022
PcMSP: A Dataset for Scientific Action Graphs Extraction from Polycrystalline Materials Synthesis Procedure Text
Xianjun Yang
Ya Zhuo
Julia Zuo
Xinlu Zhang
Stephen D. Wilson
Linda R. Petzold
21
11
0
22 Oct 2022
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback
Jiacheng Ye
Jiahui Gao
Jiangtao Feng
Zhiyong Wu
Tao Yu
Lingpeng Kong
SyDa
VLM
71
69
0
22 Oct 2022
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play
Qi Liu
Zihuiwen Ye
Tao Yu
Phil Blunsom
Linfeng Song
31
10
0
21 Oct 2022
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
Ashish V. Thapliyal
Jordi Pont-Tuset
Xi Chen
Radu Soricut
VGen
67
71
0
25 May 2022
DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages
Gabriele Sarti
Arianna Bisazza
Ana Guerberof Arenas
Antonio Toral
36
7
0
24 May 2022
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way
Alex Jinpeng Wang
Richard Yuanzhe Pang
Angelica Chen
Jason Phang
Samuel R. Bowman
72
44
0
23 May 2022
KOLD: Korean Offensive Language Dataset
Young-kuk Jeong
Juhyun Oh
Jaimeen Ahn
Jongwon Lee
Jihyung Mon
Sungjoon Park
Alice H. Oh
32
24
0
23 May 2022
Commonsense Knowledge Salience Evaluation with a Benchmark Dataset in E-commerce
Yincen Qu
Ningyu Zhang
Hui Chen
Zelin Dai
Zezhong Xu
Chengming Wang
Xiaoyu Wang
Qianglin Chen
Huajun Chen
29
6
0
22 May 2022
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset
Eric Michael Smith
Melissa Hall
Melanie Kambadur
Eleonora Presani
Adina Williams
65
128
0
18 May 2022
MReD: A Meta-Review Dataset for Structure-Controllable Text Generation
Chenhui Shen
Liying Cheng
Ran Zhou
Lidong Bing
Yang You
Luo Si
39
33
0
14 Oct 2021
ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers
Haitian Sun
William W. Cohen
Ruslan Salakhutdinov
59
33
0
13 Oct 2021
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
Ilias Chalkidis
Abhik Jana
D. Hartung
M. Bommarito
Ion Androutsopoulos
Daniel Martin Katz
Nikolaos Aletras
AILaw
ELM
123
244
0
03 Oct 2021
1