ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.06684
  4. Cited By
How do Data Science Workers Collaborate? Roles, Workflows, and Tools
v1v2v3 (latest)

How do Data Science Workers Collaborate? Roles, Workflows, and Tools

18 January 2020
Amy X. Zhang
Michael J. Muller
Dakuo Wang
    FedMLAI4CE
ArXiv (abs)PDFHTML

Papers citing "How do Data Science Workers Collaborate? Roles, Workflows, and Tools"

50 / 97 papers shown
Who Leads? Comparing Human-Centric and Model-Centric Strategies for Defining ML Target Variables
Who Leads? Comparing Human-Centric and Model-Centric Strategies for Defining ML Target Variables
Mengtian Guo
David Gotz
Yue Wang
175
0
0
29 Oct 2025
Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Luke M. Guerdan
Devansh Saxena
Stevie Chancellor
Zhiwei Steven Wu
Kenneth Holstein
253
1
0
03 Jul 2025
AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
An Luo
Xun Xian
Jin Du
Fangqiao Tian
G. Wang
...
Jayanth Srinivasa
Jayanth Srinivasa
Charles Fleming
Mingyi Hong
Jie Ding
264
5
0
25 May 2025
Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs
Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs
Yuxuan Li
Aoi Naito
Hirokazu Shirado
LLMAG
362
3
0
15 May 2025
AI LEGO: Scaffolding Cross-Functional Collaboration in Industrial Responsible AI Practices during Early Design Stages
AI LEGO: Scaffolding Cross-Functional Collaboration in Industrial Responsible AI Practices during Early Design Stages
Muzhe Wu
Yanzhi Zhao
Shuyi Han
Michael Xieyang Liu
Hong Shen
434
1
0
15 May 2025
Talking About the Assumption in the Room
Talking About the Assumption in the RoomInternational Conference on Human Factors in Computing Systems (CHI), 2025
Ramaravind Kommiya Mothilal
Faisal M. Lalani
Syed Ishtiaque Ahmed
Shion Guha
Sharifa Sultana
320
3
0
20 Feb 2025
The Evolution of LLM Adoption in Industry Data Curation Practices
The Evolution of LLM Adoption in Industry Data Curation Practices
Crystal Qian
Michael Xieyang Liu
Emily Reif
Grady Simon
Nada Hussein
Nathan Clement
James Wexler
Carrie J. Cai
Michael Terry
Minsuk Kahng
AILawELM
395
10
0
20 Dec 2024
Behavior Matters: An Alternative Perspective on Promoting Responsible
  Data Science
Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science
Ziwei Dong
Ameya Patil
Yuichi Shoda
Leilani Battle
Emily Wall
AI4CE
191
2
0
07 Oct 2024
"It Might be Technically Impressive, But It's Practically Useless to us": Motivations, Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News Industry
"It Might be Technically Impressive, But It's Practically Useless to us": Motivations, Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News IndustryInternational Conference on Human Factors in Computing Systems (CHI), 2024
Qing Xiao
Xianzhe Fan
Felix M. Simon
Bingbing Zhang
Motahhare Eslami
218
0
0
18 Sep 2024
A Catalog of Fairness-Aware Practices in Machine Learning Engineering
A Catalog of Fairness-Aware Practices in Machine Learning Engineering
Gianmario Voria
Giulia Sellitto
Carmine Ferrara
Francesco Abate
A. Lucia
F. Ferrucci
Gemma Catolino
Fabio Palomba
FaML
404
5
0
29 Aug 2024
Relationships are Complicated! An Analysis of Relationships Between
  Datasets on the Web
Relationships are Complicated! An Analysis of Relationships Between Datasets on the WebInternational Workshop on the Semantic Web (SW), 2024
Kate Lin
Tarfah Alrashed
Natasha Noy
241
2
0
26 Aug 2024
The Implications of Open Generative Models in Human-Centered Data
  Science Work: A Case Study with Fact-Checking Organizations
The Implications of Open Generative Models in Human-Centered Data Science Work: A Case Study with Fact-Checking OrganizationsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Robert Wolfe
Tanushree Mitra
484
4
0
04 Aug 2024
Supporting Industry Computing Researchers in Assessing, Articulating,
  and Addressing the Potential Negative Societal Impact of Their Work
Supporting Industry Computing Researchers in Assessing, Articulating, and Addressing the Potential Negative Societal Impact of Their Work
Wesley Hanwen Deng
Solon Barocas
Jennifer Wortman Vaughan
377
15
0
02 Aug 2024
Bringing Data into the Conversation: Adapting Content from Business
  Intelligence Dashboards for Threaded Collaboration Platforms
Bringing Data into the Conversation: Adapting Content from Business Intelligence Dashboards for Threaded Collaboration PlatformsVisual .. (VISUAL), 2024
Tian Meng
Yang Tao
Wuliang Yin
288
4
0
01 Aug 2024
Improving Steering and Verification in AI-Assisted Data Analysis with
  Interactive Task Decomposition
Improving Steering and Verification in AI-Assisted Data Analysis with Interactive Task Decomposition
Majeed Kazemitabaar
Jack Williams
Ian Drosos
Tovi Grossman
Austin Z. Henley
Carina Negreanu
Advait Sarkar
340
67
0
02 Jul 2024
SLEGO: A Collaborative Data Analytics System with LLM Recommender for
  Diverse Users
SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users
Siu Lung Ng
Hirad Rezaei
F. Rabhi
176
0
0
17 Jun 2024
Towards Feature Engineering with Human and AI's Knowledge: Understanding
  Data Science Practitioners' Perceptions in Human&AI-Assisted Feature
  Engineering Design
Towards Feature Engineering with Human and AI's Knowledge: Understanding Data Science Practitioners' Perceptions in Human&AI-Assisted Feature Engineering Design
Qian Zhu
Dakuo Wang
Shuai Ma
April Yi Wang
Zixin Chen
Udayan Khurana
Xiaojuan Ma
326
2
0
23 May 2024
"Don't Step on My Toes": Resolving Editing Conflicts in Real-Time
  Collaboration in Computational Notebooks
"Don't Step on My Toes": Resolving Editing Conflicts in Real-Time Collaboration in Computational Notebooks
A. Wang
Zihan Wu
Christopher Brooks
Steve Oney
169
7
0
06 Apr 2024
Talaria: Interactively Optimizing Machine Learning Models for Efficient
  Inference
Talaria: Interactively Optimizing Machine Learning Models for Efficient InferenceInternational Conference on Human Factors in Computing Systems (CHI), 2024
Fred Hohman
Chaoqun Wang
Jinmook Lee
Jochen Görtler
Dominik Moritz
Jeffrey P. Bigham
Zhile Ren
Cecile Foret
Qi Shan
Xiaoyi Zhang
324
10
0
03 Apr 2024
"We Have No Idea How Models will Behave in Production until Production":
  How Engineers Operationalize Machine Learning
"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning
Shreya Shankar
Rolando Garcia
J. M. Hellerstein
Aditya G. Parameswaran
293
26
0
25 Mar 2024
OutlineSpark: Igniting AI-powered Presentation Slides Creation from
  Computational Notebooks through Outlines
OutlineSpark: Igniting AI-powered Presentation Slides Creation from Computational Notebooks through OutlinesInternational Conference on Human Factors in Computing Systems (CHI), 2024
Fengjie Wang
Yanna Lin
Leni Yang
Haotian Li
Mingyang Gu
Min Zhu
Huamin Qu
310
18
0
14 Mar 2024
Couler: Unified Machine Learning Workflow Optimization in Cloud
Couler: Unified Machine Learning Workflow Optimization in CloudIEEE International Conference on Data Engineering (ICDE), 2024
Xiaoda Wang
Yuan-ju Tang
Tengda Guo
Bo Sang
Jingji Wu
Jian Sha
Ke Zhang
Jiang Qian
Mingjie Tang
244
1
0
12 Mar 2024
A Flexible Cell Classification for ML Projects in Jupyter Notebooks
A Flexible Cell Classification for ML Projects in Jupyter Notebooks
Miguel Pérez-Francisco
Selin Aydin
Horst Lichter
101
2
0
12 Mar 2024
Detectors for Safe and Reliable LLMs: Implementations, Uses, and
  Limitations
Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Swapnaja Achintalwar
Adriana Alvarado Garcia
Ateret Anaby-Tavor
Ioana Baldini
Sara E. Berger
...
Aashka Trivedi
Kush R. Varshney
Dennis L. Wei
Shalisha Witherspooon
Marcel Zalmanovici
376
15
0
09 Mar 2024
Guidelines for Integrating Value Sensitive Design in Responsible AI
  Toolkits
Guidelines for Integrating Value Sensitive Design in Responsible AI Toolkits
Malak Sadek
Marios Constantinides
Daniele Quercia
C. Mougenot
307
39
0
29 Feb 2024
Content-Centric Prototyping of Generative AI Applications: Emerging
  Approaches and Challenges in Collaborative Software Teams
Content-Centric Prototyping of Generative AI Applications: Emerging Approaches and Challenges in Collaborative Software Teams
Hari Subramonyam
Divy Thakkar
Jurgen Dieber
Anoop Sinha
332
1
0
27 Feb 2024
Understanding the Dataset Practitioners Behind Large Language Model
  Development
Understanding the Dataset Practitioners Behind Large Language Model Development
Crystal Qian
Emily Reif
Minsuk Kahng
382
4
0
21 Feb 2024
Towards a Non-Ideal Methodological Framework for Responsible ML
Towards a Non-Ideal Methodological Framework for Responsible MLInternational Conference on Human Factors in Computing Systems (CHI), 2024
Ramaravind Kommiya Mothilal
Shion Guha
Syed Ishtiaque Ahmed
425
14
0
20 Jan 2024
Make It Make Sense! Understanding and Facilitating Sensemaking in
  Computational Notebooks
Make It Make Sense! Understanding and Facilitating Sensemaking in Computational Notebooks
Souti Chattopadhyay
Zixuan Feng
Emily Arteaga
Audrey Au
Gonzalo Ramos
Titus Barik
Anita Sarma
164
7
0
18 Dec 2023
Investigating Collaborative Data Practices: a Case Study on Artificial
  Intelligence for Healthcare Research
Investigating Collaborative Data Practices: a Case Study on Artificial Intelligence for Healthcare Research
R. Henkin
Elizabeth Remfry
Duncan J. Reynolds
M. Clinch
Michael R. Barnes
290
3
0
30 Nov 2023
Exploring Links between Conversational Agent Design Challenges and
  Interdisciplinary Collaboration
Exploring Links between Conversational Agent Design Challenges and Interdisciplinary Collaboration
Malak Sadek
C. Mougenot
LLMAG
161
0
0
15 Nov 2023
Is a Seat at the Table Enough? Engaging Teachers and Students in Dataset
  Specification for ML in Education
Is a Seat at the Table Enough? Engaging Teachers and Students in Dataset Specification for ML in Education
Mei Tan
Hansol Lee
Dakuo Wang
Hariharan Subramonyam
221
14
0
09 Nov 2023
The Value-Sensitive Conversational Agent Co-Design Framework
The Value-Sensitive Conversational Agent Co-Design FrameworkInternational journal of human computer interactions (IJHCI), 2023
Malak Sadek
Rafael A. Calvo
C. Mougenot
3DV
260
10
0
18 Oct 2023
MetaAgents: Large Language Model Based Agents for Decision-Making on Teaming
MetaAgents: Large Language Model Based Agents for Decision-Making on Teaming
Yuan Li
Lichao Sun
Yixuan Zhang
LLMAGLM&Ro
514
119
0
10 Oct 2023
Model Compression in Practice: Lessons Learned from Practitioners
  Creating On-device Machine Learning Experiences
Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning ExperiencesInternational Conference on Human Factors in Computing Systems (CHI), 2023
Fred Hohman
Mary Beth Kery
Donghao Ren
Dominik Moritz
448
33
0
06 Oct 2023
Fostering Enterprise Conversations Around Data on Collaboration
  Platforms
Fostering Enterprise Conversations Around Data on Collaboration Platforms
Hyeok Kim
Arjun Srinivasan
Matthew Brehmer
254
0
0
06 Oct 2023
Who is the Audience? Designing Casual Data Visualizations for the
  'General Public'
Who is the Audience? Designing Casual Data Visualizations for the 'General Public'
Regina Schuster
Laura M. Koesten
Torsten Moller
Kathleen Gregory
213
0
0
03 Oct 2023
How Do Analysts Understand and Verify AI-Assisted Data Analyses?
How Do Analysts Understand and Verify AI-Assisted Data Analyses?International Conference on Human Factors in Computing Systems (CHI), 2023
Ken Gu
Ruoxi Shang
Tim Althoff
Chenglong Wang
Steven Drucker
AAML
437
52
0
19 Sep 2023
Whombat: An open-source annotation tool for machine learning development
  in bioacoustics
Whombat: An open-source annotation tool for machine learning development in bioacoustics
Santiago Martínez Balvanera
Oisin Mac Aodha
Matthew J. Weldy
Holly Pringle
Ella Browning
Kate E. Jones
296
4
0
24 Aug 2023
Ground Truth Or Dare: Factors Affecting The Creation Of Medical Datasets
  For Training AI
Ground Truth Or Dare: Factors Affecting The Creation Of Medical Datasets For Training AIAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
H. D. Zając
Natalia-Rozalia Avlona
T. O. Andersen
F. Kensing
Irina Shklovski
171
35
0
12 Aug 2023
RAI Guidelines: Method for Generating Responsible AI Guidelines Grounded
  in Regulations and Usable by (Non-)Technical Roles
RAI Guidelines: Method for Generating Responsible AI Guidelines Grounded in Regulations and Usable by (Non-)Technical Roles
Marios Constantinides
Edyta Bogucka
Daniele Quercia
Susanna Kallio
Mohammad Tahaei
320
29
0
27 Jul 2023
Machine Learning practices and infrastructures
Machine Learning practices and infrastructuresAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
G. Berman
313
7
0
13 Jul 2023
Designing a Direct Feedback Loop between Humans and Convolutional Neural
  Networks through Local Explanations
Designing a Direct Feedback Loop between Humans and Convolutional Neural Networks through Local Explanations
Tong Sun
Yuyang Gao
Shubham Khaladkar
Sijia Liu
Bo Pan
Younghoon Kim
S. Hong
AAMLFAttHAI
405
9
0
08 Jul 2023
Investigating Practices and Opportunities for Cross-functional
  Collaboration around AI Fairness in Industry Practice
Investigating Practices and Opportunities for Cross-functional Collaboration around AI Fairness in Industry PracticeConference on Fairness, Accountability and Transparency (FAccT), 2023
Wesley Hanwen Deng
Nuri Yildirim
Monica Chang
Motahhare Eslami
Kenneth Holstein
Michael A. Madaio
301
59
0
10 Jun 2023
How Do UX Practitioners Communicate AI as a Design Material? Artifacts,
  Conceptions, and Propositions
How Do UX Practitioners Communicate AI as a Design Material? Artifacts, Conceptions, and Propositions
K. J. Kevin Feng
Maxwell James Coppock
David W. McDonald
276
32
0
27 May 2023
SuperNOVA: Design Strategies and Opportunities for Interactive
  Visualization in Computational Notebooks
SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks
Zijie J. Wang
David Munechika
Seongmin Lee
Duen Horng Chau
384
12
0
04 May 2023
Why is AI not a Panacea for Data Workers? An Interview Study on Human-AI Collaboration in Data Storytelling
Why is AI not a Panacea for Data Workers? An Interview Study on Human-AI Collaboration in Data StorytellingIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haotian Li
Yun Wang
Q. V. Liao
Huamin Qu
365
30
0
17 Apr 2023
DASS Good: Explainable Data Mining of Spatial Cohort Data
DASS Good: Explainable Data Mining of Spatial Cohort Data
A. Wentzel
Carla Floricel
G. Canahuate
Mohamed A.Naser
A. Mohamed
Clifton Fuller
L. V. Dijk
G. Marai
OOD
195
12
0
10 Apr 2023
Tracing and Visualizing Human-ML/AI Collaborative Processes through
  Artifacts of Data Work
Tracing and Visualizing Human-ML/AI Collaborative Processes through Artifacts of Data WorkInternational Conference on Human Factors in Computing Systems (CHI), 2023
Jennifer Rogers
Anamaria Crisan
236
13
0
05 Apr 2023
A Meta-Summary of Challenges in Building Products with ML Components --
  Collecting Experiences from 4758+ Practitioners
A Meta-Summary of Challenges in Building Products with ML Components -- Collecting Experiences from 4758+ Practitioners
Nadia Nahar
Haoran Zhang
Grace A. Lewis
Shurui Zhou
Jane Hsieh
377
62
0
31 Mar 2023
12
Next
Page 1 of 2