ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.12901
  4. Cited By
Training and Evaluating a Jupyter Notebook Data Science Assistant

Training and Evaluating a Jupyter Notebook Data Science Assistant

30 January 2022
Shubham Chandel
Colin B. Clement
Guillermo Serrato
Neel Sundaresan
ArXiv (abs)PDFHTML

Papers citing "Training and Evaluating a Jupyter Notebook Data Science Assistant"

25 / 25 papers shown
Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents
Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents
Irene Testini
José Hernández-Orallo
Lorenzo Pacchiardi
229
3
0
10 Jun 2025
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Unified Approach for Elevating Benchmark Quality
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Unified Approach for Elevating Benchmark Quality
Roham Koohestani
Philippe de Bekker
Begüm Koç
Maliheh Izadi
VLM
645
1
0
07 Mar 2025
Rigor, Reliability, and Reproducibility Matter: A Decade-Scale Survey of 572 Code Benchmarks
Rigor, Reliability, and Reproducibility Matter: A Decade-Scale Survey of 572 Code Benchmarks
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
Shing-Chi Cheung
ALM
708
2
0
18 Jan 2025
GitChameleon: Unmasking the Version-Switching Capabilities of Code
  Generation Models
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models
Nizar Islah
Justine Gehring
Diganta Misra
Eilif B. Muller
Irina Rish
Terry Yue Zhuo
Massimo Caccia
SyDa
225
5
0
05 Nov 2024
CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack
  Overflow
CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack OverflowAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Nathanael Beau
Benoît Crabbé
305
6
0
25 Sep 2024
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Gaurav Sahu
Abhay Puri
Juan A. Rodriguez
Alexandre Drouin
Perouz Taslakian
...
Christopher Pal
Nicolas Chapados
I. Laradji
Sai Rajeswar Mudumba
Issam Hadj Laradji
ELM
453
17
0
08 Jul 2024
A Survey on Large Language Models for Code Generation
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
673
801
0
01 Jun 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Jianbo Dai
Jianqiao Lu
Yunlong Feng
Guangtao Zeng
Rongju Ruan
Ming Cheng
Dong Huang
Haochen Tan
Zhijiang Guo
LRMELM
623
26
0
19 May 2024
Linguacodus: A Synergistic Framework for Transformative Code Generation
  in Machine Learning Pipelines
Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning PipelinesPeerJ Computer Science (PeerJ Comput. Sci.), 2024
Ekaterina Trofimova
Emil Sataev
Andrey E. Ustyuzhanin
391
0
0
18 Mar 2024
Capture the Flag: Uncovering Data Insights with Large Language Models
Capture the Flag: Uncovering Data Insights with Large Language Models
I. Laradji
Perouz Taslakian
Sai Rajeswar
Valentina Zantedeschi
Alexandre Lacoste
Nicolas Chapados
David Vazquez
Christopher Pal
Alexandre Drouin
304
3
0
21 Dec 2023
CodeScope: An Execution-based Multilingual Multitask Multidimensional
  Benchmark for Evaluating LLMs on Code Understanding and Generation
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Weixiang Yan
Haitian Liu
Yunkun Wang
Yunzhe Li
Qian Chen
...
Tingyu Lin
Weishan Zhao
Li Zhu
Hari Sundaram
Shuiguang Deng
ELMLRM
471
57
0
14 Nov 2023
Safurai-Csharp: Harnessing Synthetic Data to improve language-specific
  Code LLM
Safurai-Csharp: Harnessing Synthetic Data to improve language-specific Code LLM
Davide Cifarelli
Leonardo Boiardi
Alessandro Puppo
Leon Jovanovic
SyDa
200
2
0
06 Nov 2023
LLM for SoC Security: A Paradigm Shift
LLM for SoC Security: A Paradigm ShiftIEEE Access (IEEE Access), 2023
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
440
92
0
09 Oct 2023
Safurai 001: New Qualitative Approach for Code LLM Evaluation
Safurai 001: New Qualitative Approach for Code LLM Evaluation
Davide Cifarelli
Leonardo Boiardi
Alessandro Puppo
ELM
211
0
0
20 Sep 2023
How Do Analysts Understand and Verify AI-Assisted Data Analyses?
How Do Analysts Understand and Verify AI-Assisted Data Analyses?International Conference on Human Factors in Computing Systems (CHI), 2023
Ken Gu
Ruoxi Shang
Tim Althoff
Chenglong Wang
Steven Drucker
AAML
423
44
0
19 Sep 2023
How Do Data Analysts Respond to AI Assistance? A Wizard-of-Oz Study
How Do Data Analysts Respond to AI Assistance? A Wizard-of-Oz StudyInternational Conference on Human Factors in Computing Systems (CHI), 2023
Ken Gu
Madeleine Grunde-McLaughlin
Andrew M. McNutt
Jeffrey Heer
Tim Althoff
264
46
0
18 Sep 2023
PanGu-Coder2: Boosting Large Language Models for Code with Ranking
  Feedback
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback
Bo Shen
Jiaxin Zhang
Taihong Chen
Daoguang Zan
Bing Geng
...
Ailun Yu
Jichuan Ji
Jingyang Zhao
Yuenan Guo
Qianxiang Wang
ALMELM
267
102
0
27 Jul 2023
SelfEvolve: A Code Evolution Framework via Large Language Models
SelfEvolve: A Code Evolution Framework via Large Language Models
Shuyang Jiang
Yuhao Wang
Yu Wang
347
52
0
05 Jun 2023
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code
  Understanding, Generation, Translation and Retrieval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Mohammad Abdullah Matin Khan
M Saiful Bari
Xuan Long Do
Weishi Wang
Md. Rizwan Parvez
Shafiq Joty
ALMELM
560
61
0
06 Mar 2023
Execution-Based Evaluation for Open-Domain Code Generation
Execution-Based Evaluation for Open-Domain Code GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhiruo Wang
Shuyan Zhou
Daniel Fried
Graham Neubig
ELM
377
106
0
20 Dec 2022
Large Language Models Meet NL2Code: A Survey
Large Language Models Meet NL2Code: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Daoguang Zan
B. Chen
Fengji Zhang
Di Lu
Bingchao Wu
Bei Guan
Yongji Wang
Jian-Guang Lou
ELMALM
368
251
0
19 Dec 2022
Natural Language to Code Generation in Interactive Data Science
  Notebooks
Natural Language to Code Generation in Interactive Data Science NotebooksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Pengcheng Yin
Wen-Ding Li
Kefan Xiao
Abhishek Rao
Yeming Wen
...
Paige Bailey
Michele Catasta
Henryk Michalewski
Oleksandr Polozov
Charles Sutton
264
107
0
19 Dec 2022
DS-1000: A Natural and Reliable Benchmark for Data Science Code
  Generation
DS-1000: A Natural and Reliable Benchmark for Data Science Code GenerationInternational Conference on Machine Learning (ICML), 2022
Yuhang Lai
Chengxi Li
Yiming Wang
Tianyi Zhang
Ruiqi Zhong
Luke Zettlemoyer
Scott Yih
Daniel Fried
Si-yi Wang
Tao Yu
ELMALM
376
488
0
18 Nov 2022
Execution-based Evaluation for Data Science Code Generation Models
Execution-based Evaluation for Data Science Code Generation Models
Junjie Huang
Chenglong Wang
Jipeng Zhang
Cong Yan
Haotian Cui
J. Inala
Colin B. Clement
Nan Duan
Jianfeng Gao
ELM
303
43
0
17 Nov 2022
Fault-Aware Neural Code Rankers
Fault-Aware Neural Code RankersNeural Information Processing Systems (NeurIPS), 2022
J. Inala
Chenglong Wang
Mei Yang
Andrés Codas
Mark Encarnación
Shuvendu K. Lahiri
Madan Musuvathi
Jianfeng Gao
ALM
300
54
0
04 Jun 2022
1
Page 1 of 1