GDC Cohort Copilot: An AI Copilot for Curating Cohorts from the Genomic Data Commons
Steven Song
Anirudh Subramanyam
Zhenyu Zhang
Aarti Venkat
Robert L. Grossman
Main:10 Pages
2 Figures
Bibliography:2 Pages
7 Tables
Abstract
Motivation: The Genomic Data Commons (GDC) provides access to high quality, harmonized cancer genomics data through a unified curation and analysis platform centered around patient cohorts. While GDC users can interactively create complex cohorts through the graphical Cohort Builder, users (especially new ones) may struggle to find specific cohort descriptors across hundreds of possible fields and properties. However, users may be better able to describe their desired cohort in free-text natural language.
View on arXivComments on this paper
