arXiv: 2501.11199
Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation

28 January 2025
Ivan Lopez
Fateme Nateghi Haredasht
Kaitlin Caoili
Jonathan H. Chen
Akshay S. Chaudhari
MedIm SyDa

Papers citing "Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation"

14 / 14 papers shown
Enhancing Clinical Documentation with Synthetic Data: Leveraging Generative Models for Improved Accuracy
Anjanava Biswas
Wrick Talukdar
SyDa
203
9
0
03 Jun 2024
Zero-Shot Clinical Trial Patient Matching with LLMs
Michael Wornow
Alejandro Lozano
Dev Dash
Jenelle A. Jindal
Kenneth W. Mahaffey
Nigam H. Shah
306
56
0
05 Feb 2024
Towards Conversational Diagnostic AI
Tao Tu
Anil Palepu
M. Schaekermann
Khaled Saab
Jan Freyberg
...
Katherine Chou
Greg S. Corrado
Yossi Matias
Alan Karthikesalingam
Vivek Natarajan
AI4MH LM&MA
245
136
0
11 Jan 2024
Two Directions for Clinical Data Generation with Large Language Models: Data-to-Label and Label-to-Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Rumeng Li
Xun Wang
Hong Yu
LM&MA
196
40
0
09 Dec 2023
Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization
Nature Network Boston (NNB), 2023
Dave Van Veen
Cara Van Uden
Louis Blankemeier
Jean-Benoit Delbrouck
Asad Aali
...
C. Langlotz
Jason Hom
S. Gatidis
John M. Pauly
Akshay S. Chaudhari
ELM AI4MH LM&MA
862
560
0
14 Sep 2023
CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference
Madhumita Sushil
Vanessa E. Kennedy
Divneet Mandair
Brenda Y. Miao
T. Zack
A. Butte
271
43
0
07 Aug 2023
A Study of Generative Large Language Model for Medical Research and Healthcare
C.A.I. Peng
Xi Yang
Aokun Chen
Kaleb E. Smith
Nima M. Pournejatian
...
W. Hogan
E. Shenkman
Yi Guo
Jiang Bian
Yonghui Wu
LM&MA ELM AI4MH
331
370
0
22 May 2023
MTEB: Massive Text Embedding Benchmark
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
1.0K
671
0
13 Oct 2022
Annotation-efficient deep learning for automatic medical image segmentation
Shanshan Wang
Cheng Li
Rongpin Wang
Zaiyi Liu
Meiyun Wang
...
Xin Liu
Jie Chen
Hui-Chong Zhou
Ismail Ben Ayed
Bingsheng Huang
VLM MedIm
245
248
0
09 Dec 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
2.0K
52,011
0
28 May 2020
Modeling Tabular data using Conditional GAN
Neural Information Processing Systems (NeurIPS), 2019
Lei Xu
Maria Skoularidou
Alfredo Cuesta-Infante
K. Veeramachaneni
CML MU SyDa GAN
440
1,622
0
01 Jul 2019
Publicly Available Clinical BERT Embeddings
Emily Alsentzer
John R. Murphy
Willie Boag
W. Weng
Di Jin
Tristan Naumann
Matthew B. A. McDermott
AI4MH
651
2,322
0
06 Apr 2019
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison
Jeremy Irvin
Pranav Rajpurkar
M. Ko
Yifan Yu
Silviana Ciurea-Ilcus
...
D. Larson
C. Langlotz
Bhavik Patel
M. Lungren
A. Ng
568
3,057
0
21 Jan 2019
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
Leland McInnes
John Healy
James Melville
945
11,179
0
09 Feb 2018