v1v2 (latest)

A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images

28 February 2025

Main:13 Pages

11 Figures

Bibliography:3 Pages

1 Tables

Abstract

This review surveys the state-of-the-art in text-to-image and image-to-image generation within the scope of generative AI. We provide a comparative analysis of three prominent architectures: Variational Autoencoders, Generative Adversarial Networks and Diffusion Models. For each, we elucidate core concepts, architectural innovations, and practical strengths and limitations, particularly for scientific image understanding. Finally, we discuss critical open challenges and potential future research directions in this rapidly evolving field.

View on arXiv

Comments on this paper