ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.20493
33
0

Unified Kernel-Segregated Transpose Convolution Operation

27 February 2025
Vijay Srinivas Tida
Md. Imran Hossen
Liqun Shan
Sai Venkatesh Chilukoti
Sonya Hsu
X. Hei
ArXivPDFHTML
Abstract

The optimization of the transpose convolution layer for deep learning applications is achieved with the kernel segregation mechanism. However, kernel segregation has disadvantages, such as computing extra elements to obtain the output feature map with odd dimensions while launching a thread. To mitigate this problem, we introduce a unified kernel segregation approach that limits the usage of memory and computational resources by employing one unified kernel to execute four sub-kernels. The findings reveal that the suggested approach achieves an average computational speedup of 2.03x (3.89x) when tested on specific datasets with an RTX 2070 GPU (Intel Xeon CPU). The ablation study shows an average computational speedup of 3.5x when evaluating the transpose convolution layers from well-known Generative Adversarial Networks (GANs). The implementation of the proposed method for the transpose convolution layers in the EB-GAN model demonstrates significant memory savings of up to 35 MB.

View on arXiv
@article{tida2025_2502.20493,
  title={ Unified Kernel-Segregated Transpose Convolution Operation },
  author={ Vijay Srinivas Tida and Md Imran Hossen and Liqun Shan and Sai Venkatesh Chilukoti and Sonya Hsu and Xiali Hei },
  journal={arXiv preprint arXiv:2502.20493},
  year={ 2025 }
}
Comments on this paper