ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.12760
8
0

High-resolution Image-based Malware Classification using Multiple Instance Learning

21 November 2023
Tim Peters
H. Farhat
ArXivPDFHTML
Abstract

This paper proposes a novel method of classifying malware into families using high-resolution greyscale images and multiple instance learning to overcome adversarial binary enlargement. Current methods of visualisation-based malware classification largely rely on lossy transformations of inputs such as resizing to handle the large, variable-sized images. Through empirical analysis and experimentation, it is shown that these approaches cause crucial information loss that can be exploited. The proposed solution divides the images into patches and uses embedding-based multiple instance learning with a convolutional neural network and an attention aggregation function for classification. The implementation is evaluated on the Microsoft Malware Classification dataset and achieves accuracies of up to 96.6%96.6\%96.6% on adversarially enlarged samples compared to the baseline of 22.8%22.8\%22.8%. The Python code is available online at https://github.com/timppeters/MIL-Malware-Images .

View on arXiv
Comments on this paper