Enhancing multi-class malware detection in resource-constrained environments

Citation

Abstract

The emergence of multi-class malware attacks such as ransomware, spyware, trojan etc. presents an increasing and serious threat to cybersecurity, particularly in resource constrained environments like in IoT devices. Existing machine learning models have achieved nearly perfect accuracy in binary malware classification but falls short in terms of classifying malware families and individual malwares. Additionally, the complexity of these multi-class malware attacks present a significant challenge of detection in resourceconstrained environments as multi-class detection usually requires high computational capability. This research bridges the gap by enhancing the detection accuracy of multiclass malware classification as well as developing a lightweight model that can run efficiently on resource-constrained devices. In this paper, we propose a robust lightweight machine learning model featuring LightGBM classifier with SMOTE oversampling and SOM-US undersampling techniques for data balancing as well as well-engineered feature selection through Genetic Algorithm. The model performed better than the current state of the art models developed on the same dataset in both malware family classification (4 classes) and individual malware type classification (16 classes) with accuracy of 89.1% and 76% respectively. Thus, Maintaining a balance between classification accuracy and computational efficiency in resource constrained environments. Furthermore, we propose another model using Random Forest classifier with an accuracy of 91.2% in malware family classification and 78.7% in individual malware classification. Demonstrating a significant enhancement in terms of accuracy from the current state of the art models.

Description

Cataloged from PDF version of thesis.
Includes bibliographical references (pages 51-53).
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2025.

Publisher Link

Type

Thesis