A Structural-Semantic Approach Integrating Graph-Based and Large Language Models Representation to Detect Android Malware

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

The Android operating system’s dominance in the smartphone market cements its pivotal role in shaping contemporary connectivity and technological innovation, with the rate of application development increasing at an unprecedented rate. However, this rapid growth also presents challenges, as malicious actors exploit vulnerabilities to infiltrate systems with malware, posing substantial threats to individual users and organizations. Security experts continuously develop strategies and methods to address these challenges. However, the evolving nature of these attacks presents ongoing challenges to security measures aimed at detecting emerging malware. While deep learning methods offer promise by leveraging multi-level features for more adaptive malware detection, many existing approaches focus primarily on high-level features such as permissions and data flow, potentially limiting their long-term efficacy. To gain a deeper understanding of the nature of these attacks, it is crucial for existing approaches to pay more attention to the essential structural and semantic aspects of Android applications. We propose a multi-level technique utilizing graph-based representations to capture high-level structural information effectively. We extract detailed source-level information by integrating pre-trained large language models (LLMs), learning deeper syntax and semantic features. Combining both, we attribute the graph-based representation of Android applications with source-level features. Leveraging graph convolutional neural networks, we comprehensively process and analyze these graphs. Our proposed methods demonstrate superior results compared to existing and baseline approaches. This work offers an innovative approach to understanding malware at high structural, low source, and semantic levels, enhancing cybersecurity defenses against evolving threats in the dynamic landscape of Android security.

Original languageEnglish
Title of host publicationICT Systems Security and Privacy Protection - 39th IFIP International Conference, SEC 2024, Proceedings
EditorsNikolaos Pitropakis, Sokratis Katsikas, Steven Furnell, Konstantinos Markantonakis
PublisherSpringer Science and Business Media Deutschland GmbH
Pages279-293
Number of pages15
ISBN (Print)9783031651748
DOIs
StatePublished - 2024
Event39th IFIP International Conference on ICT Systems Security and Privacy Protection, SEC 2024 - Edinburgh, United Kingdom
Duration: 12 Jun 202414 Jun 2024

Publication series

NameIFIP Advances in Information and Communication Technology
Volume710
ISSN (Print)1868-4238
ISSN (Electronic)1868-422X

Conference

Conference39th IFIP International Conference on ICT Systems Security and Privacy Protection, SEC 2024
Country/TerritoryUnited Kingdom
CityEdinburgh
Period12/06/2414/06/24

Keywords

  • Android malware
  • Attributed graphs
  • Graph attention network
  • Graph classification
  • LLM

Fingerprint

Dive into the research topics of 'A Structural-Semantic Approach Integrating Graph-Based and Large Language Models Representation to Detect Android Malware'. Together they form a unique fingerprint.

Cite this