International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 12 Issue: 07 | July 2025
p-ISSN: 2395-0072
www.irjet.net
AI-DRIVEN SOCIAL ENGINEERING AWARENESS TRAINING MODEL Lavanya K M1, Dr. H P Mohan Kumar2 1Department of MCA, P.E.S College of Engineering, Mandya, Karnataka, India
2Department of CS&E, P.E.S College of Engineering, Mandya, Karnataka, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - In the rapidly evolving landscape of digital
audio files using MFCC-based feature extraction and deep learning models such as LSTM (Long Short-Term Memory) and CNN (Convolutional Neural Networks). Text classification is based on converting message content into numerical vectors using TF-IDF and then feeding it to classifiers like Logistic Regression and SVM. For voice detection, MFCC features help in capturing speech characteristics which are then used to train the neural network models to distinguish between real and fake (AIgenerated or mimicked) audio.
communication, threats like social engineering have become a serious challenge. Social engineering attacks, particularly those conducted through telephonic conversations and text messages, aim to manipulate individuals into revealing sensitive personal, financial, or security-related information. These attacks bypass traditional technical security measures by exploiting human psychology. The rise of fake calls, scam messages, and voice spoofing has necessitated the development of intelligent systems capable of detecting such malicious attempts in real-time.
2. LITERATURE SURVEY
This project presents a comprehensive system for the detection and classification of telephonic social engineering attacks, combining both textual message data and telephonic audio recordings. The system uses machine learning and deep learning models to classify text messages as either spam (social engineering attempt) or non-spam, and audio files as either fake (synthetic/manipulated) or real (genuine voice).
[1] In an early study, Breda et al. (2017) emphasized the psychological manipulation at the core of social engineering, where attackers exploit human trust, curiosity, and urgency to gain access to sensitive information. Although the study thoroughly discussed real-world incidents and human vulnerabilities, it lacked the development of technical or automated solutions to mitigate these attacks.
Key Words: Machine learning, deep learning, genetic
[2] Expanding on this foundation, Salahudin and Kabocha (2019) conducted a comprehensive survey categorizing social engineering methods, such as phishing, baiting, and pretexting. Their work was pivotal in identifying behavioural triggers and proposing countermeasures like access control and employee training. However, their approach remained reactive and did not suggest intelligent, real-time AI systems for proactive detection.
algorithm, LSTM, TF-IDF, KNN, SVM
1.INTRODUCTION Social engineering is a psychological manipulation technique wherein attackers deceive individuals into revealing confidential information. Unlike traditional hacking, which requires technical expertise, social engineering relies on exploiting human emotions such as fear, urgency, curiosity, or trust. These attacks often come in the form of fraudulent phone calls, messages, or emails. For instance, a caller may impersonate a bank representative and ask for OTPs, PIN numbers, or account details. In other cases, automated fake voices powered by AI technology may attempt to sound convincing enough to bypass voice authentication systems.
[3] With the growing role of social media in daily communication, Naz et al. (2024) explored the risks of social engineering through platforms like Facebook and Twitter. The study showcased how attackers use publicly available information to engineer convincing scams and impersonation. Despite its depth in social media-focused threats, the work lacked any integration of speech-based attack vectors or voice-based detection systems.
Given this context, there is a dire need to develop intelligent systems that can help detect and prevent these manipulative attacks. This project, titled "AI-DRIVEN SOCIAL ENGINEERING AWARENESS TRAINING MODEL", is a step towards addressing this critical issue. It aims to build a detection system that can classify inputs—either in text or audio format—as either genuine or suspicious.
[4] In a more implementation-oriented study, Deepak Singh Malik (2023) analysed phone-based scams and proposed machine learning-based anomaly detection as a potential solution. His recommendation to combine user profiling and voice analysis laid the groundwork for multi-modal security systems. However, the study lacked concrete implementation or experimental validation of the proposed models.
The solution is two-fold: one part deals with text message analysis using natural language processing (NLP) and machine learning (ML), while the second part processes
© 2025, IRJET
|
Impact Factor value: 8.315
|
ISO 9001:2008 Certified Journal
|
Page 836