International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 12 Issue: 07 | Jul 2025
p-ISSN: 2395-0072
www.irjet.net
Cardiovascular Risk Prediction Er. Jagmeet Singh1, Er. Charandeep Singh2 1Jagmeet Singh Dept. of Computer science and Engineering, Baba Farid College of Engineering and technology,
Bathinda, Punjab
2Assistant Professor Charandeep Singh Bedi, Dept. of Computer Science and Engineering, Baba Farid College of
Engineering and technology, Bathinda, Punjab ---------------------------------------------------------------------***--------------------------------------------------------------------countless lives. However, diagnosing and treating patients at Abstract - Cardiovascular disease (CVD) impairs the early stages poses a noticeable challenge for cardiologists. function of the heart and blood vessels, which often resulting in death or physical paralysis. Thus, early and Traditional CVD risk-assessment models often assume a automated detection of CVD is important for human linear relationship between each risk factor and CVD lives. Although various studies have goal to achieve this, outcomes. These models tend to simplify complex but there still room for improving performance and relationships, including numerous risk factors with nonreliability. This study makes contribution in this linear interactions. It is essential to incorporate multiple risk direction. It implemented two reliable machine learning factors accurately and examine the nuanced correlations between these factors and outcomes. Till date, no large-scale techniques—multi-layer perceptron (MLP) and Kstudy has utilized routine clinical data and machine learning nearest neighbour (K-NN)—to detect CVD using data (ML) for prognostic CVD assessment. This study aims to from the publicly available University of California Irvine determine if ML can enhance cardiovascular risk prediction repository. Model performance is enhanced by removing accuracy in primary care populations and also identify which outliers and attributes with null values. Experimental ML algorithms yield the most concise and effective results. results show that the MLP model achieves a detection accuracy of 82.47% and an area-under-the-curve (AUC) 1.1 Objectives value of 86.41%, outperforming the K- Nearest 1. Develop a novel cardiovascular risk prediction model Neighbors model. Therefore, the MLP model is using advanced machine learning techniques, integrating recommended for automatic CVD detection. Additionally, comprehensive patient data for improved accuracy. the proposed methodology also applied on detection of other diseases and can be validated using other standard 2. Validate and compare the model's performance with datasets. existing standards, ensuring robustness and generalizability through rigorous testing and evaluation.
Key Words: MLP, AUC, CVD, K-NN.
3. Enhance model interpretability to facilitate clinical integration, providing clear insights into risk factors for personalized patient care.
1. INTRODUCTION Health is most prominent aspect of everyone's life. However, numerous factors, such as work stress, unhealthy lifestyles, and psychological strain, external influences like hazardous work environments, inadequate health services, and pollution, contribute to millions of people worldwide developing chronic ailments like cardiovascular diseases (CVD). These conditions badly affect the heart and blood vessels, which are leading to death or disability. Recent reports shows that a significant proportion of human deaths are due to CVD[1,2]. Conditions associated with CVD include hypertension, hyperlipidaemia, thromboembolism, and coronary heart disease, all are responsible for heart failure, and with hypertension it is known to be as primary cause [3]. In 2012, 7.4 million people died from coronary heart disease, while 6.7 million died from stroke [4]. According to the World Health Organization, nearly 17 million people die annually from CVDs, accounting for approximately 31% of global deaths. Early diagnosis of CVD can potentially save
© 2025, IRJET
|
Impact Factor value: 8.315
1.2 Data Overview The dataset is from an ongoing cardiovascular study on residents of the town of Framingham, Massachusetts. The classification goal is to predict whether the patient has a 10year risk of future coronary heart disease (CHD). the dataset includes data on over 4,000 individuals with coronary heart disease, with 15 characteristics describing each patient.
1.3 Data Cleaning Prior to EDA, cleaning the data is important since it will get rid of anyduplicate information that can have an impact on the results. Education, heartRate, BPMed , totChol BMI , cigsPerDay, and glucose columns have missing or null values. I have filled these columns for missing values by using KNN
|
ISO 9001:2008 Certified Journal
|
Page 636