DATA MINING METHODS FOR OBESITY LEVEL RECOGNITION: A SYSTEMATIC REVIEW OF THE LITERATURE

Page 1

Rafael Luckert, International Journal of Advanced Trends in Computer Applications (IJATCA) Volume 7, Number 2, October - 2020, pp. 12-24 ISSN: 2395-3519

International Journal of Advanced Trends in Computer Applications www.ijatca.com

DATA MINING METHODS FOR OBESITY LEVEL RECOGNITION: A SYSTEMATIC REVIEW OF THE LITERATURE 1

Rafael Luckert, 2Kevin De Alba, 3Jaime Sarmiento, 4Karen Salas Viloria, 5Alexis De la Hoz Manotas, 6 Fabio Mendoza Palechor 1 Department of Computer Science and Electronics, Universidad de la Costa, Barranquilla, Colombia 1 rluckert1@cuc.edu.co, 2kdealba1@cuc.edu.co, 3jsarmient37@cuc.edu.co, 4ksalas@cuc.edu.co, 5 adelahoz6@cuc.edu.co, 6fmendoza1@cuc.edu.co

Abstract: Obesity in teenagers and adults has increased worldwide, with serious impact and consequences for health in the short and long term. Technology has allowed to discover new ways of treating diseases and problems with health issues, and data mining has become a relevant area of research and discovery, especially in recent years due to its precision and reliability analyzing datasets of patients to detect diseases and facilitate their prevention. The goal of this study was to identify the techniques and algorithms in data mining most commonly used, to detect several factors that favor the apparition of obesity issues and to determine the reliability of those methods, based on the results obtained from a data mining model. Data mining methods as simple regression and decision trees, are most commonly used to detect obesity levels, where the simple regression method was found in 19% of the articles reviewed and the decision trees method was used in 11% of them.

Keywords: obesity, data mining, overweight.

I. INTRODUCTION Overweight and obesity are defined as abnormal or excessive fat accumulation that may impair health. In 2016, WHO statistics accounted that 39% of adult people and over 18 years old were overweight, and 13% were obese, more than 340 million children and adolescents aged 5-19 were overweight or obese [1]. Overweight can be measured with the Mass Body Index (MBI), that ties weight with the height of a person, an increase in MBI can increase the risk to contract diseases such as diabetes and cardiac conditions. Data mining has become widely used in the field of medicine, focusing its impact in the use of tools for analysis and classification of information. Many studies about obesity, use several data mining methods to identify possible causes and risks that are present in this condition, having variables such as: the presence of a high caloric intake, sedentarism, hormonal issues and others, the early detection of these factors, can improve methods for preventing obesity in population. The analysis of obesity related problems is a topic of interest for researchers, with a significant number of authors producing solutions based on intelligent

methods for early and efficient detection of this disease, especially contributions from studies as [2-5]. According to literature, a common trend for studies related with obesity are methods like logistic regression [9-10], support vector machines [24], Naïve Bayes [6] and decision trees [15][28][35]. Those data mining techniques and many others, are applied to datasets, collections of information from many and different patients, to determine patterns and models for detecting obesity problems, some of those datasets are PubMed-NCBI [12], ANZCTRN [13], STRIDE [25][34] and DEOL [43]. This systematic review focuses on answering questions such as: ¿which techniques or algorithms in data mining are commonly used to detect factors related to obesity?, ¿What is the reliability of the results shown by a data mining model to detect obesity issues?, to achieve that objective, a sample of articles was observed, all having proposals and implementations of analytics using machine learning and data mining for information validation and decision making. Finally, the study is structured with the following sections: II. Previous Studies, contains the information

www.ijatca.com

12


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
DATA MINING METHODS FOR OBESITY LEVEL RECOGNITION: A SYSTEMATIC REVIEW OF THE LITERATURE by IJATCA - Issuu