International Research Journal of Engineering and Technology (IRJET)e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019
www.irjet.net
p-ISSN: 2395-0072
“BREAST CANCER DISEASE PREDICTION: USING MACHINE LEARNING APPROACH” Bhondve Arti T1, Bhame Vaishali S2, Kadam Aishwarya R3, Kopnar Komal D4 1,2,3,4Department
of Information Technology, SVPM’s COE Malegaon(bk), Baramati
----------------------------------------------------------------***----------------------------------------------------------------outcome is one of the most interesting and Abstract - Cancer is not a single disease, but challenging tasks for physicians. Cancer or tumor is rather many related diseases that all involve a group of disease that involve abnormal cell uncontrolled cellular growth and reproduction. growth with the potential to spread to other parts Breast cancer is the leading cause of death in the of the body but not all tumors are cancerous. There developed world and second in the developing are various types of cancer, including breast world, killing almost 8 million people a year. Since cancer, skin cancer, lung cancer, colon cancer and cancer is many diseases, treating an individual lymphoma. Breast cancer is the one of the popular cancer requires knowing what abnormal behaviors and second leading cause of cancer death. It can be are happening inside the cells. Machine learning is found in both men and women, but it is more the subfield of computer science that studies common for women. It is important to all people programs that generalize from past experience. especially women to be aware of changes in the This project looks at classification, where an breasts and to know the signs and symptoms of algorithm tries to predict the label for a sample. breast cancer. In this project, we propose K-NN and The machine learning algorithm takes many of DT algorithms that functions is a reliable for cancer these samples, called the training set, and builds an prediction. K-NN algorithm use for classification internal model. Machine learning is the study of application of text categorization. The decision tree algorithm and systems that improve their classifier (DTC) is one of the possible approaches performance with experience. Machine learning is to multistage decision making. decision tree use to classify and predict the data. This model is algorithm can be use regression and classification use to predict by using machine learning problems. The general motive of using Decision approaches. This model is use to accurate analysis Tree is to create a training model which can use to of medical data and early breast cancer disease predict class or value of target variables prediction. Using KNN algorithm and decision tree, by learning decision rules inferred from training by clustering tumours are predicted breast cancer data. Decision tree classifier provide flexibility as is benign or malignant. well as accuracy and time / space efficiency. Keywords— machine learning, healthcare, Decision tree provide more unified view of decision tree, big data, K-nearest neighbor Decision tree classifier. Loaded Data set is not prealgorithm. processed. It is noisy , redundant and unreliable data so, data set pre-processing is the final training set. It is well known that data preparation and 1. Introduction filtering steps take considerable amount of processing time in machine learning problems. Over the past decades, a continuous evolution Data pre-processing includes data cleaning, related to breast cancer research has been transformation, feature extraction and selection, performed. Scientists applied different methods, etc. such as screening in early stage, in order to find In this work, we constructed an expert system types of breast cancer before they cause symptoms. called cancer prediction system which predict Moreover, they have developed new strategies for specific cancer “breast cancer” risk, it help the user the early prediction of breast cancer treatment to predict the cancer. It can save cost and time. This outcome. With the advent of new technologies in system help the people to know their cancer risk the field of medicine, large amounts of breast and it also help the people to take appropriate cancer data have been collected and are available decision based on their cancer risk status. to the medical research community. However, the accurate prediction of a breast cancer disease © 2019, IRJET
|
Impact Factor value: 7.211
|
ISO 9001:2008 Certified Journal
|Page 426