GRD Journals- Global Research and Development Journal for Engineering | Volume 5 | Issue 5 | April 2020 ISSN- 2455-5703
Comparative Study of Classification of Cancerous Profile using Deep Learning and Classification Algorithms Snehal Mastud Department of Computer Engineering Smt. Indira Gandhi College of Engineering
Shweta Pandit Department of Computer Engineering Smt. Indira Gandhi College of Engineering
Ankita Mungekar Department of Computer Engineering Smt. Indira Gandhi College of Engineering
Prof. Sachin Desai Department of Computer Engineering Smt. Indira Gandhi College of Engineering
Abstract In this paper, we have implemented the Comparative Study of ML and DL approaches employed in the modelling of cancer progression. Here we are showing the comparative analysis of ML and DL techniques and their classification algorithms. And our system also contains a website where in hospitals lab technician can detect type of cancer easily. Keywords- SVM (Support Vector Machine), ML (Machine Learning), DL (Deep Learning), ANN (Artificial Neural Network), CNN (Convolution Neural Network), NaĂŻve Bayes, Decision Tree
I. INTRODUCTION We aim at creating a system that can detect whether that person having a cancer or not. We have implemented ML and DL techniques and their classification algorithms. Implemented Machine Learning algorithms are SVM, NaĂŻve Bayes, and Decision Tree. Implemented Deep Learning algorithms are ANN and CNN. These algorithms are applied on a Data Set taken from Kaggle. Above algorithms are implemented on a Data set and algorithms gives its accuracy in percentage. Our System also contains a website. This website is useful for lab technician for predicting cancer. By using website we can easily detect two types of cancer. That two types of cancer are Malignant cancer and Metastatic cancer. A. Support Vector Machine Algorithm Support Vector Machine (SVM) is a supervised machine learning algorithm which can be used for both classification challenges. However, it is mostly used in classification problems. In this algorithm, plot each data item as a point in n-dimensional space (where n is number of features you have) with the value of each feature being the value of a particular coordinate. Support Vectors are simply the coordinates of individual observation. In this paper mainly we will consider the input is based upon Support Vector Machine as training data, testing data is decision value. In this method we consider the following steps like Load Dataset, after loading the dataset will Classify Features (Attributes) based on class labels then estimate Candidate Support Value, like the condition is While (instances!=null), Do condition if Support Value=Similarity between each instance in the attribute then finding the Total Error Value. Suppose if any instance < 0 then the estimated decision value = Support Value\Total Error, repeated for all points until it will be empty. Therefore mainly we have calculated the entropy and gini index. B. Artificial Neural Network Artificial Neural Networks (ANN) is an interconnected group of nodes that uses a computational model for information processing. It changes its structure based on external or internal information that flows through the network. ANN can be used to model a complex relationship between inputs and outputs and find patterns in data. C. NaĂŻve Bayes Algorithm NaiĚ&#x2C6;ve Bayes is a relatively simple machine learning technique based on probability models - Bayesian theorem. It belongs to the family of probabilistic classifiers in machine learning based on Bayesâ&#x20AC;&#x2122; theorem with a strong statistic independence assumed between the features. đ?&#x2018;&#x192; â&#x201E;&#x17D;đ?&#x2018;&#x2DC; đ?&#x2018;Ľđ?&#x2018;&#x2014; = đ?&#x2018;&#x192; đ?&#x2018;Ľđ?&#x2018;&#x2014; â&#x201E;&#x17D;đ?&#x2018;&#x2DC; đ?&#x2018;&#x192; â&#x201E;&#x17D;đ?&#x2018;&#x2DC; đ?&#x2018;Ľđ?&#x2018;&#x2014; đ?&#x2018;&#x203A; đ?&#x2018;&#x2013;=0 ;0< đ?&#x2018;&#x2DC; < đ?&#x2018;&#x203A; + 1 ; đ?&#x2018;&#x2013;,đ?&#x2018;&#x2014;, đ?&#x2018;&#x2DC; đ?&#x153;&#x2013; đ?&#x2018;? (2) This classification technique analyses the relationship between each feature and the class for each instance to derive a conditional probability for the relationships between the feature values and the class.
All rights reserved by www.grdjournals.com
18