A Study of Some Data Mining Classification Techniques

Page 1

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395 -0056

Volume: 04 Issue: 04 | Apr -2017

p-ISSN: 2395-0072

www.irjet.net

A Study of Some Data Mining Classification Techniques Mr. Sudhir M. Gorade1, Prof. Ankit Deo2 , Prof.Pritesh Purohit3 1Dept.Of

Computer Science and Engineering, SVCE Indore, M.P, India Dept. Of Computer Science and Engineering, SVCE Indore, M.P. ,India 3Professor Dept. Of Computer Science and Engineering, SVCE Indore, M.P. ,India

2Professor

---------------------------------------------------------------------***--------------------------------------------------------------------1.2 Step 2 - Model used for unknown tuple

Abstract - An Classification is one the most useful and

important techniques. Classification techniques are useful to handle large amount of data. Classification is used to predict categorical class labels. Classification models are used to classifying newly available data into a class label. Classification is the process of finding a model that describes and distinguishes data classes or concepts. Classification methods can handle both numerical and categorical attributes. Constructing fast and accurate classifiers for large data sets is an important task in data mining and knowledge discovery. Classification predicts categorical class labels and classifies data based on the training set. Classification is two steps processes. In this paper we present a study of various data mining classification techniques like Decision Tree, KNearest Neighbor, Support Vector Machines, Naive Bayesian Classifiers, and Neural Networks. Keywords:- classification, prediction ,class label, model, categories.

Testing data set

Classifiers (Model)

Unseen data

Classifying into a class lable

1. INTRODUCTION

Fig.2 - Use of classifier

Classification used two steps in the first step a model is constructed based on some training data set, in seconds step the model is used to classify a unknown tuple into a class label.

2. CHARACTERISTICS OF CLASSIFIERS Each and every classifier has some quality which differential the classifier form other. The properties are known as characteristics of the classifiers. These characteristics are Correctness :- How a classifier classifies tuple accurately is based on these characteristics. To check accuracy there are some numerical values based on number of tuple classify correctly and number of tuple classify wrong. Time :- How much time is required to construct the model? This also includes the time to use by the model to classify then number of tuple (prediction time). In other word this refers to the computational costs. Strength :- ability to classify a tuple correctly even tuple has a noise. Noise can be wrong value or missing value. Data Size :- Classifiers should be independent form the size of the database. Model should be scalable. The performance of the model is not dependent on the size of the database. Extendibility :- Some new feature can be added whenever required. This feature is difficult to implement.

1.1 Step 1 - Construction of a model

Training Data set

Classification algorithm

Classifier (Model) Fig.1 - Model construction step

Š 2017, IRJET

|

Impact Factor value: 5.181

|

ISO 9001:2008 Certified Journal

| Page 3112


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.