International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 06 Issue: 09 | Sep 2019
p-ISSN: 2395-0072
www.irjet.net
Agricultural Data Modeling and Yield Forecasting using Data Mining Techniques Rithesh Pakkala P.1, Akhila Thejaswi R2 1,2Assistant
Professor, Department of Information Science of Engineering, Sahyadri College of Engineering & Management, Mangaluru, Karnataka, India ---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Agriculture encompasses a great impact on the
economy of developing countries. The Global change in climatic conditions and the cost of investment in agriculture are major obstacle for small-holder farmers. The proposed work intends to design a predictive model that provides a cultivation plan for the farmers to high yield of paddy crop using data mining techniques. Data mining techniques extract hidden knowledge through data analysis, unlike statistical approaches. The dataset is collected from the agricultural department. K- Means clustering and various classifiers like Support Vector Machine, Naïve Bayes are applied to meteorological and agronomic data for the paddy crop. The performance of various classifiers are validated and compared. The result of the work is the accurate prediction of crop yield. The final rules extracted by this work are useful for farmers to make proactive and knowledge-driven decisions before harvest.
Key Words: Data mining, Predictive Model, K - Means Clustering, Support Vector Machine, Naïve Bayes classifiers
1. INTRODUCTION Data mining is the process of analysing various hidden patterns of data according to different views for categorization into required information. This data is been collected and gathered from common area, such as agriculture department, for efficient analysis, data mining algorithms, improving business decision making and other information requirements to ultimately reduce the costs and increase revenue. Data mining technique is intended in extracting the hidden, useful and interesting patterns from raw data. Data mining tools predict future trends and behaviours, allowing businesses to make proactive knowledge based decision. Data mining accompanies the use of complicated statistics, analysis tools to find previously unknown, suitable unseen structure and interaction in the huge dataset. It helps to develop a predictive model that provides a cultivation plan for farmers to get high yield of paddy crops. Descriptive data mining tasks featurize the general properties of the data in the database while predictive data mining is used to predict explicit values based on patterns determined from known results. Prediction also involves usage of some fields or variables in the database to predict unknown or future values of other variables that are of
© 2019, IRJET
|
Impact Factor value: 7.34
concern. As far as data mining technique are concerned in most of cases, predictive data mining approaches are been used. Predictive data mining technique is used to predict future crop, pesticides, weather forecasting and fertilizers to be used, revenue to be generated and so on. Forecasting crop productivity is one of the scientific techniques of predicting crop yield before harvest. Data mining techniques like clustering and classification are performed in order to maximize the crop yield prediction. A final prediction model is developed and implemented, that protects farmers from agricultural risks by providing a framework that helps them in scientific decision making in agriculture. Using this predictive model, farmers can plan the cultivation process well in advance. To prevent loss, farmers can identify suitable combinations of varying factors like seed quality, rainfall, temperature and sowing procedure. It is a scientific model that provides suitable cultivation plans to farmers in accordance with the changing agronomic factors. Paddy is a pivotal crop in south India. Yield of paddy crop depends on various meteorological and agronomic factors such as seed quality, rainfall, temperature and sowing procedure. In order to evaluate the relationship between these factors and crop yield and to identify the input variables effecting the output of paddy crop, a realtime data set is collected from farmers cultivating paddy is used in this research. Raw agricultural data are pre-processed and only the necessary factors are established by filtering. The major data mining techniques used in this research are K-means clustering and classifiers such as Support Vector Machine, Naïve Bayes. Performances of the above are compared based on classier accuracy measures. The final knowledge regarding the cultivation plan is discovered, evaluated, and presented. The result of the desire models will help agribusiness associations in equip agriculturists with necessary information as to which factors add to high yield.
2. RELATED WORK This section describes the various works carried out in the relevant fields. Jharna Majumdar et al.[1] proposed a data mining model which is applied on agriculture dataset using different clustering algorithms such as DBSCAN, PAM and CLARA.
|
ISO 9001:2008 Certified Journal
|
Page 448