Big Data with Deep Learning (Supervised Vs. Unsupervised): A Survey

Page 1

International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) ISSN (P): 2249-6831; ISSN (E): 2249–7943 Vol. 10, Issue 1, Jun 2020, 19–30 © TJPRC Pvt. Ltd.

BIG DATA WITH DEEP LEARNING (SUPERVISED VS. UNSUPERVISED): A SURVEY ABEER ANWAR AL-QASSAB & HALA JAFFAR AL-LAWATI Department of Information Technology, Al-Zahra College for Women Oman ABSTRACT Due to the dramatic increment in technology development in the last few years, a huge volume of data is being produced every day for each person. This data is coming from different types of applications (like google search, online shopping history, etc.) and it is called Big Data. Big Data represents a massive stream of data which is continuously growing and changing, and it is needed to be controlled, analyzed and monetized to help the decision makers of organizations to innovate their companies. In addition to that, this analysis impacts the existing and future technology. Artificial Intelligence and Machine Learning algorithms are being adopted to provide effective automated tools and operations on big data (like data labeling, analyzing, diagnosing and so). One of the techniques that can be used by machine learning is the “neural networks”, and in this case it is called as Deep Learning. Deep learning could be unsupervised learning which is concerned with labeling and segmenting these large amounts of various unlabeled full of noise information. Or, it could be supervised learning which is concerned with recognizing the labeled data to recognize patterns in it, and then

unsupervised deep learning techniques, clarify when and how to be used, and give an overview on the technologies and applications which are adopting them. KEYWORDS: Big Data, Deep Learning, Supervised Learning & Unsupervised Learning

A SURVEY

to be translated into valuable insights for further implementation. This paper is a review that survey the supervised and

Received: Jan 17, 2019; Accepted: Feb 07, 2020; Published: Feb 29, 2020; Paper Id.: IJCSEITRJUN20203

INTRODUCTION The remarkable evolution of communication technologies including (emails, interactive websites, social media applications, etc.) facilitated the exchange and transfer of great amounts of data (big data) in various forms. It also led to the emergence of the need to develop systems capable of storing and utilizing such data. Information systems have maintained and processed customer’s data in a traditional way in order to provide a variety of services like saving, retrieving and calculating these data. Today; it is much beyond that, the ability of systems to provide knowledge-based services is also required. For example, customer now-a-days looks for websites that are capable of suggesting what suits him of goods, movies, jobs and other things. This is done through efforts of studying data, analyzing it, applying algorithms and techniques that enable systems to derive the knowledge required to do so. And from here the term “Data Science” is initiated.

BIGDATA AND DATA SCIENCE Big Data is referred to a massive volume of data which are mostly structured and unstructured, it is so large data, and it is difficult to process using traditional software techniques.

www.tjprc.org

editor@tjprc.org


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.