Challenges and opportunities in big data analytics

Page 1

Challenges and Opportunities in Big Data Analytics Prof. Sharad Gore Department of Statistics (Retd.) Savitribai Phule Pune University Pune 411 007. Abstract Big Data Analytics have opened up a vast field of challenges for computer engineers, computer scientists, and statisticians with the potential of a great amount and variety of research and development in hardware, software and human expertise. Data science is becoming an integral part of the big data revolution and statistics plays an important role in the emerging area of big data analytics. This talk is about the challenges and opportunities for statistical science and statisticians to participate in the new era of big data analytics as opposed to the classical statistical analysis for developing inferential or predictive models. The large volume, velocity, and variety of big data is making some of the classical analytical methods either irrelevant or inadequate. At the same time, efficient statistical methods are overcoming the problem of time complexity that arises due to large sizes of data. Survey-based methods are making way for model-based methods, because the pervasive methods of data collection are generating data without any human supervision, resulting in observations that are not obtained through survey sampling. The conclusion is that statistics as a scientific endeavor must modify some of its methodology in order to face the challenges and grab the opportunities presented by the big data revolution. 1. Introduction. The United Nations Economics and Social Council has a Statistical Commission. The forty-fifth session of the Commission took place during March 4-7, 2014 on the theme “Big Data and Modernization of Statistical Systems.” The main conclusions drawn at the session were that big data constitute a source of information that cannot be ignored by statisticians and that statisticians must exploit the opportunities and harness the challenges effectively. Pervasive use of electronic devices and the perpetual generation and availability of digital data have led to a fundamental change in the nature of data. Data that are continuously being generated in enormous quantities are being referred to as big data. Big data consist of “high volume, velocity, and variety of data that demand cost-effective and innovative forms of processing.” Big data have the potential of producing more relevant and timely statistics than traditional sources. Most sources of big data are controlled by the private sector and most of the countries have still not promulgated legislation to permit the use of big data in the public domain. 1


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
Challenges and opportunities in big data analytics by Indira Group of Institutes - Issuu