Is Big Data Replacing Data Warehouse? Busting the myth

Page 2

Now, let us deep dive a bit into both the technologies:

Data Warehousing Data Warehousing refers to data which is extracted from one or more homogeneous or heterogeneous data sources, and then transforming the data before loading it into a data repository for data analysis. This data analysis is useful and helps in better judgement for improving performances and can be used for reporting. The data repository which is generated from the process is the data warehouse. It is a conceptual architecture which is aimed at storing structured, subjectoriented, time variant, non-volatile data for decision making. Data Warehouse typically stores the historical data, a copy of transaction data specifically structured for query and analysis. A Data Warehouse traditionally brings together data from many transactional and operational systems, which is then presented as a consolidated and the best real version to decision makers at all levels of the organization. A well done data warehouse design allows us to access, report and analyze that information from all the relevant and possible angles; which drives consistent and accurate information as a result.

Big Data Big data is a technology that is used to store the unstructured data from various sources and to manage huge volume of data in Exabyte (1 billion GB) and Zettabytes (1 trillion GB). Big Data can store all kinds of data like structured, semi-structured and unstructured data which can consists of video, audio, unstructured text, etc., while using cheaper storage devices. The data is not processed at one place and is spread across several servers for faster processing and is stored in the native format without any planning or modelling applied. The actual usage of the data needs rules to be applied to the data to get the report.


Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.