Characteristics of Big Data: Understanding the Five V’s

Page 1

CharacteristicsofBigData:Understanding theFiveV’s

Volume,Velocity,Variety,Veracity,Value

5V’s ofBig Data

History

It started in the year 2001 with 3V’s, namely Volume,VelocityandVariety. Then Veracity got added, making it4V’s. Then Valuegotadded, making it 5V’s. Later came 8Vs, 10Vsetc.

We will discuss onthe important ones(5V’s) Volume, Velocity, Variety, Veracity, andValue.

1) Volume

It refers to the sizeofBig Data. Data can be consideredBig Data or not is based on the volume. Therapidly increasingvolume data is dueto cloud-computing traffic, IoT, mobiletraffic etc.

www.datacademy.ai Knowledge world

Datagrowth prediction

2) Velocity

It refers to the speedat which the datais getting accumulated. This is mainly due to IoTs, mobiledata, social mediaetc.

In the year2000, Google was receiving 32.8 million searches per day.As for 2018, Google was receiving 5.6billion searches per day!

Approximate monthly active usersasof2018:

Facebook:2.41billion

Instagram: 1billion

Twitter: 320 million

LinkedIn: 575 million

Facebook monthlyactive usersgrowthsince2008

3) Variety

It refers to Structured, Semi-structured and Unstructured data due to differentsources ofdata generatedeitherby humans orby machines.

www.datacademy.ai Knowledge world

Structureddata: It’s the traditional data which is organized and conforms to the formal structure of data. This datacan be stored ina relational database. Example:Bank statement containing date, time, amountetc.

Semi-structureddata: It’s semi-organized data. It doesn’tconform to the formal structure of data.Example:Log files, JSON files, Sensor data, csv files etc.

Unstructureddata: It’s not an organized dataand doesn’tfit into rows and columns structureofa relational database.Example:Text files, Emails, images, videos, voicemails, audio files etc.

4)Veracity

It refers to theassurance of quality/integrity/credibility/accuracy ofthe data. Since the data is collected frommultiple sources, we need to check the data foraccuracybefore using it for business insights.

5)Value

Just because we collected lots ofData,it’s of no value unless we garner some insights out of it. Valuerefers to howuseful the data is indecision making.We need to extract the value oftheBig Data using proper analytics.

WhataretheotherV’s?

Viscosity(complexity or degree of correlation), Variability(inconsistency in data flow), Volatility(durability or how long time data is valid and how longit should be stored),Viability(capability to be live and active), Validity(understandable to find the hidden relationships).

www.datacademy.ai Knowledge world

Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
Characteristics of Big Data: Understanding the Five V’s by Datacademy.ai - Issuu