12
Sandeep Chaudhary B.tech(CSE) B 102910059 Sandeep Chaudhary 1

Big data(Sandeep Chaudhary)

Embed Size (px)

Citation preview

Page 1: Big data(Sandeep Chaudhary)

Sandeep ChaudharyB.tech(CSE) – B

102910059

Sandeep Chaudhary1

Page 2: Big data(Sandeep Chaudhary)

Outline What is Big Data ?

What makes Data Big Data ?

3 V’s of Big Data

Why do we need Big Data ?

Filtering Big Data Effectively

Risks of Big Data

Statistics about On-Line Usage

Sandeep Chaudhary 2

Page 3: Big data(Sandeep Chaudhary)

What is Big Data ? Big Data is about liberating

data that is large in Volume, broad in Variety, and high in Velocity.

Big Data refers to Data Sets where the size is beyond the ability of typical database Software tools to capture, store, manage and analyze.

Sandeep Chaudhary 3

Page 4: Big data(Sandeep Chaudhary)

What makes Data “Big Data”Big Data is characterized by the 3 V’s :

Volume : larger than “normal”, a challenge to load and process.

Velocity : Rate of arrival posses real-time constraints on what are typically “batch ETL” operations.

Variety : Mix of Data types and varying degrees of Structure.

Sandeep Chaudhary 4

Page 5: Big data(Sandeep Chaudhary)

3 V’s of Big Data

Sandeep Chaudhary 5

Page 6: Big data(Sandeep Chaudhary)

Why do we need Big Data ?Big Data : is a mix of Structured, Semi-structured and

unstructured data –

Typically breaks barriers for traditional RDB Storage.

Typically breaks limit of Indexing.

Typically requires intensive pre-processing before each query to extract.

Sandeep Chaudhary 6

Page 7: Big data(Sandeep Chaudhary)

Filtering Big Data Effectively The extract, transform and load (ETL) process.

Taking a raw feed of data, reducing it, and producing a uasable set of output.

Sandeep Chaudhary 7

Page 8: Big data(Sandeep Chaudhary)

Risks of Big Data Will be so over-whelmed

Need the right people and solve the right problems.

Costs escalate too fast

Isn’t necessay to captue 100%

Many sources of Big Data is private

Self-Regulation

Legal regulation

Sandeep Chaudhary 8

Page 9: Big data(Sandeep Chaudhary)

Some facts and figures related to Online Data Usage : How many Data in the world :

800 Terabytes, 2000

160 Exabytes, 2006

500 Exabytes, 2009

2.7 Zettabytes, 2012

35 Zettabytes, 2020

How many Data generated in ONE Day ?

7 Terabytes, Twitter

10 Terabytes, Facebook

Sandeep Chaudhary

9

Page 10: Big data(Sandeep Chaudhary)

Sandeep Chaudhary 10

Page 11: Big data(Sandeep Chaudhary)

Any Questions

Sandeep Chaudhary 11

Page 12: Big data(Sandeep Chaudhary)

Sandeep Chaudhary

12

For Your Patience Listening.