14
Big Data Aisha Siddiqa [email protected] C4MCCR, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia.

Introduction to big data

Embed Size (px)

Citation preview

Big Data

Aisha Siddiqa [email protected]

C4MCCR, Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

What is Big Data

• Data is Data, what is “Big” ???

• A Big thing in the field of computing which generates values from large data sets that cannot be analyzed with traditional computing techniques

• Storage

• Processing

• Visualization

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

What is new

What is new traditional data BIG DATA

Data Type Employee records, bank records

Web search, data mining, scientific and medical databases

Data Accumulation

Staff Users, Machines

Processing Centralized Parallel

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Explosion of Big Data (I)

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Explosion of Big Data (II)

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Features of Big Data

Volume

Velocity

Variety

Veracity

Variability

Value

Complexity By: Aisha Siddiqa

C4MCCR, Faculty of Computer Science and

Information Technology, University of Malaya,

Kuala Lumpur, Malaysia.

Real Statistics of Big Data (I) Facebook:

• Collecting about 600 petabytes of data per day

• An average user creates 90 pieces of content each month

• More than 500 million active users

Twitter:

• 9,401 tweets per second

• 1 billion tweets in less than 2 days

• 50 million users from the past year

0

250

500

750

1000

1250

1500

2008 2009 2010 2011 2012 2013 2014 2015

Facebook Users

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Real Statistics of Big Data (II)

• 49,252 Google searches per second

• 187 million new users per month

• 300 hours videos uploaded per minute

• Over 1 billion users

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Real Statistics of Big Data (V)

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Future of Big Data

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Big Data is for Smart Organizations

• Every single bit is valuable

• Only smart organizations realize to keep and process Big Data for better decision making, for survival in competing:

– Customers

– Products

– Services

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Big Data for R&D

• Data is beyond structured, relational databases

• New opportunities for data management in hardware, storage, networking and computing are needed: – Virtualization

– Cloud

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Bid Data Management

• Functional Requirements

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.

Big Data Architecture

By: Aisha Siddiqa C4MCCR,

Faculty of Computer Science and Information Technology,

University of Malaya, Kuala Lumpur, Malaysia.