THE BIG DATA COMBAT SPEC INDIA

DESCRIPTION

The volume of data generated worldwide is exploding. Data is constantly being gathered from many sources, and it keeps increasing manyfold every day. Technology is gearing up to combat BIG DATA.

Page 1: The Big Data Combat

THE BIG DATA COMBAT

SPEC INDIA

Page 2: The Big Data Combat

WHAT IS IT ALL ABOUT?

The volume of data generated worldwide is exploding

Data is constantly being gathered from many sources

Data keeps increasing manyfold every day

Technology is gearing up to combat BIG DATA

Page 3: The Big Data Combat

BIG DATA

A large, high-volume, largely unstructured data set

Too complex to be handled by commonly used database management systems (DBMS, RDBMS)

Big data analysis uses statistical inference to estimate parameters from a large volume of data

Regressions

Nonlinear Relationships

Data Dependencies
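
As a small illustration of the "regressions" item above, the following sketch fits a straight line by ordinary least squares, which is one common way to estimate parameters from data. The function name `fit_line` and the sample figures are hypothetical and chosen only for this example; they do not come from the presentation.

```python
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical sample: hours of traffic vs. gigabytes of logs collected.
hours = [1, 2, 3, 4, 5]
gigabytes = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept = fit_line(hours, gigabytes)
print(f"estimated growth: {slope:.2f} GB per hour, offset {intercept:.2f} GB")
```

On real big data volumes the same sums would typically be computed in a distributed fashion rather than in one process; the arithmetic stays the same.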

Page 4: The Big Data Combat

SOURCES OF DATA TODAY

The Internet

Mobile Devices

Remote Sensing

Software Logs

Cameras

Microphones

Radio Frequency Identification (RFID)

Wireless Sensor Networks

Page 5: The Big Data Combat

THE CHALLENGES

In the Growth & Digitization of This Global Information

Storage

Page 6: The Big Data Combat

VOLUME

BIG Volumes

The unceasing increase in the amount of data created every day

Overwhelming in size

Page 7: The Big Data Combat

VELOCITY

Velocity @ The Speed Of Light…

Speed of data in and out

Transactions

Business Analysis

Page 8: The Big Data Combat

VARIETY

Variety Spices up Big Data too

Data Types

Data Sources

Challenges in

Capture

Curate

Store

Interpretation

Meaningful Analysis

Search

Data Visualization

Page 9: The Big Data Combat

BIG DATA ROLLOUT

Steps for a mature and meaningful data set

Data Profiling

Data Cleansing

Data Integration of structured and unstructured data

Data Merging

Data Migration

Data Replication

ETL / ELT / ETLT Design and Development

Interfacing legacy systems with the modern approach
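
Three of the steps above (profiling, cleansing, merging) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions: the record layout, the field names `id`, `name`, `plan`, and the helper names `profile`, `cleanse`, `merge` are all hypothetical, and a production rollout would normally rely on dedicated ETL tooling.

```python
def profile(records, fields):
    """Data profiling: count missing (None or blank) values per field."""
    missing = {f: 0 for f in fields}
    for rec in records:
        for f in fields:
            value = rec.get(f)
            if value is None or str(value).strip() == "":
                missing[f] += 1
    return missing

def cleanse(records, key):
    """Data cleansing: trim strings, drop rows without a key, de-duplicate on key."""
    seen, clean = set(), []
    for rec in records:
        row = {k: v.strip() if isinstance(v, str) else v for k, v in rec.items()}
        if not row.get(key) or row[key] in seen:
            continue
        seen.add(row[key])
        clean.append(row)
    return clean

def merge(left, right, key):
    """Data merging: left join of two record lists on a shared key."""
    lookup = {r[key]: r for r in right}
    return [{**row, **lookup.get(row[key], {})} for row in left]

# Hypothetical inputs from two systems.
crm = [
    {"id": "c1", "name": " Asha "},
    {"id": "c2", "name": ""},
    {"id": "c1", "name": "Asha"},   # duplicate key
    {"id": "", "name": "Ravi"},     # missing key
]
billing = [{"id": "c1", "plan": "gold"}]

report = profile(crm, ["id", "name"])
customers = merge(cleanse(crm, "id"), billing, "id")
print(report)
print(customers)
```

Profiling runs on the raw data so that the report reflects what the source actually delivered, before cleansing hides the problems.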

Page 10: The Big Data Combat

BIG DATA TOOLS

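Hadoop, a framework for distributed storage (HDFS, its distributed file system) and processing of large data sets

MapReduce, a programming model for processing large data sets in parallel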

Hive for data summarization and adhoc queries

Pig, a high-level data-flow language for parallel processing

HBase, a distributed database offering structured storage for large tables

Sqoop for transferring data between Hadoop and RDBMS

Flume for collecting log data and moving it to centralized data repositories
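
To make the MapReduce entry in the list above more concrete, here is a single-process sketch of the map, shuffle, and reduce phases applied to a word count. It only imitates the programming model in plain Python and does not use Hadoop's API; the function names and the sample log lines are hypothetical.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)

def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())

# Hypothetical log lines standing in for a large input file.
logs = ["error disk full", "warning disk slow", "error network down"]
counts = word_count(logs)
print(counts)
```

In a real cluster the map and reduce calls run on many machines and the shuffle moves data across the network; the per-key independence of `reduce_phase` is what makes that parallelism possible.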

Page 11: The Big Data Combat

IT IS BIG & IS GETTING BIGGER TOO!

Visit http://www.spec-india.com/services/bi-bigdata-database-services.html to request a FREE POC to test drive our services