52
THE THINGS AROUND BIG DATA - CLOUD COMPUTING, GOVERNMENT DATA, LINKED DATA 남궁현 [email protected] [email protected]

THE THINGS AROUND BIG DATA

  • Upload
    -

  • View
    425

  • Download
    4

Embed Size (px)

DESCRIPTION

THE THINGS AROUND BIG DATA - CLOUD COMPUTING, GOVERNMENT DATA, LINKED DATA

Citation preview

Page 1: THE THINGS AROUND BIG DATA

THE THINGS AROUND BIG DATA - CLOUD COMPUTING, GOVERNMENT DATA, LINKED DATA

남궁현 [email protected]

[email protected]

Page 2: THE THINGS AROUND BIG DATA

Involved Projects

독립형 컴포넌트 기반 서비스 지향형 페타급 컴퓨팅 플랫폼 기술 개발

빅데이터 활용을 위한 지식자산 구축 및 실시간 Linked Data 응용 기술개발

ExoBrain 컨소시엄 과제

Page 3: THE THINGS AROUND BIG DATA

Big Data

Page 4: THE THINGS AROUND BIG DATA

Buzz Word…?

Page 5: THE THINGS AROUND BIG DATA

What the Hell is BIG DATA?

Page 6: THE THINGS AROUND BIG DATA

3Vs

Page 7: THE THINGS AROUND BIG DATA

Open Data

Linked Data Government Data

Hadoop

And….

Cloud Computing

Echo-System

NOSQL

Page 8: THE THINGS AROUND BIG DATA

Definition?

Page 9: THE THINGS AROUND BIG DATA

Example

Page 10: THE THINGS AROUND BIG DATA

Ex.1 - Daum

대규모 Log분석

Page 11: THE THINGS AROUND BIG DATA

16시간 1.5시간

Page 12: THE THINGS AROUND BIG DATA

Content Logs 단위뉴스별 실시간 분석

실시간 콘텐츠 피드백

Page 13: THE THINGS AROUND BIG DATA

Ex.2 - LinkedIn

Page 14: THE THINGS AROUND BIG DATA

Simple Graph Analyze

Page 15: THE THINGS AROUND BIG DATA

16TB Scalable Cluster

Page 16: THE THINGS AROUND BIG DATA

기존 시스템에서 처리가 힘든 크기의 데이터

Scalable Computing 환경

Page 17: THE THINGS AROUND BIG DATA

Too Large Size Data

6,000,000,000 Files with 60TB Physical Size

Of One Month

Page 18: THE THINGS AROUND BIG DATA

Machine??

Storage space??

Processing time ??

Page 19: THE THINGS AROUND BIG DATA

Scalable Computing Environment

Page 20: THE THINGS AROUND BIG DATA

Cloud Computing

MapReduce

NOSQL DB

Page 21: THE THINGS AROUND BIG DATA

Cloud?

Page 22: THE THINGS AROUND BIG DATA

Cloud Computing

Page 23: THE THINGS AROUND BIG DATA

Job

Result

Cloud Computing

Page 24: THE THINGS AROUND BIG DATA

Easy Scalability

Page 25: THE THINGS AROUND BIG DATA

Network-wired Hadoop Cluster

MapReduce Framework(e.g. Hadoop)

Page 26: THE THINGS AROUND BIG DATA

NoSQL(e.g. MongoDB, Cassandra)

Page 27: THE THINGS AROUND BIG DATA

… …

MongoDB Cluster

Hadoop Cluster

Storing Processing

Storing and Processing Cluster on Cloud Computing

Page 28: THE THINGS AROUND BIG DATA

MongoDB Cluster

Key:@id+time Value: twitt message

Store

Query Access

@id+time

Twits on MongoDB Cluster

Page 29: THE THINGS AROUND BIG DATA

MapReduce Cluster Map Reduce

@id

@id

#tag

#tag

#tag

#tag

#tag

Input Output

Page 30: THE THINGS AROUND BIG DATA

Application /Analyze

Big Data Handling

Page 31: THE THINGS AROUND BIG DATA

MapReduce

NOSQL DB

Page 32: THE THINGS AROUND BIG DATA

국내에선..?

Page 33: THE THINGS AROUND BIG DATA
Page 34: THE THINGS AROUND BIG DATA

Recent Big Data Research in Korea

Social Data

Governmental Data Linked Data

Page 35: THE THINGS AROUND BIG DATA

Social Big Data Analyze

Page 36: THE THINGS AROUND BIG DATA

Social Big Data Analyze

Page 37: THE THINGS AROUND BIG DATA

Governmental Data

Page 38: THE THINGS AROUND BIG DATA

공유자원포탈(http://data.go.kr) by 인터넷 정보화 진흥원

서울 열린 데이터 광장(http://data.seoul.go.kr) by 서울시 정정보화 사업단

Governmental Data

Page 39: THE THINGS AROUND BIG DATA

Linked Data by Tim Berners Lee

Page 40: THE THINGS AROUND BIG DATA
Page 41: THE THINGS AROUND BIG DATA

Social Data

Governmental Data Linked Data

Page 42: THE THINGS AROUND BIG DATA

Big Data Research = Find Forgotten Data

Page 43: THE THINGS AROUND BIG DATA

Data high- dimensional features Hash Code Decoding

Page 44: THE THINGS AROUND BIG DATA

제 경우는요..

Page 45: THE THINGS AROUND BIG DATA

빅데이터 활용을 위한 지식자산 구축 및 실시간 Linked Data 응용 기술 개발

(2012 ~ 2015, 3Years, 8,000 per Year)

주관기관, 데이터확보, Enrichment

데이터 변환/Sync

데이터/플랫폼 제공

RDF데이터 처리

사용자 응용서비스

Page 46: THE THINGS AROUND BIG DATA

VS

Web of Data

Data, API

XML, OpenAPI RDF, Linked Data

Page 47: THE THINGS AROUND BIG DATA

공공DB 공공DB

공공DB

RDF

TextData

공공DB 공공DB

schema

공공데이터 플랫폼 LOD Publish 개발자지원

데이터/인프라 제공

분할 인덱스 LOD

검색/접근/API

Linked Data기반 응용서비스

공공데이터 플랫폼

자체서비스데이터

LOD 데이터

Page 48: THE THINGS AROUND BIG DATA

IBM Watson ExoBrainProject

Page 49: THE THINGS AROUND BIG DATA

Graph Data Storage

Page 50: THE THINGS AROUND BIG DATA

Knowledge = Large Size Graph Data

Page 51: THE THINGS AROUND BIG DATA

Real-time Graph Data Processing

Page 52: THE THINGS AROUND BIG DATA

감사합니다 [email protected] @chungbuk.ac.kr