전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Preview:

DESCRIPTION

전문가 토크릴레이 2탄 Open data and linked data : 웹사이언스 워크그룹 김학래 박사

Citation preview

Open����������� ������������������  Data����������� ������������������  and����������� ������������������  Linked����������� ������������������  Data����������� ������������������  Making Emergent Creativity

김학래,����������� ������������������  코리아데이터허브,����������� ������������������  2012����������� ������������������  

Data.gov  WikiLeaks  

“This  led  to  changes  in  the  cons6tu6on  and  the  establishment    of  a  more  open  government”  –  WikiLeaks  

Let’s  Think  

Open Data starts with making available the data that you already have, in whatever format.

•  Equal access for all •  Licensing, legal issues •  Transparency •  Changing the way government works

Open Data vs Linked Data Quick Summary

Open Data

Linked Data •  URIs •  HTTPs •  RDF vocabularies •  Standards

3

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

4

Web in Evolution “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”

Let’s Start

5

(Source: Mike, 2007)

What is the Semantic Web for? Question

6

Search

Inference

Intelligence

Standards

Google’s Semantic Search Case Studies

People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer

7

an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."

http://schema.org/docs/full.html

The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)

Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.

Apple’s Siri Case Studies

Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."

8

Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS. A Brief History - In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design). - Siri Inc. went after funding and by November 2009 it had secured $15.5 million investment, resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010. - Siri acquired by Apple; iPhone becomes the Virtual Personal Assistant

Knowledge Navigator (1987) a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.

(Source: http://www.youtube.com/watch?v=QRH8eimU_20)

9

Active Ontology Case Studies

A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.

Basic concepts * Ontology : A data structure - Formal representation for domain knowledge - Classes, attributes, relations * Active Ontology : A processing environment - Processing elements arranged according to ontology

notions - Communication channels movie

genre actor rating P P P

P

rule set

rule

condition

action

rule

condition

action

rule condition

action

(Baur et al., 2007)

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

10

Big Data “data that becomes large enough that it cannot be processed using conventional methods”

Let’s Start

11

“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.” -Eric Hansen, SiteSpect founder and CEO

London 2012: Open Data Olympics Best Practices

12

OpenStreetMap - Project Haiti

“Open” material (data) is open if it can be freely used, reused and redistributed by anyone

“Government data” data and information produced or commissioned by government or government controlled entities.

Source: Open Knowledge Foundation, 2010

14

What is Open (Government) Data? Definition

•  Transparency •  Participation •  Collaboration

“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama

“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009

Data.gov  

•  The  first  phase  of  Data.gov  features  downloadable  federal  data  sets  organized  by  category  and  federal  organiza6on.  

•  Data  sets  are  available  for  download  in  XML,  CSV,  and  shape  file  formats.  

Launched  on  May  21,  2009,  Data.gov  allows  ci;zens  to  par;cipate  by  leveraging  federal  data  sets  to  build  applica;ons,  conduct  analysis,  and  perform  research.  

16  

Data.gov.uk  

Establishment  of  the  Public  Sector  Transparency  Board  chaired  by  Francis  Maude,  Minister  for  the  Cabinet  Office    The  Board  will  be  responsible  for  seRng  open  data  standards  across  the  public  sector,  publishing  further  datasets  on  the  basis  of  public  demand  

Prime  Minister,  David  Cameron,  writes  to  all  government  departments,  31  May  2010:  instruc;ng  them  to  free  up  more  datasets  as  part  of  Transparency  Agenda  

17  

hTp://www.prac6calpar6cipa6on.co.uk/odi/wp-­‐content/uploads/2010/06/Open-­‐Data-­‐Impacts-­‐Timeline-­‐Dra[-­‐0.1.png  18  

Postcode Newspaper Where Does My Money Go World Events Visualiser EU Public Data

Applications Case Studies

19

Source: http://tinyurl.com/44rub56

The State of Open Government Data Public Sector Dataset

20

“The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.

Open data instruments Open Data Strategies

21

DK DK

DK DK

US

ES ES

ES

AU

UK

UK ES

AU

US

UK US

AU

AU

UK

US

Education and training

Economic instruments

Voluntary approaches

Legislation and control

Drivers and barries of open data policy implementation Critical factors

22

Strategies and experience in front runner countries 1

2

3

4

5

6

7

8

9

10

Political leadership

Regional initiatives

Citizen initiatives

Market initiatives

Emerging technologies

European legislation

Thought leaders

Possibility of monitoring government

Budgets cuts

Closed government culture

Privacy legislation

Limited quality of data

Limited user-friendliness/information overload

Lack of standardization of open data policy

Security threats

Existing charging models

Uncertain economic impact

Digital divide

Network overload

Source:  Huijboom  and  Van  den  Broek,  2011  

Makes it easy to publish, share, and find dataset. Integrated data storage, processing, viewing and visualization

CKAN – Open Source Data Portal Open Data Portals

23

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

24

.. a system of interlinked hypertext documents accessed via the Internet

The Web as a Global Data Platform Let’s Start

25

HTTP

World Wide Web

URI HTML

26

All data including documents, services, people ...

DATA DATA links

The Semantic Web is not about links between web pages.

27

“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data” - TBL.

Linked Data & The Semantic Web Overview

28

5 Stars Open linked data

Make your stuff available on the Web Make it avaiable as structured data Use open, standard formats (instead of

excel) Use a open data format – URLs,

descriptions Link your data to other people’s data

★★

★★★

★★★★

★★★★★

… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”

29

Growth of Interlinks Overview

2007-05-01 2007-10-08 2007-11-10 2008-02-28 2008-03-31

2008-09-18 2009-03-05 2009-03-27 2009-07-14 2010-09-22

30  October, 2011 295 interlinked datasets, approximately 31 billions triples

DBpedia

Structured Wikipedia

BBC

Best Buy UK Gov

Multimedia Content

Commercial Product Government Data

Linked Data and Open Government Data Why

31

Applications Case Studies

32

DBPedia BBC New York Times thedatahub

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

33

34

Roadmap of linked open government data Conceptual Architecture

“the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.”

(Ding et al., 2012)

Rebuild Fireout

“We won’t get there tomorrow, but maybe the day after” – Rufus Pollock

How to Start

Low-hanging fruit, Less conversational data and quick wins.

Expand, with more….. Data Services Efficiency Costs saving Transparency Participation Inclusion

35  

- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software - Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices, March/April 2011 - Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012 -  Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png -  Page 4: http://www.go-gulf.com/60seconds.jpg -  Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg -  Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi -  Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg -  Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg

Page 2 Case Studies -  http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data -  http://www.bbc.co.uk/news/uk-19050139 -  http://london2012.nytimes.com/results -  http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist -  http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism -  http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised

References

36

For more information contact Haklae Kim via haklae.kim@gmail.com Twitter: haklaekim Or see more activities at: http://blogweb.co.kr http://thedatahub.kr http://getthedata.kr

Recommended