49
can change social science research (including yours) Frank van Harmelen Computer Science Department VU University Amsterdam ative Commons License: owed to share & remix, must attribute & non-commercial

How the Web can change social science research (including yours)

Embed Size (px)

DESCRIPTION

A presentation for a group of PhD students from the Leibniz Institutes (section B, social sciences) to discuss how they could use the Web, and even better the Web of Data, as an instrument in their research.

Citation preview

Page 1: How the Web can change social science research (including yours)

How the Web can change

social science research (including yours)

Frank van HarmelenComputer Science

DepartmentVU University Amsterdam

Creative Commons License: allowed to share & remix,but must attribute & non-commercial

Page 2: How the Web can change social science research (including yours)

Using the web (of data) for e-science

in Social SciencesFrank van HarmelenComputer Science

DepartmentVU University Amsterdam

Creative Commons License: allowed to share & remix,but must attribute & non-commercial

Health Warning:Computer Scientist!

Page 3: How the Web can change social science research (including yours)

This talk is aboutusing the web

as an observational instrument

using the web of data as an even better observational instrument

using the web of data as a data-sharing platform

Page 4: How the Web can change social science research (including yours)

This talk is not aboutit's NOT social science about e-science

(e.g Oxford research center)

it's NOT about high-performance computing(that's just boring infrastructure, let the computer scientists will deal with that)

I don’t discuss online social experiments(crowd sourcing, social games, mech. turk, etc)

Page 5: How the Web can change social science research (including yours)

Who are you?

who is using large computerised data-sets ?

who is using data extracted from the web ?

who is using semantic web data ?

Page 6: How the Web can change social science research (including yours)

This talk is aboutusing the web & the web of data

as an observational instrument &as a sharing platform

Through:A whole bunch of realistic examplesA sketch of the technology

Message = yes, you can do this too!

Page 7: How the Web can change social science research (including yours)

Philosophical confessionI take a strongly positivistic stance

Page 8: How the Web can change social science research (including yours)

Revolution ahead?

Page 9: How the Web can change social science research (including yours)

Effects of observation instruments

Page 10: How the Web can change social science research (including yours)

Effects of observation instruments

Page 11: How the Web can change social science research (including yours)

Effects of observation instruments

Page 12: How the Web can change social science research (including yours)

Effects of observation instruments

Page 13: How the Web can change social science research (including yours)

Effects of observation instruments

Page 14: How the Web can change social science research (including yours)

Example:Political science

Page 15: How the Web can change social science research (including yours)

Question: Is the content of party-political programmes and election speeches predictive of government coalition attempts?

Data • All party manifesto’s, • half a year of all Dutch newspapers

Page 16: How the Web can change social science research (including yours)

Example:Communication science

Page 17: How the Web can change social science research (including yours)

Question: Can we predict the social network at Tn from the content at Tn-1?

Data• Discussions from online forum nl.politiek • 21.000 participants talking about 19 Dutch political parties during 259 weeks

Page 18: How the Web can change social science research (including yours)

Example:Science dynamics

Page 19: How the Web can change social science research (including yours)

Question: Is thematic co-occurence at Ynpredictive of co-authoring at Yn+1?

Data:5 year conference series, 1000 papers/year, 3000 authors/year

Page 20: How the Web can change social science research (including yours)

AmCAT3: Keyword search

Page 21: How the Web can change social science research (including yours)

This works…. sort of….Methods:web scrapingnat. lang. analysis

(parsing, stemming, synonyms, homonyms)

identity resolutionRequiredPhysical Interoperability Syntactic InteroperabilitySemantic Interoperability

Page 22: How the Web can change social science research (including yours)

Web of Data to the rescue

Page 23: How the Web can change social science research (including yours)

General idea of Web of Data(a.k.a. “Semantic Web”)

1. Make data available on the Webin machine-understandable form (formalised)

2. Structure the data and meta-data in ontologies

Page 24: How the Web can change social science research (including yours)

Warning:technical content

coming up

Page 25: How the Web can change social science research (including yours)

Bluffer’s Guide to RDF• Express relations between things:

• Results in labelled network (“graph”)• All labels are actually web-addresses (URIs)• You can “ping” any label and find out more• Bits of the graph can live at physically different

locations & have different owners

Frank y

x

AuthorOf

AuthorOf MITpublishedBy

Subject ObjectPredicate

Page 26: How the Web can change social science research (including yours)

Bluffer’s Guide to RDF Schema

• types for subjects & objects & predicates• Types organised in a hierarchy• Inheritance of properties

Frank y

x

AuthorOf

AuthorOf MITpublishedBy

author book publisher

person artifact

man

Page 27: How the Web can change social science research (including yours)

Ontologies (= hierarchical conceptual vocabularies)Identify the key concepts in a

domainIdentify a vocabulary for these

conceptsIdentify relations between these

conceptsMake these precise enough

so that they can be shared between• humans and humans• humans and machines• machines and machines

Page 28: How the Web can change social science research (including yours)

Biomedical ontologies (a few..) Mesh

• Medical Subject Headings, National Library of Medicine

• 22.000 descriptions EMTREE

• Commercial Elsevier, Drugs and diseases• 45.000 terms, 190.000 synonyms

UMLS• Integrates 100 different vocabularies

SNOMED• 200.000 concepts, College of American

Pathologists Gene Ontology

• 15.000 terms in molecular biology NCBI Cancer Ontology:

• 17,000 classes (about 1M definitions),

Page 29: How the Web can change social science research (including yours)

On the Web of Data, anyone can link anything to anything

x T

[<x> IsOfType <T>]

differentowners & locations

<institute>

Page 30: How the Web can change social science research (including yours)

SPARQL: Bluffer’s GuideSELECT ?country_name ?population

WHERE { ?country a type:LandlockedCountries ; ?country rdfs:label ?country_name ; ?country prop:populationEstimate ?

population . FILTER (?population > 15000000) .

SELECT ?name ?img ?hp ?locWHERE {

?a a mo:MusicArtist ; ?a foaf:name ?name .

OPTIONAL { ?a foaf:homepage ?hp } . }

Page 31: How the Web can change social science research (including yours)

Example:science dynamics

Page 32: How the Web can change social science research (including yours)

Faculteit der Exacte Wetenschappen

MEET JULIE

PhD Student “institutional influences on collaboration patterns in interdisciplinary research”

Page 33: How the Web can change social science research (including yours)

Faculteit der Exacte Wetenschappen

Julie needs data

33

Page 34: How the Web can change social science research (including yours)

Faculteit der Exacte Wetenschappen34

Page 35: How the Web can change social science research (including yours)

Faculteit der Exacte Wetenschappen

DBLP: RDF & RDF Schema

Page 36: How the Web can change social science research (including yours)

Faculteit der Exacte Wetenschappen36

SELECT ?author ?affiliation ?uriAffiliation WHERE { GRAPH <$graph> { {<$article> swrc:author ?author.

OPTIONAL{?author swrc:affiliation ?uriAffiliation.} OPTIONAL{?author swc:affiliation ?affiliation.} }

}}

DBLP Query: 2 weeks 15 mins.

UNION { <$article> foaf:maker ?author.

OPTIONAL{?author swrc:affiliation ?uriAffiliation.} OPTIONAL{?author swc:affiliation ?affiliation.}

} UNION { <$article> dc:creator ?author.

OPTIONAL{?author swrc:affiliation ?uriAffiliation.} OPTIONAL{?author swc:affiliation ?affiliation.}}

Page 37: How the Web can change social science research (including yours)

Example:Dutch census data

(1795 – 1971)

Page 38: How the Web can change social science research (including yours)

40.745.554.078 triples Semantically rich

Page 39: How the Web can change social science research (including yours)

Who’s doing it?

Page 40: How the Web can change social science research (including yours)

The World Bank is also doing it!

http://data.worldbank.org/7,000 indicators from World Bank data

sets.

Page 42: How the Web can change social science research (including yours)

already many billions of facts & rules

Everybody’s doing it!

Encyclopedia

Geographic names (millio

ns)

names of artists & art w

orks

(10.000’s)

scientific bibliographies

hierarchical dictionaries

(UK, FR, N

L)

life-science databases

any CD ever recorded (a

lmost)

May ‘09 estimate > 4.2 billion triples + 140 million interlinks

basic facts on every country on the planet

common sense rules & facts (1

00.000’s)

It gets bigger every month

Page 43: How the Web can change social science research (including yours)

It gets bigger every month

25 billion facts &

relations…

Page 44: How the Web can change social science research (including yours)

And many more

• Reuters• New York Times• EU (EUROSTAT, others)• BBC• Facebook• ….

Page 45: How the Web can change social science research (including yours)

So how good is this observational instrument ? Studies on validity (e.g. in science

dynamics)

methods for provenance & trustmethods for attribution & citation

Page 46: How the Web can change social science research (including yours)

For real ?

“ use the power of information to explore social and economic life on Earth ”

1bn€ over 10 years

Page 47: How the Web can change social science research (including yours)

Pfew….

Page 48: How the Web can change social science research (including yours)

Take home messageuse the web & the web-of-data

to obtain your datause the web-of-data to share your

datayes, you can do this too!

Collaborate with computer scientistsreflect on deeper consquences

for the social sciences(methodological, theoretical, etc)

Page 49: How the Web can change social science research (including yours)

AcknowledgementsI’ve freely used material from the work

ofShenghui WangPaul GrothJulie BirkholzWouter van AtteveldtLaurens van RietveldRinke Hoekstraand many in the Semantic Web

community