FOAF: Semantic Based NameSpace for Social Networking. Ehsan Zamiri. Supervisor: Dr. Kahani. Ferdowsi University of Mashhad.

  • Slide 1
  • FOAF: Semantic Based NameSpace for Social Networking. Ehsan Zamiri. Supervisor: Dr. Kahani. Ferdowsi University of Mashhad.
  • Slide 2
  • Outline: Motivation; Introduction; The six popular ontologies; FOAF vocabulary; Why FOAF; Building a FOAF document collection (FOAF document identification, FOAF document discovery); Popular properties of foaf:Person; Applications (personal information fusion, social network analysis)
  • Slide 3
  • Semantic Web. The semantic web vision is that information and services are described using shared ontologies in KR-like markup languages, making them accessible to machines (programs). How do we get there? What kind of ontologies? IEEE SUO? Cyc? What kind of languages? RDF? OWL? RuleML? It's reasonable to start with the simple and move toward the complex: from Dublin Core to Cyc, from RDF to OWL and beyond. Significant semantic web content exists today, using simple vocabularies (e.g., FOAF) and RDF/RDFS.
  • Slide 4
  • The Semantic Web. The more important word in "Semantic Web" is the latter. The KR aspects of the SW were taken off the shelf, the result of 25 years of research done in the AI community. Remember hypertext? It was a nice research backwater going back to the 50s (recall Memex and Xanadu). Hypertext was forever changed by the Web. So maybe the Web will forever change KR. TBL: The Semantic Web will globalize KR, just as the WWW globalized hypertext.
  • Slide 5
  • Web of what? What features does the web bring to the table? Anyone can say anything about anything. The meaning of RDF terms will be (partly) determined socially. It's a web of documents, services, agents and people.
  • Slide 6
  • What kind of ontologies? [Figure: spectrum of ontologies, running from vocabularies through simple ontologies to expressive ontologies. Levels: terms/glossary; thesauri ("narrower term" relation); informal is-a; formal is-a; formal instance; frames (properties); value restriction; disjointness, inverse, part-of; general logical constraints. Examples placed along the spectrum: taxonomies, WordNet, Cyc, RDF, DAML, OO, DB schema, RDFS, IEEE SUO, OWL, UMLS.]
  • Slide 7
  • The Semantic Web today. There are several simple RDF vocabularies that are widely used today: Dublin Core, RSS, FOAF. It's instructive to study how these are being used today, and to track how their usage changes.
  • Slide 8
  • The six most popular ontologies: RDF, DC, RSS, FOAF, RDFS, MCVB. [Figure annotation: "includes the terms necessary for describing vocabularies".] The statistics were generated by http://swoogle.umbc.edu
  • Slide 9
  • A use case: FOAF. FOAF (Friend of a Friend) is a simple ontology for describing people and their social networks. See the FOAF project page: http://www.foaf-project.org/ The CS Department at UMBC crawled the web and discovered over 1,500,000 valid RDF FOAF files. Most of these come from several blogging systems that encode basic user information in FOAF. See http://apple.cs.umbc.edu/semdis/wob/foaf/ Example: Tim Finin, 241037262c252e (cryptographic hash of the email address).
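The hashed email address on this slide corresponds to the foaf:mbox_sha1sum property, which lets a document identify a person without exposing the address itself. A minimal Python sketch, assuming the FOAF vocabulary's convention of applying SHA-1 to the full mailto: URI; the function name and the example.org address are illustrative and do not come from the slides.

```python
import hashlib


def mbox_sha1sum(email: str) -> str:
    """Return the SHA-1 hex digest of the mailto: URI for an email address."""
    # The hash covers the whole URI, including the "mailto:" scheme prefix.
    mailto_uri = "mailto:" + email.strip()
    return hashlib.sha1(mailto_uri.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    # Hypothetical address, used only to show the call.
    print(mbox_sha1sum("alice@example.org"))
```

Two documents that publish the same digest can be taken to describe the same mailbox owner, which is what makes the property useful for merging descriptions later on.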
  • Slide 10
  • FOAF vocabulary http://xmlns.com/foaf/0.1/
  • Slide 11
  • FOAF: why RDF? Extensibility! The FOAF vocabulary provides 50+ basic terms for making simple claims about people. FOAF files can use other RDF terms too: RSS, MusicBrainz, Dublin Core, WordNet, Creative Commons, blood types, star signs, and so on. RDF guarantees freedom of independent extension. OWL provides fancier data-merging facilities. Result: freedom to say what you like, using any RDF markup you want, and have RDF crawlers merge your FOAF documents with others and know when you're talking about the same entities.
  • Slide 12
  • No free lunch! Consequence: we must plan for lies, mischief, mistakes, stale data, slander. The dataset is out of control, distributed, dynamic. Importance of knowing who-said-what: anyone can describe anyone, so we must record data provenance. Modeling and reasoning about trust is critical. Legal, privacy and etiquette issues emerge. Welcome to the real world.
  • Slide 13
  • FOAF example using XML Tim Finin
  • Slide 14
  • FOAF example using XML (cont'd) Tim Finin Tim
  • Slide 15
  • FOAF example using XML (cont'd) Tim Finin Anupam Joshi
  • Slide 16
  • FOAF isn't the only one. Other ontologies are used to publish social information. Swoogle finds >360 RDFS or OWL classes with the local name "person".
  • Slide 17
  • Lots of FOAF tools
  • Slide 18
  • Why FOAF. Information creators: community membership management; unique person identification (privacy preserved); indicating authorship. Information consumers: provenance tracking; social networking; exposing community information to newcomers; matching interests; trust building block.
  • Slide 19
  • Studying how FOAF is being used. What counts as a FOAF document? How can we find FOAF documents?
  • Slide 20
  • Identify a FOAF document
      1. D is an RDF document.
      2. D uses the FOAF namespace.
      3. The RDF graph serialized by D contains the sub-graph: X rdf:type foaf:Person; X foaf:Y Z.
      4. D defines one and only one Person instance.
    D is a generic FOAF document when conditions 1, 2 and 3 are met. D is a strict FOAF document when conditions 1, 2, 3 and 4 are met.
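The four conditions can be sketched as a small classifier. This is an illustrative Python sketch, not the authors' implementation: it assumes the document has already been parsed into (subject, predicate, object) string triples (so condition 1 holds), and it reads condition 4 as "exactly one resource typed foaf:Person". The function name is hypothetical.

```python
FOAF = "http://xmlns.com/foaf/0.1/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def classify_foaf_document(triples):
    """Return 'strict', 'generic', or None for a parsed RDF graph."""
    triples = list(triples)
    # Condition 2: some term in the graph comes from the FOAF namespace.
    uses_foaf = any(term.startswith(FOAF) for t in triples for term in t)
    # Condition 3, first edge: X rdf:type foaf:Person.
    persons = {s for s, p, o in triples
               if p == RDF_TYPE and o == FOAF + "Person"}
    # Condition 3, second edge: the same X has some FOAF property foaf:Y.
    described = {s for s, p, o in triples
                 if s in persons and p.startswith(FOAF)}
    if not (uses_foaf and described):
        return None
    # Condition 4 (assumed reading): exactly one Person instance in D.
    return "strict" if len(persons) == 1 else "generic"


if __name__ == "__main__":
    doc = [
        ("_:a", RDF_TYPE, FOAF + "Person"),
        ("_:a", FOAF + "name", "Tim Finin"),
    ]
    print(classify_foaf_document(doc))
```

A personal profile that describes only its owner would come out as strict, while a document that also types the owner's friends as foaf:Person would come out as generic.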
  • Slide 21
  • Different FOAF collections
      DS-Swoogle: FOAF documents selected from Swoogle's database of ~340K semantic web documents. Swoogle selects at most 1000 documents from any site.
      DS-FOAF: a custom crawler found 1.5M FOAF documents, most from a few large blog sites (e.g., LiveJournal).
      DS-FOAF-Small: a subset of ~7K non-blog FOAF documents from ~1K sites, defining ~37K people.
  • Slide 22
  • FOAF document discovery. Bootstrap: using a web search engine (got 10,000 docs). Discovery: using rdfs:seeAlso semantics (got 1.5M docs). [Table: top 7 FOAF websites]
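The bootstrap-then-follow procedure is essentially a breadth-first crawl over rdfs:seeAlso links. A hedged Python sketch follows, assuming a caller-supplied function that returns the rdfs:seeAlso targets of a document; `discover`, `fetch_see_also`, and the toy link table are all hypothetical names, and no real fetching or RDF parsing is shown.

```python
from collections import deque


def discover(seed_urls, fetch_see_also, limit=1000):
    """Breadth-first discovery of documents reachable via rdfs:seeAlso."""
    seen = set()
    queue = deque()
    for url in seed_urls:            # bootstrap set, e.g. search-engine hits
        if url not in seen:
            seen.add(url)
            queue.append(url)
    found = []
    while queue and len(found) < limit:
        url = queue.popleft()
        found.append(url)
        for target in fetch_see_also(url):
            if target not in seen:   # visit every document at most once
                seen.add(target)
                queue.append(target)
    return found


if __name__ == "__main__":
    # Toy link structure standing in for real FOAF documents.
    links = {"a": ["b", "c"], "b": ["a", "d"], "c": [], "d": []}
    print(discover(["a"], lambda u: links.get(u, [])))
```

Passing the fetcher in as a parameter keeps the traversal logic separate from networking and parsing, which a real crawler would also need to throttle per site.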
  • Slide 23
  • From DS-Swoogle
      17 SWDs add to the definition of foaf:Person (e.g., defining superclasses, disjointness, etc.).
      162 properties are defined for foaf:Person (e.g., properties whose domain is foaf:Person).
      74 properties are defined as relations between people (e.g., properties with both domain and range of foaf:Person).
      582 properties are used (e.g., used to assert something of a foaf:Person instance).
  • Slide 24
  • Popular properties of foaf:Person: top 10 popular properties (per document)
      Rank | non-blog (26,936)        | liveJournal.com (20,298,073) | DS-FOAF-SMALL* (33,790)
      1    | foaf:mbox_sha1sum (0.84) | foaf:mbox_sha1sum (1.0)      | foaf:name (0.80)
      2    | foaf:homepage (0.66)     | dc:description (1.0)         | foaf:mbox_sha1sum (0.71)
      3    | foaf:name (0.64)         | dc:title (1.0)               | foaf:nick (0.51)
      4    | foaf:nick (0.61)         | foaf:nick (1.0)              | foaf:homepage (0.40)
      5    | foaf:weblog (0.60)       | foaf:page (1.0)              | foaf:depiction (0.35)
      6    | foaf:knows (0.44)        | foaf:weblog (0.99)           | foaf:weblog (0.30)
      7    | foaf:mbox (0.38)         | rdfs:seeAlso (0.85)          | foaf:knows (0.28)
      8    | foaf:img (0.38)          | foaf:knows (0.85)            | foaf:surname (0.27)
      9    | bio:olb (0.35)           | foaf:dateOfBirth (0.71)      | foaf:firstName (0.26)
      10   | rdfs:seeAlso (0.34)      | foaf:interest (0.67)         | rdfs:seeAlso (0.26)
      11   |                          |                              | foaf:mbox (0.26)
    *DS-FOAF-SMALL is a new dataset from Oct 2004, based on 7276 evenly sampled documents.
  • Slide 25
  • Popular properties of foaf:Person: top 10 popular properties (per instance)
      Rank | non-blog (26,936)        | liveJournal.com (20,298,073) | DS-FOAF-SMALL* (33,790)
      1    | foaf:name (0.84)         | dc:title (1.74)              | foaf:name (0.69)
      2    | foaf:knows (0.79)        | foaf:interest (1.68)         | foaf:mbox_sha1sum (0.65)
      3    | foaf:homepage (0.63)     | foaf:nick (1.04)             | rdfs:seeAlso (0.39)
      4    | foaf:mbox_sha1sum (0.51) | foaf:weblog (1.00)           | foaf:nick (0.26)
      5    | rdfs:seeAlso (0.40)      | rdfs:seeAlso (0.99)          | foaf:homepage (0.18)
      6    | dc:title (0.31)          | foaf:knows (0.95)            | foaf:mbox (0.15)
      7    | foaf:nick (0.22)         | foaf:page (0.95)             | foaf:weblog (0.15)
      8    | foaf:weblog (0.18)       | dc:description (0.046)       | foaf:firstName (0.11)
      9    | foaf:mbox (0.15)         | foaf:mbox_sha1sum (0.046)    | foaf:surname (0.11)
      10   | daml:equivalentTo (0.13) | foaf:dateOfBirth (0.046)     | foaf:depiction (0.10)
      11   |                          |                              | foaf:knows (0.07)
  • Slide 26
  • Collecting personal information. Example sources: http://www.cs.umbc.edu/~dingli1/foaf.rdf and http://www-2.cs.cmu.edu/People/fgandon/foaf.rdf
  • Slide 27
  • Caution: Collision? Mistake! Example sources: http://www.mindswap.org/~katz/2002/11/jordan.foaf and http://www.ilrt.bris.ac.uk/people/cmdjb/webwho.xrdf
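Collecting personal information from several documents, and staying alert to collisions, can be sketched as follows. This is an illustrative Python sketch rather than the authors' fusion method: it assumes person descriptions are merged when they share a foaf:mbox_sha1sum value (an inverse functional property in the FOAF vocabulary), it keeps the source of every value so that who-said-what stays visible, and it flags merged people with more than one distinct foaf:name as possible collisions or mistakes. All function names, hash values and example.org URLs are hypothetical.

```python
from collections import defaultdict


def fuse_by_mbox_sha1sum(descriptions):
    """Merge (source_url, properties) pairs that share a mailbox hash.

    Returns {hash: {property: {(value, source_url), ...}}}.
    """
    merged = defaultdict(lambda: defaultdict(set))
    for source_url, properties in descriptions:
        key = properties.get("foaf:mbox_sha1sum")
        if key is None:
            continue  # nothing to merge on; skipped in this sketch
        for prop, value in properties.items():
            merged[key][prop].add((value, source_url))  # keep provenance
    return merged


def possible_collisions(merged, prop="foaf:name"):
    """List hashes whose merged description has conflicting values."""
    suspects = []
    for key, props in merged.items():
        distinct_values = {value for value, _source in props.get(prop, ())}
        if len(distinct_values) > 1:
            suspects.append(key)
    return sorted(suspects)


if __name__ == "__main__":
    docs = [
        ("http://example.org/a.rdf",
         {"foaf:mbox_sha1sum": "h1", "foaf:name": "Tim Finin"}),
        ("http://example.org/b.rdf",
         {"foaf:mbox_sha1sum": "h1", "foaf:name": "Tim Finin",
          "foaf:homepage": "http://example.org/~tim"}),
        ("http://example.org/c.rdf",
         {"foaf:mbox_sha1sum": "h2", "foaf:name": "Alice"}),
        ("http://example.org/d.rdf",
         {"foaf:mbox_sha1sum": "h2", "foaf:name": "Bob"}),
    ]
    fused = fuse_by_mbox_sha1sum(docs)
    print(possible_collisions(fused))
```

A conflicting name does not prove an error (people use nicknames), so the flagged hashes are candidates for review rather than automatic rejections.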
  • Slide 28
  • Instances of foaf:Person per document. Zipf's distribution. Sloppy tail: a few FOAF documents contain thousands of instances.
  • Slide 29
  • Degree analysis. For social networks, the in-degree and out-degree of a person are of interest. They can be used to identify hubs and authorities, or to compute other interesting properties or rankings. Analyzing most large social networks reveals that in-degree and out-degree follow a power law or Zipf distribution. We found that to be the case for social networks induced by FOAF documents.
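In-degree and out-degree can be computed directly from the list of "knows" edges. A minimal Python sketch, assuming the network is given as (source, target) pairs derived from foaf:knows statements; the function names and the toy edges are illustrative.

```python
from collections import Counter


def degrees(knows_edges):
    """Return (in_degree, out_degree) counters for a directed edge list."""
    in_degree = Counter()
    out_degree = Counter()
    for source, target in set(knows_edges):  # count duplicate edges once
        out_degree[source] += 1
        in_degree[target] += 1
    return in_degree, out_degree


def degree_histogram(degree):
    """Map each degree value to the number of nodes that have it."""
    return Counter(degree.values())


if __name__ == "__main__":
    edges = [("a", "b"), ("a", "c"), ("b", "c"), ("a", "b")]
    in_degree, out_degree = degrees(edges)
    print(dict(in_degree), dict(out_degree))
    print(dict(degree_histogram(in_degree)))
```

Plotting the histogram on log-log axes is the usual way to check by eye for the power-law or Zipf shape the slide describes; nodes with degree zero do not appear in the counters.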
  • Slide 30
  • In-degree of group. Zipf's distribution. Sharp tail: a few FOAF documents have large in-degrees.
  • Slide 31
  • Out-degree of group. Zipf's distribution. Sloppy tail: a few person directory documents.
  • Slide 32
  • Patterns of FOAF network
      Four types of group: isolated; only in (only one inlink: 97%); only out; both (intermediate).
      Basic patterns: singleton (isolated); star (only out), where an active person publishes friends; clique, a small group.
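The four link types can be assigned mechanically from the edge list. A small Python sketch under the assumption that each group is a node and each foaf:knows statement is a directed edge; the function name and labels are illustrative, not taken from the original analysis.

```python
def classify_nodes(nodes, knows_edges):
    """Label each node 'isolated', 'only in', 'only out', or 'both'."""
    edges = list(knows_edges)
    has_in = {target for _source, target in edges}
    has_out = {source for source, _target in edges}
    labels = {}
    for node in nodes:
        if node in has_in and node in has_out:
            labels[node] = "both"        # intermediate: links in and out
        elif node in has_in:
            labels[node] = "only in"     # mentioned by others only
        elif node in has_out:
            labels[node] = "only out"    # centre of a star pattern
        else:
            labels[node] = "isolated"    # singleton
    return labels


if __name__ == "__main__":
    print(classify_nodes(["a", "b", "c", "d"],
                         [("a", "b"), ("a", "c"), ("b", "c")]))
```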
  • Slide 33
  • Growth of FOAF network. The data suggests that there is a natural evolution for a social network: (1) disjoint, star-like connected components (2) link together to form trees and forests, (3) eventually forming a scale-free network.
  • Slide 34
  • Growth of FOAF network
  • Slide 35
  • The Map of FOAF network
  • Slide 36
  • Conclusions. The semantic web is evolving. There is a growing volume of RDF content. FOAF is one of the early successes. FOAF data is being used. FOAF data is relatively easy to collect and analyze. FOAF data is a good source of social network information.
  • Slide 37
  • Thanks for your attention