166
Semantic empowered Digital Library System Colloquium2 DRTC 2 nd sem ISIBC

Semantic Empowered Digital Library System

Embed Size (px)

DESCRIPTION

Libraries play a critical role in the dissemination of Knowledge and serve as repositories of knowledge. Internet has been instrumental in delivering digital information worldwide. With the advent of semantic web, there has been a paradigm shift from digital libraries (DL) to semantic digital libraries (SemDL) to address the issues and challenges of DL. Semantic Web aims at transforming the current web, dominated by unstructured and semi-structured documents into a "web of data". SemDL is an initiative that allows the system to assist end users in retrieval of the most relevant content with respect to a description of their information needs. Semantic techniques have been considered from the perspective of DL and how it enhances the functioning of DLs. The key players of the semantic techniques in the context of DL are XML, XML Schema, RDF, RDF Schema, Web Ontology Language (OWL). Jerome DL is one such success story that shows the potentials of the semantic techniques to sort the problems/challenges of DL and how it improves browsing and searching of resources. The future tends to focus more keenly on the sharing of user knowledge and not merely Information Retrieval. One of the obvious outcomes is the advent of Social Semantic Digital Library (SSDL) to improve user benefits by empowering user interfaces and social networking.

Citation preview

Page 1: Semantic Empowered Digital Library System

Semantic empowered Digital Library System

Colloquium2 DRTC 2nd sem

ISIBC

Page 2: Semantic Empowered Digital Library System

INTRODUCTION

Anwesha Bhattacharya

Page 3: Semantic Empowered Digital Library System

“My two favourite things in life are libraries and bicycles. They both move people

forward without wasting anything”

~Peter Golkin~

Page 4: Semantic Empowered Digital Library System
Page 5: Semantic Empowered Digital Library System

What is a library?

A building or room containing collections of books, periodicals, and sometimes films and recorded music for use or borrowing by the public or the members of an institution.

Page 6: Semantic Empowered Digital Library System

An electronic library (colloquially referred to as a digital library) is a focused collection of digital objects that can include text, visual, audio, video materials, stored as electronic media formats along with means for organizing, storing, and retrieving them. 

Page 7: Semantic Empowered Digital Library System
Page 8: Semantic Empowered Digital Library System
Page 9: Semantic Empowered Digital Library System

Having computers understand things more like human beings do…

Page 10: Semantic Empowered Digital Library System

The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in co-operation. [Tim Berners-Lee , 2001] 

Page 11: Semantic Empowered Digital Library System
Page 12: Semantic Empowered Digital Library System

How is semantics related to Web?

The Semantic Web aims at converting the current web, dominated by unstructured and semi-structured documents into a "web of data“.

Page 13: Semantic Empowered Digital Library System

Let’s introduce Prof. Dutta

With the traditional web modelhe has the following…

Page 14: Semantic Empowered Digital Library System

A faculty page

Page 15: Semantic Empowered Digital Library System

A faculty page

A research page

Page 16: Semantic Empowered Digital Library System

A faculty page

A research page

A blog

Page 17: Semantic Empowered Digital Library System

A faculty page

A research page

A blog

A staff listing page

Page 18: Semantic Empowered Digital Library System

The content of thesesite is fine but there

are no linkages between the data

So visitor find it difficult

to get all the information

they need quickly and easily

Page 19: Semantic Empowered Digital Library System

That’s where the Semantic web comes in…

Page 20: Semantic Empowered Digital Library System

Using code we can create relationships between

websites, people and events…

These can then beunderstood by the browser

and interpreted in a helpful way

Page 21: Semantic Empowered Digital Library System

The semantic web

With all this data beingable to be displayed

simply it provides a muchricher user experience and

offers information that previously might not have

been exposed.

Page 22: Semantic Empowered Digital Library System

Well that’s exciting stuff. But how do we go about getting onboard with semantic web?

There are a few different ways we’ll have a quick and brief look at it

Page 23: Semantic Empowered Digital Library System

YouAre

Here!

Where are we in the “Semantic Web layer cake”?

Page 24: Semantic Empowered Digital Library System

TOPIC SPEAKER

1. Evolution in SDL Manasa Rath

2. Semantic techniques and technologies

Manash Kumar, Mohit Garg

3. Jerome DL Tanmay Mondal, Dibakar Sen, Shiv Shakti Ghosh

4. Semantic techniques in practice

Sandip Das, Jayanta Kr Nayek

5. Conclusion Samhati Soor

Page 25: Semantic Empowered Digital Library System

Evolution in SDL

Manasa Rath

Page 26: Semantic Empowered Digital Library System

Tim

World Wide Web or WWW or Web as defined in Wikipedia

It is a system of interlinked hypertext documents accessed via the Internet.

Who proposed it ? Tim Berners-Lee

INTERNET WEB

Network of Computers Service which runs on the network

• Tim Berners-Lee proposed WWW1989

Page 27: Semantic Empowered Digital Library System

Web 1.0

ScreenShot of msn.com from the year1995

Most of the pages were static

There were only images(mostly animate GIFs..) and hyperlinks

Readers or Users were unable contribute to the site

Page 28: Semantic Empowered Digital Library System

Web 2.0

Pages are made dynamic

Users or readers are allowed to participate with the website

and contribute their views to web.

Technologies widely used in Web 2.0

Page 29: Semantic Empowered Digital Library System

WEB 2.0 for LIBRARY

PROFESSIONALS

Vertical Search

Features

RSS

Social Bookmarking

BOOK CATALOG

Library Thing Group

Extension services

BOOK Information

Review

Ask the Librarian

Global Tag Cloud

Social Information

View

New Reader

FeaturesBLOGS

Information

Page 30: Semantic Empowered Digital Library System

Towards a Semantic Web

-Tasks often require to combine data on the Web-Humans combine these information easily -Sort catalogues on the Web environment-Making the web more meaningful

Page 31: Semantic Empowered Digital Library System

EVOLUTION OF WEB

Page 32: Semantic Empowered Digital Library System

How HTML5 helps to make a semantic web

<dl>

<dt>Name<dt><dd>Mark Zuckerberg</dd>

<dt>Position</dt> <dd><span >Developer advocate</span> for <span>Google, Inc.<span></dd></dl>

Part of a normal webpage

Make it more meaning full…

<dl>

<dt>Name<dt>

<dd itempprop=“name”>Mark Zuckerberg</dd>

<dt>Position</dt>

<dd><span itempprop=“title”>CEO</span> for <span itempprop=“company”>facebook</dd>

</dl>

Page 33: Semantic Empowered Digital Library System
Page 34: Semantic Empowered Digital Library System

Towards Semantic Digital Library- Examples Library Services Semantic Web

Response to information abundance

Library to digital library is developed since the abundance of information increased

Semantic Web was initiated as a means to more effectively manage and take advantage of the increased amount of digital data

Missions grounded in service, information access, and knowledge discovery

Objectives, goals serve the purpose to facilitate information;

Semantic Web strives to allow data to be shared and reused across applications, enterprises, and community boundaries. It is a collaborative effort led by W3C and partners, based on the Resource Description Framework (RDF)

Part of society’s fabric

Part of life, for all walks, in all types, physically and virtually

Current Web is any indication of Semantic Web’s reach, which seems quite logical, the Semantic Web will surely impact millions of people’s lives daily.

Page 35: Semantic Empowered Digital Library System

Examples Library Services Semantic Web

Advancement via international and national standards

Libraries consolidated development of cataloging codes; formalized classificatory and verbal systems; and encoding/communication standards (International Bibliographic Description (ISBD) and Machine Readable Cataloging (MARC), many metadata schemes, Functional Requirements for Bibliographic Records (1998), and Resource Description and Access (RDA)

The Semantic Web has followed a similar path as evidenced by a collection of information standards: eXtensible Markup Language (XML), RDF, OWL, Friend Of A Friend (FOAF), and Simple Knowledge Organizations System (SKOS).

Collaborative spirit

American Library Association, Association of Library Collections and Technical Services, Cataloging and Classification Section (ALA/ALCTS/CCS), committees review cataloging polices and standards, and interact with international organizations (e.g.., IFLA and the Dublin Core Metadata Initiative).

All of the enabling technologies/standards listed above (RDF, OWL, FOAF, and SKOS) have been developed through working groups and public calls for comment.. The World Wide Web Consortium (W3C), the home of the Semantic Web, involves academic, research, and industry members

Page 36: Semantic Empowered Digital Library System

Semantic Web Development

Traditional Services Semantic Web Services

Collection development Semantic Web selection

Cataloging ‘Semantic metadata’ representation

Reference Semantic Web reference service

Classification Knowledge representation

Page 37: Semantic Empowered Digital Library System

Semantic techniques and technologies

Manash Kumar

Page 38: Semantic Empowered Digital Library System

Two characteristics for the construction of the semantic web-

Downward compatibility

Agents fully aware of a layer should also be able to interpret and use information written at lower levels. For example, agents aware of the semantics of OWL can take full advantage of information written in RDF and RDF Schema.

Upward partial understanding

On the other hand, agents fully aware of a layer should take atleast partial advantage of information at higher levels. For example, an agent aware only of the RDF and RDF Schema semantics can interpret knowledge written in OWL partly, by disregarding those elements that go beyond RDF andRDF Schema.

Semantic web architecture

Page 39: Semantic Empowered Digital Library System

Semantic Web Stack

Semantic Web Stack

Illustrates the architecture of the semantic web

Show hierarchy of languages used to create semantic web

Shows how the technologies are organised to make semantic web possible

Page 40: Semantic Empowered Digital Library System

Semantic Web Stack

Page 41: Semantic Empowered Digital Library System

UNICODE

Unicode provides a unique number for every character

no matter what the platform.

no matter what the program.

no matter what the language.

Page 42: Semantic Empowered Digital Library System
Page 43: Semantic Empowered Digital Library System

URI

Uniform Resource Identifier (URI) is an Internet Standard. It's a string of characters used to identify a name or a resource on the Internet. Such identification enables interaction with representations of the resource over a network (typically the World Wide Web) using specific protocols. Schemes specifying a concrete syntax and associated protocols define each URI.

It has several component parts: A scheme name (http)A domain name (www.xxx.com)A path (/sa/edu/yuc/index.html)

An URI identifies a resource either by location, or a name, or both. A URI has two specializations known as URL, URN .

Page 44: Semantic Empowered Digital Library System

XML: Extensible Markup Language

It is a general purpose markup Language for creating specific purpose mark-up languages

Follows the SGML-standards (Standard Generlised Markup Language) With XML the single users can create their own tags (which is not possible with HTML)

Page 45: Semantic Empowered Digital Library System
Page 46: Semantic Empowered Digital Library System

RDF: Resource Description Framework

RDF is a general-purpose language for representing information in the web

Useful to represent metadata about Web resources

RDF describes resources (Both abstract or concrete subjects) identifiable via an URI

The syntax of RDF is based on XML

RDF-documents are written as XML-documents with the tag rdf:RDF

Page 47: Semantic Empowered Digital Library System

RDF Statements

A RDF-statement is described by a triple (S, P, O)

S= Subject of the statement (It’s a URIref)

P= Property (Predicate) of the statement (URIref)

O= Object

Page 48: Semantic Empowered Digital Library System

Graphical Representation of a RDF statement

(subject, predicate, object)

04/10/2023 48

https://www.facebook.com/SudhirShivaramPhotography

Sudhir Shivaram

creator

Resource

Property Type

Property Value

Page 49: Semantic Empowered Digital Library System

RDF-Schema

RDF schema provides a way of building a object model from which the actual data is referenced and which tells us what things really mean.

RDFS allows users to define resources with classes, properties and values

This allows resources to be defined as instances of classes, and subclasses of classes.

Page 50: Semantic Empowered Digital Library System

Semantic techniques and technologies (contd)

Mohit Garg

Page 51: Semantic Empowered Digital Library System

SEMANTIC WEB ARCHITECTURE

Page 52: Semantic Empowered Digital Library System

Ontology?

● Study of existence in philosophy.● Ontology is the philosophical study of the

nature of being, becoming, existence, or reality, as well as the basic categories of being and their relations.

● A data model that represents knowledge as a set of concepts within a domain and the relationship between these concepts.

● It is concerned with the fundamental questions of “what is being?” and “what kinds of things are there?”

Page 53: Semantic Empowered Digital Library System

Example

● Ontology for a people of DRTC

PEOPLE

FACULTY RES. SCH.MASTER STU.

Prof A.ProfJRF SRF 1st Year 2nd Year

Page 54: Semantic Empowered Digital Library System

Ontology Vocabulary

● Vocabulary ==> context-less list of terms, with no defined interrelationships.

● Ontology Vocabulary used to describe (a particular view of) some domain.

-how concepts should be classified.● Examples:-

1.Man

2.Vegetarian

3. Non-veg

Page 55: Semantic Empowered Digital Library System

SPARQL

● Simple Protocol and RDF Query Language● SPARQL is SQL-like language, but uses

RDF triples and resources for both matching part of the query and for returning results of the query.

● Since both RDFS and OWL are built on RDF, SPARQL can be used for querying ontologies and knowledge bases directly as well. Note that SPARQL is not only query language, it is also a protocol for accessing RDF data.

Page 56: Semantic Empowered Digital Library System

The Proof/Rule layer

● rule: informal notion● rules are used to perform inference

over ontologies● rules as a tool for capturing further

knowledge

(not expressible in OWL ontologies)

Page 57: Semantic Empowered Digital Library System

Rule Layer

● Natural form of expressing knowledge in some domain of interest are rules that reflect the notion of consequence.

● Rules come in the form of IF-THEN constructs and allow to express various kinds of complex statements.

● The IF part is also called the body of a rule, while the THEN part is also called its head.

Page 58: Semantic Empowered Digital Library System

Logic layer

● rules, have been formalised using logic to give them a precise semantics.

● Without such a precise formalisation they are vague and ambiguous, and thus problematic for computational purposes.

● The most prominent and fundamental logical formalism classically used for knowledge representation is the “first-order predicate calculus”, or first-order logic.

Page 59: Semantic Empowered Digital Library System

Cryptography layer

● Cryptography (Greek word: Kryptos, which means hidden secrets)

● It is the practice and study of techniques for secure communication in the presence of third parties.

● For reliable inputs, cryptography means are to be used, such as digital signatures for verification of the origin of the sources.

Page 60: Semantic Empowered Digital Library System

The Trust layer

● SW top layer: support for provenance/trust● Provenance: where does the information

come from?● how this information has been obtained?● can I trust this information?

Page 61: Semantic Empowered Digital Library System

Jerome DL

Page 62: Semantic Empowered Digital Library System

Motivation and Overview

Tanmay Mondal

Page 63: Semantic Empowered Digital Library System

Existing Semantic Digital Library Systems

BRICKS

Fedora

SIMILE

JeromeDL

Page 64: Semantic Empowered Digital Library System

BRICKS BRICKS

● Building Resources for Integrated Cultural Knowledge Services (BRICKS) is an open-source

● software framework for the managementof distributed digital assets.

Page 65: Semantic Empowered Digital Library System

The Fedora repository system is an open source, digital object repository system using public APIs exposed as web services.

Fedora

Page 66: Semantic Empowered Digital Library System

● SIMILE = Semantic Interoperability of Metadata In unLike Environments

● motivated by DSpace, repository for storing, indexing, preserving, and redistributing digital assets,

● jointly developed by Hewlett-Packard Research Labs and the MIT Libraries.

SIMILE

Page 67: Semantic Empowered Digital Library System

Joint effort of DERI, National University of

Ireland, Galway and Gdansk University of Technology (GUT).

Distributed under BSD Open Source license.

Digital library build on semantic web technologies to answer requirements from: librarians, scientists and communities.

JeromeDL

Page 68: Semantic Empowered Digital Library System

† JeromeDL has been installed in a number of locations ; the two most user are-

DERI Galway library

WBSS at Gdansk University of Technology

‡ serve their community of users in

everyday activities.

USERS

Page 69: Semantic Empowered Digital Library System
Page 70: Semantic Empowered Digital Library System

Services in JeromeDL

JeromeDL allows librarians to maintain and use the following controlled vocabularies:

authority files - with a list of authors, editors

and publishers;

Classification taxonomies- such as DMoz or DDC, for annotating resources with topics;

WordNet dictionary, for specifying keywords

Page 71: Semantic Empowered Digital Library System

Structure Ontology

Modern digital library systems not only store bibliographic metadata.

They also manage an electronic representation of the content itself.

The structure of the content might, however, depend on the type of the resource.

Page 72: Semantic Empowered Digital Library System

The key feature of every digital library system is-

making bibliographic resources accessible which involves-• domain (topic) categorization of a

resource from the WordNet dictionary.

Support for Legacy Information

Page 73: Semantic Empowered Digital Library System

Resource management

■ Each resource is described by the semantic descriptions according to the JeromeDL core ontology.

■ Additionally a fulltext index of the resource’s content and MARC21, and BibTEX bibliographic descriptions are provided

Page 74: Semantic Empowered Digital Library System

Retrieval features

JeromeDL provides searching and browsing features based on Semantic Web data

Page 75: Semantic Empowered Digital Library System

MultiBeeBrowse allows to browse unstructured metadata represented as an RDF graph.

It consist of access to resource, search services, filter service, similar service, related service, combination service ( conjunction, sum, difference,binding, on two given sets of results).

MultiBeeBrowse (MBB)

Page 76: Semantic Empowered Digital Library System
Page 77: Semantic Empowered Digital Library System

Communication link

►The content of the JeromeDL database can be searched not only through the web pages of the digital library

► But also from the other digital libraries another web applications

Page 78: Semantic Empowered Digital Library System

Social bookmarking

► Users can allow others to see their bookmarks and annotations and share their knowledge within a social network.

►JeromeDL can also treat a single library resource as a blog post.

►Users can comment the content of the resource and reply to others’ comments and this way create new knowledge.

Page 79: Semantic Empowered Digital Library System

Delicious is the leading bookmarking service to save, organize, and discover interesting links on the web.

Page 80: Semantic Empowered Digital Library System

Bottom layer provides a service for a flexible and extendable electronic representation of objects;

The middle layer offers information retrieval and identity management services.

Top layer in the semantic digital library stack utilizes benefits from engaging community of users into annotating and filtering resources

Architecture

Page 81: Semantic Empowered Digital Library System

Architecture (Bottom & Middle layer)

Dibakar Sen

Page 82: Semantic Empowered Digital Library System
Page 83: Semantic Empowered Digital Library System

Bottom layer● The bottom layer handle the physical representation of

resources, their structure and provenance.

● It provides a flexible and extendable electronic representation of objects with its structure ontology.

Page 84: Semantic Empowered Digital Library System

Structure ontology in JeromeDL

Page 85: Semantic Empowered Digital Library System

Bottom layer (contd.)

● For knowledge organisation Jerome DL provides

- authority files, with a list of authors, editors and publishers;

- classification taxonomies, such as DMoz or DDC, for annotating resources with topics;

- WordNet thesaurus, for specifying keywords (domain categorisation).

Page 86: Semantic Empowered Digital Library System

AUTHORITY FILE

Page 87: Semantic Empowered Digital Library System
Page 88: Semantic Empowered Digital Library System
Page 89: Semantic Empowered Digital Library System

CLASSIFIFCATION TAXONOMY

Page 90: Semantic Empowered Digital Library System
Page 91: Semantic Empowered Digital Library System
Page 92: Semantic Empowered Digital Library System

THESAURUS

Page 93: Semantic Empowered Digital Library System
Page 94: Semantic Empowered Digital Library System
Page 95: Semantic Empowered Digital Library System
Page 96: Semantic Empowered Digital Library System

MIDDLE LAYER

• Lifts up legacy bibliogrphic description to semantic level;

• A mediation standard like MarcOnt Ontology is used to dissolve the problem of heterogeneity of different standard (MARC 21, BibTex, Dublin Core).

Page 97: Semantic Empowered Digital Library System

Example of BibTex format

@article{ahu61,

author={Arrow, Kenneth J. and Leonid Hurwicz and Hirofumi Uzawa},

title={Constraint qualifications in maximization problems},

journal={Naval Research Logistics Quarterly},

volume={8},

year = 1961,

pages = {175-191}

}

Page 98: Semantic Empowered Digital Library System

Legacy format00688nam 22001817a

4500001000900000003000400009005001700013020002400030100005200054245008600106250001200192260005200204650004500256942001800301999001700319952008800336952008200424#92005291#XYZ#20140307062926.0# #a0152038655 #c$15.95#1 #aSandburg, Carl,#d1878-1967.#0(San Esteban)54449#1 #aArithmetic /#cCarl Sandburg ; illustrated as an anamorphic adventure by Ted Rand.# #a2nd ed.# #aSan Diego :#bHarcourt Brace Jovanovich,#cc1993.# 0#aArithmetic#xJuvenile poetry.#0(XYZ)86352# #2ddc#cAUDIO#n0# #c31980#d31998# #w2014-03-07#yAUDIO#r2014-03-07#40#00#936532#bWCCC#10#d2014-03-07#70#2lcc#g0.00#aWCCC# #w2014-03-07#70#yAUDIO#r2014-03-07#40#2lcc#00#936533#bWCCC#10#d2014-03-07#aWCCC##

Page 99: Semantic Empowered Digital Library System

XML Format<collection>

<record>

<controlfield tag="001"> 92005291 </controlfield>

<controlfield tag="003">DLC</controlfield>

<datafield tag="020" ind1=" " ind2=" ">

<subfield code="a">0152038655 :</subfield>

<subfield code="c">$15.95</subfield>

<datafield tag="100" ind1="1" ind2=" ">

<subfield code="a">Sandburg, Carl,</subfield>

<subfield code="d">1878-1967.</subfield>

</datafield>

<datafield tag="245" ind1="1" ind2="0">

<subfield code="a">Arithmetic /</subfield>

<subfield code="c">Carl Sandburg ; illustrated as an anamorphic adventure by Ted Rand.</subfield>

</datafield>

<datafield tag="250" ind1=" " ind2=" ">

<subfield code="a">2nd ed.</subfield>

</datafield>

<datafield tag="260" ind1=" " ind2=" ">

<subfield code="a">San Diego :</subfield>

<subfield code="b">Harcourt Brace Jovanovich,</subfield>

<subfield code="c">c1993.</subfield>

</datafield>

<datafield tag="650" ind1=" " ind2="0">

<subfield code="a">Arithmetic</subfield>

<subfield code="x">Juvenile poetry.</subfield>

</datafield>

Page 100: Semantic Empowered Digital Library System

Bibliographic (MarcOnt) Ontology in JeromeDL

Page 101: Semantic Empowered Digital Library System

Middle layer (contd.)● JeromeDL delivers RDF query service to

be able to act as a mash-up sevice.● Protocols like Z39.50, OAI-PMH are used

for the communication purpose.● Natural language querry

“show me ... written by ...”

Regular Expression● Tagstree map

Page 102: Semantic Empowered Digital Library System

EXAMPLE OF REGULAR EXPRESSION

Page 103: Semantic Empowered Digital Library System

TAGSTREE MAP

Page 104: Semantic Empowered Digital Library System
Page 105: Semantic Empowered Digital Library System
Page 106: Semantic Empowered Digital Library System
Page 107: Semantic Empowered Digital Library System
Page 108: Semantic Empowered Digital Library System
Page 109: Semantic Empowered Digital Library System
Page 110: Semantic Empowered Digital Library System
Page 111: Semantic Empowered Digital Library System

Top Layer (Social Services & Semantics in

Use)

Shiv Shakti Ghosh

Page 112: Semantic Empowered Digital Library System

Social Services in JeromeDL

• Involve users into sharing knowledge– Blogs – comments and discussions about

documents and resources

– Tagging – collaborative classification

– Wikis – collaboratively edited additional

descriptions, such as summaries and

interesting facts.

Page 113: Semantic Empowered Digital Library System

FOAF - Describing Social Networks• FOAF - Stands for Friend-of-a-Friend

• Defines properties for a person (but it does not have

to be a person, can be an “agent”)

• FOAFRealm

Page 114: Semantic Empowered Digital Library System

Evaluation of friendship between Person A & Person B

Page 115: Semantic Empowered Digital Library System

Identity management with FOAFRealm

Distance between owner and requester

Friendship level between owner and requester, calculated

across digraph of social network

• Support for single registration and sign on

• Distributed identity management with HyperCuP (“D-FOAF”)

• FOAFRealm is currently implemented as a plugin for Tomcat

(Realm/Valve implementation), with PHP and .NET versions

coming soon

Page 116: Semantic Empowered Digital Library System

Social Networks in Digital Libraries

Resource

xfoaf:Annotation

user_C

creator_B

foaf:knows

marcont:hasCreator

creator_A

foaf:knows

foaf:knows

xfoaf:Directory

user_D

xfoaf:owns

xfoaf:linksTo

xfoaf:isIn

Page 117: Semantic Empowered Digital Library System

JeromeDL – Delivering Semantic Content

• Providing semantic annotations during uploading process:

– open module for handling any taxonomies

– keywords based on WordNet and free tagging.

– defining structure of resources in the JeromeDL ontology

• Social Semantic Collaborative Filtering:• . Catalogs can include (transclusion) friend's catalogues• Access to catalogues can be restricted with social networking-

based polices• SSCF delivers:

– Community-oriented, semantically-rich taxonomies

– Information about a user's interest

– Flows of expertise from the domain expert

– Recommendations based on users previous actions

Page 118: Semantic Empowered Digital Library System

JeromeDL – Semantic Information In Use

• Searching:– Keyword-based search with semantic query expansion– Semantic search:

• Direct RDF quering• Natural language templates

• Browsing– Exibit

• Sharing:– Social Semantic Collaborative Filtering– Semantically Interlinked Online Communities

• Heterogeneous communication

Page 119: Semantic Empowered Digital Library System

Information Retrieval in JeromeDL

Fulltext Index

StructureRepository

MarcOntRepository

Resources’Content

FOAFRealmRepository

(typed)keywords

RDF & NLQuery

OpenSearchRSS

collaborativefiltering

localinterface

distributedinterface

types translation

semantic queryexpansion

RDF Repositories Secure Snapshot

Page 120: Semantic Empowered Digital Library System

Networks of Digital Libraries

• ELP (Extensible Library Protocol) implementation

– communication within JeromeDL network

– adapters for communication with other networks

• D-FOAF integration (distributed user profile

management)

– single sign on and single registration within D-FOAF

network

• HyperCuP integration (scalable P2P network)0 0

11

0

0

11

0

2 2

22

Page 121: Semantic Empowered Digital Library System

Semantic techniques in practice

Sandip Das

Page 122: Semantic Empowered Digital Library System

• Digital libraries identifier.• DOIs are a specific type of URI and similar to the

(ISBN)•  DOIs can be used to retrieve metadata

Source : http://www.tomw.net.au/2003/domains.html

URI

Page 123: Semantic Empowered Digital Library System

UNICODE

The Unicode Standard was designed to be :UniversalEfficientUniformUnambiguous

The present version is Unicode 3.0 covers 49,194 characters of all the scripts in the world and many other symbols

Problems with Unicode :Operating systems

Source : http://michaelseiler.net/2013/08/05/unicode-characters-in-html/

Page 124: Semantic Empowered Digital Library System

XML

• visualize the information on the web

• it doesn't provide described information

• we can use our own tags

• To define a resource “book” titled “Prolegomena to Library Classification” authored by “S. R. Ranganathan”, can be represented in a XML document as

<book>

<title> Prolegomena to Library Classification</title>

<author>S. R. Ranganathan</author>

</book> Source : https://www.google.co.in/search?q=RDF+Storage+:+Jena&rlz=1C2SNJF_enIN574IN574&source=lnms&tbm=isch&sa=X&ei=2nwcU-

Page 125: Semantic Empowered Digital Library System

RDF

• To represent the knowledge in a web page• To provide better search engine capabilities• In cataloging for describing the content• For describing IPR• For expressing the privacy preferences

Page 126: Semantic Empowered Digital Library System

RDF schema

• Describe RDF• Provides a data-modelling vocabulary for

RDF data• Describing groups of related resources.

Page 127: Semantic Empowered Digital Library System

RDF Storage : Sesame

• For querying and analyzing RDF data• Features :

Highly scalable RDF storageHigh  query performance Support for several RDF query

languages

Page 128: Semantic Empowered Digital Library System

RDF Storage : Jena

• Jena provides persistent storage of RDF.

• The Jena layout enables faster insertion and retrieval for  fine-grained API.

• Reduce storage

Page 129: Semantic Empowered Digital Library System

Web Ontology Language (OWL)

• Designed to meet the needs of WWW

• It’s syntax is nearly identical to RDF’s.

• Three variations of OWLOWL LiteOWL DLOWL Full

• An ontology written in OWL DL could be extensively used in Digital Libraries.

Source : http://www.mycutegraphics.com/graphics/owl/polka-dot-owl-books.html

Page 130: Semantic Empowered Digital Library System

Tools for Building ontology

• The are many Ontology tools are available in the present times such as Protégé, OntoEdit, Ontolingua, OilEd, pOWL etc.

Source : http://www.riversmead.org.uk/your-home/shared-owners/repairs-and-maintenance/

Page 131: Semantic Empowered Digital Library System

Ontology editors : Protégé

• Protégé is a free open-source ontology editor.

• Created by Stanford Center for Biomedical Informatics Research.

• Support creation, visualization• Export Ontology into different languages.

Source : http://protege.stanford.edu/download/protege/4.1/installanywhere/Web_Installers/

Page 132: Semantic Empowered Digital Library System

Semantic techniques in practice

Jayanta Kr

Nayek

Page 133: Semantic Empowered Digital Library System

Semantic Web Technologies

OWL

OWL DL

RDF

RDFs

RDFa

SKOS

SPARQL

Control Vocabularies

Page 134: Semantic Empowered Digital Library System

Application area of

SW Technolog

y

Semantic Annotatio

n

Content Manage

ment

CustomizationData

Integration

Domain modeling

Improved search

Portal

Page 135: Semantic Empowered Digital Library System

Benefits of SW technology

open mode

interoperability

data integration

share and re-use data

Multilinguality

service reuse

rapid response to

change

Page 136: Semantic Empowered Digital Library System

LIST OF CASE STUDIES

• Online resource for information on aquatic sciences/Spain July 2009.

• Enriching and sharing cultural heritage data in Europeana, June 2012.

• Use of SWT in Natural language interface to Business Applications, April2007.

• Publishing STW thesaurus for Economics as linked open data, Germany June 2009.

Page 137: Semantic Empowered Digital Library System

Case Study

Publishing STW Thesaurus for Economics as Linked Open Data in German National Library of Economics (ZBW), Germany.

by Timo Borst and Joachim Neubert

June 2009

Page 138: Semantic Empowered Digital Library System
Page 139: Semantic Empowered Digital Library System

Facets

•Activity area:• library, public institution and publishing

•Application area of SW technologies: • semantic annotation, improved search, content management, domain modeling, and data integration

•SW technologies used: • RDF, SPARQL, RDFa, and SKOS

•SW technology benefits: • open mode, rapid response to change, service reuse, and share and re-use data

Page 140: Semantic Empowered Digital Library System

Features of ZBW

• Provides a high-level taxonomy of subject categories.

• Thousands of keywords (“descriptors”) and tens of thousands of both synonyms and links between the thesaurus concepts.

• The media items are indexed with descriptors from this thesaurus. They can be retrieved by these descriptors through the library catalog ECONIS.

Page 141: Semantic Empowered Digital Library System

Challenges

• First, to improve web-based presentation of STW.

• Second, to foster precision of search results by actively suggesting preferred terms from STW.

• Third, to support the integration of STW into other indexing or retrieval environments.

• Fourth, to induce third-party reuse of the STW data, e.g. for customizing the vocabulary.

• Finally, to establish anchor points for linking to other vocabularies and datasets.

Page 142: Semantic Empowered Digital Library System

Solutions

• The “SKOS - Simple Knowledge Organization System” however, built within the Semantic Web community by vocabulary experts and targeting thesauri, classifications, folksonomies.

• Since SKOS is inherently multi-lingual, preferred and alternate labels (synonyms) in English could be attached to concepts as easily as their German equivalents. “Related”, “narrower” and “broader” relations.

• Mapped nicely to the according SKOS properties such as publisher, version and licensing information were added seamlessly through the use of other RDF vocabularies (e.g., Dublin Core).

Page 143: Semantic Empowered Digital Library System

Solutions (contd…)

• From the RDF file, they generated an XHTML page for each concept in the thesaurus and embedded all of the data into this page using RDFa.

• They assigned a persistent, language- and version-independent URI to each page.

• Thus, the set of pages forms a highly interlinked network of semantic relations, usable for both humans and machines.

• Web server content negotiation is used to deliver the format

• (RDF/XML or XHTML, English or German) most appropriate to the request.

Page 144: Semantic Empowered Digital Library System
Page 145: Semantic Empowered Digital Library System
Page 146: Semantic Empowered Digital Library System
Page 147: Semantic Empowered Digital Library System

Conclusion

Samhati Soor

Page 148: Semantic Empowered Digital Library System

LIMITATIONS OF SDL

VASTNESS

VAGUENESS

UNCERTAINTY

INCONSISTENCY

Page 149: Semantic Empowered Digital Library System

PROBLEMS DUE TO LIMITATION

Digital libraries should not be for librarians only, but for average people

Concentration on delivering

content/information, not on knowledge

sharing within a community of users

Digital libraries have lost human-part of

their predecessors

Page 150: Semantic Empowered Digital Library System

SOLUTION Making users/readers involved in the

content annotation process

Allowing users/readers to share their knowledge within a community

Providing better communication between users in and across communities

Achieved through SSDL (SOCIAL SEMANTIC DIGITAL LIBRARY)

Page 151: Semantic Empowered Digital Library System

SSDL The social semantic digital library is an attempt to

restore the collaborative approach to sharing knowledge.

The semantic services help

# to enhance search and browsing features

# to interconnect different systems and exchange data

The social services help

# to gather relevant information from expertise of others

# to improve high rank knowledge sharing in a digital

library

Page 152: Semantic Empowered Digital Library System

HEADING TOWARDS SSDL

LIBRARY

DIGITALLIBRARY

SEMANTICDIGITALLIBRARY

SOCIALSEMANTIC

DIGITAL LIBRARY

ORGANISED COLLECTION

ONLINE, EASY SEARCHING

WITH A FULL-TEXT SEARCH

ACCESSIBLE THROUGHSEMANTIC WEB

INVOLVED THE COMMUNITY

INTO SHARING KNOWLEDGE

Page 153: Semantic Empowered Digital Library System

HELPFULNESS

The social semantic digital library will help digital library

to build heterogeneous networks of Semantic Web. It may deliver more robust, user-friendly, adaptable

search and browsing interfaces empowered by

semantics.

Page 154: Semantic Empowered Digital Library System

RESOUCES AND METADATA

COMMUNITY DRIVEN ONTOLOGIES

ONTOLOGIZED METADATA

SSDL

SDL

DLRESOURCE

1RESORUCE

2RESOURCE

n

Page 155: Semantic Empowered Digital Library System

COMMUNITY ONTOLOGY

Page 156: Semantic Empowered Digital Library System

Social Semantic Digital Library Services in e-Learning

Introducing Personal Learning Environments (PLEs) Addressing some of the open research challenges

in Technology Enhanced Learning (TEL) Enabling effective and reliable

mechanisms for managing various types of knowledge relevant for providing personalized learning experiences in online learning environments

Ability to preserve the semantics of this knowledge while sharing

Interaction with during the learning process

Page 157: Semantic Empowered Digital Library System

Social Semantic Digital Library Services in e-Learning

Page 158: Semantic Empowered Digital Library System

QUESTIONS ARISE…

Do the social and semantic services increase the quality of the answers provided by the users in response to given problems?

Do the social and semantic services increase the accuracy of the references provided by the users to answer given questions?

Do the social and semantic services increase overall satisfaction of using the digital library?

Which services, i.e., semantic, social, or recommendations,are found to be most useful by the end users?

Page 159: Semantic Empowered Digital Library System

FUTURE WORK Future research on semantic features should concentrate more on

improving accuracy of automated recommendations services and usability of existing solutions.

Our future work in the domain of semantic digital libraries, and JeromeDL in particular, will focus on adapting research on the semantic web, web 2.0 and adaptive hypermedia to our system;we work on delivering wiki-like and faceted navigation features for JeromeDL.

Page 160: Semantic Empowered Digital Library System

PERORATION...

Semantic Web digital library would contain features like

semantic blogs

semantic wikis

semantic search

social semantic digital libraries

semantic social networks

semantic social information spaces etc.

These will have

open access - open information - open source

(OPEN MANTRA)

Page 161: Semantic Empowered Digital Library System

PERORATION...

There are some other factors constraint the librarian to initiate and adopt the Social Semantic Web like:

Communication barriers

Absence of metadata representations

Absence of user-friendly applications

Limited available literature

But, librarians need to participate inontologies and social semantic-based conferences to explore the technology more andalso to give wider coverage to their skills and talent.

Page 162: Semantic Empowered Digital Library System

REFERENCESFinding the Concept, Not just the Word: A librarian’s guide to ontologies

and semantics. By- Brandy E. king and kathy Reinold

http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000204

http://drtc.isibang.ac.in/xmlui/bitstream/handle/1849/72/I_unicode.pdf?sequence=2

http://books.google.co.in

http://www.ieee-tcdl.org/Bulletin/v6n1/Yang/yang.html

http://drtc.isibang.ac.in/xmlui/bitstream/handle/1849/26/D_Semanticweb_sneha.pdf?sequence=2

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.105.9150&rep=rep1&type=pdf

http://www.ukoln.ac.uk/nkos/nkos2006/presentations/kruk.pdf

Page 163: Semantic Empowered Digital Library System

http://publications.drdo.gov.in/ojs/index.php/djlit/article/view/1101

http://www.contexta.cl/mediawiki/images/7/72/The_anatomy_of_a_Social_Semantic_Digital_Library.pdf

http://aran.library.nuigalway.ie/xmlui/bitstream/handle/10379/395/3debevcu_1.pdf?sequence=1

http://www.researchgate.net/publication/45458737_Semantic_Web_Technologies_for_Digital_Libraries_From_Libraries_to_Social_Semantic_Digital_Libraries_(SSDL)_Over_Semantic_Digital_Libraries_(SDL)

http://link.springer.com/chapter/10.1007%2F978-3-540-85434-0_10#page-1

http://www.w3.org/2001/sw/sweo/public/UseCases

http://www.w3.org/standards/semanticweb/

Page 164: Semantic Empowered Digital Library System
Page 165: Semantic Empowered Digital Library System

ACKNOWLEDGMENT

Page 166: Semantic Empowered Digital Library System