View
449
Download
5
Tags:
Embed Size (px)
Citation preview
Presentations by Johannes Keizer is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Dr. Caterina CaraccioloDr. Johannes KeizerOffice of Knowledge Exchange, Research and ExtensionFood and Agriculture Organization of the UN
The VocBench, a collaborative tool for Vocabulary management
Presentation at NSTL, Beijing, China 2012-09-16
johannes keizer/caterina caracciolohttp://aims.fao.org
www.fao.org
johannes keizer/caterina caracciolohttp://aims.fao.org
johannes keizer/caterina caracciolohttp://aims.fao.org
AGROVOC
johannes keizer/caterina caracciolohttp://aims.fao.org
AGROVOC
• In 1980: • AGROVOC thesaurus was born• It was in 3 languages: English, Spanish, French• It was a paper publication
• In 2012: • AGROVOC is an RDF/SKOS resource• AGROVOC is available in 21 languages• 4 languages are under development
johannes keizer/caterina caracciolohttp://aims.fao.org
Number of terms per language in AGROVOC
Linked Open Vocabularies
johannes keizer/caterina caracciolohttp://aims.fao.org
AGROVOC LOD record
johannes keizer/caterina caracciolohttp://aims.fao.org
The problem Thesaurus editors old fashioned, not viable
to edit SKOS
Ontology editors to complex one one side, not specific enough on the other
None of the existing tools had an integrated workflow environment
No possibility of collaborative editing, most tools were not web applications
johannes keizer/caterina caracciolohttp://aims.fao.org
The solution: VocBench
Is a web-based working environment for managing the AGROVOC Concept Scheme
Facilitate the collaborative editing of multilingual terminology and semantic concept information
It includes administration and group management features
It includes workflows for maintenance, validation and quality assurance of the data pool
It is accessible freely to everybody to facilitates collaborative editing
10
johannes keizer/caterina caracciolohttp://aims.fao.org
VocBench
11
johannes keizer/caterina caracciolohttp://aims.fao.org
MODULES
johannes keizer/caterina caracciolohttp://aims.fao.org
GROUP/MODULE PERMISSION
13
VocBench 2.0
johannes keizer/caterina caracciolohttp://aims.fao.org
Coming features
johannes keizer/caterina caracciolohttp://aims.fao.org
VocBench 2.0
VB 2.0 will natively support SKOS
Separation of the business and presentation layers from the data layer
This allows VB to be attached to any SPARQL-compliant triple-store database (Sesame, Jena, Allegrograph, OWLLim)
First release of VB 2.0 will retain the same interface as that of VB 1.0
VB 2.0 will incorporate all the functionalities available in VB 1.0
Multi-user support, roles and right administration, and publishing work flow management will inherited from VB 1.0.
Current Protégé implementation of VB 1.0 will be replaced by Semantic Turkey
All the business logic and data access layers related to ontology management will be handled by Semantic Turkey
johannes keizer/caterina caracciolohttp://aims.fao.org
SYSTEM OVERVIEW
Version 2.0
Front end Back end
Administrative Database(MySQL)
Triple Store
Middleware
Hibernate Layer
SemanticTurkey/
OWLART API
Gilead
Service Wrapper
Layer
Google Web Toolkit(GWT)
Graph Visualization
GWT Incubator
Web services
johannes keizer/caterina caracciolohttp://aims.fao.org
SYSTEM ARCHITECTURE
Version 2.0
Ontology repository (SKOS)System Data Repository
Semantic TurkeyJDBC (MYSQL)
Validation
Stati
stics
Use
r Man
agem
ent
Gro
up
Man
agem
ent
Syst
em P
refe
renc
e
GWT
Conc
ept
Man
agem
ent
Rela
tions
hip
Man
agem
ent
Sear
ch
Sche
me
Man
agem
ent
Impo
rt
Expo
rt
Cons
iste
ncy
Chec
k
VOCBENCH INTERFACE
DEMO
johannes keizer/caterina caracciolohttp://aims.fao.org
LOGIN PAGE
johannes keizer/caterina caracciolohttp://aims.fao.org
Multi-lingual
21
johannes keizer/caterina caracciolohttp://aims.fao.org
Managing change
johannes keizer/caterina caracciolohttp://aims.fao.org
Top concepts
johannes keizer/caterina caracciolohttp://aims.fao.org
Browse concepts
johannes keizer/caterina caracciolohttp://aims.fao.org
Concept relationships
johannes keizer/caterina caracciolohttp://aims.fao.org
Concept notes
johannes keizer/caterina caracciolohttp://aims.fao.org
Browse relationships
johannes keizer/caterina caracciolohttp://aims.fao.org
Validation
johannes keizer/caterina caracciolohttp://aims.fao.org
Export
johannes keizer/caterina caracciolohttp://aims.fao.org
Export
johannes keizer/caterina caracciolohttp://aims.fao.org
Concept Graph view
Further Material
REGISTRATION
33
34
MENUBAR
• Link to different modules
• Sign out link to logout from the system.
• Post Comment link to send the comments about that specific module
• Help link to view the help for specific module
• Display Language link to choose the languages for data to be displayed
MultilingualFilter languages to display terms for each concepts
35
COMMENTS
Allows provision of feedbacks for each module
36
HELP
Steps by steps procedure for each module
37
HOME - RECENT CHANGES
• Lists all the changes in reverse chronological order
38
SEARCH
39
40
SEARCH
• by string: the user can specify if the system should search by exact match, beginning with, contains or fuzzy
• by URI or term code; or by range of term code (e.g. between 123 and 9876)
• by classification schemes
• by creation or modification date
• by specific relationships (e.g. search all concepts using the “has_pest”)
• by status, language
• by notes/attributes
41
SEARCH CRITERIA• Exact match (“water” will find only water)
• Starting with (“water” will find “water”, “water management”, and “watergate”)
• Contain (“water” will find “water”, “water management”, “inland water”, and “watergate”)
• Exact word (“water” will find “water”, “water management”, “inland water” but not “watergate”)
• Case Sensitive (“ABc” will find only for the word “ABc” and not “abc”
• Include Description
– Search also content of definitions, images, editorial notes, scope notes
• Include Spelling Variant
– Search also on different spelling variants
42
CONCEPT MANAGEMENT• Visualize / Navigate Concepts
– depending on the access level, user can edit
• Show selected concept details
– all terms and other information
• Show selected concept relationships
• Delete concept
• Can view the URI of the concept enabling ‘Show URI’ option
CONCEPT MANAGEMENT
43
ADD/DELETE CONCEPT
• Add term to the newly created concept in one language• Can delete the concept and all the relationship associated with that concept
44
45
CONCEPT RELATIONSHIP
• Can create the concept-concept relationship• Inverse relationship is also created
automatically– Ex: If we create A affect B, then B is affected by A
relationship is also created
CONCEPT IMAGE
• Name of the image with description
• URL will point to the image which will open in an external
• Provide the source of the image
• Can add more translation in different language
46
SCOPE/EDITORIAL NOTES• In Editorial Notes, we can post the modifications
made to the selected concepts• In Scope Notes, we can post information for indexing• Both can contain multiple values in multiple
languages
47
CONCEPT DEFINITION• Add definition to the selected concept • Add translation in different languages• Provide the source of the definition• Creation and modified date are set automatically
48
49
TERM MANAGEMENT
• Visualize / Navigate Terms – depending on the user access level and role
• Show selected term details• Show selected term relationships• Show term variances• Add/Edit/Delete term
ADD/EDIT/DELETE TERM• Term actions will be associated to the selected concept • Term can be created in different languages• Term can be set as preferred or non preferred• An option allows preferred terms visualization only in the concept tree
browser
50
CONCEPT GRAPH VISUALIZATION
51
RELATIONSHIP MANAGEMENT
52
LABEL/DEFINITION
• Can add/edit/delete the label for relationship in multiple languages
• Can add/edit/delete definition of the relationships in multiple languages
53
PROPERTIES/ DOMAIN & RANGE
Can assign functional, transitive, inverse & symmetric properties to the selected relationship
Only functional properties can be selected for data type properties
Can specify the domain & range
54
VALIDATION
• Allows to verify the new concept/term/definition/relationships/notes/attributes created, edited or deleted.
55
STATISTICS
Provides statistical data onconceptstermsusersrelationships
56
57
EXPORT
• Formats– SKOS– RDBMS-SQL– TBX– OWL
CLASSIFICATION SCHEMES
• Add/Edit/Delete different classification schemes
• Can create new category or assign existing domain concept to the classification schemes
• Mapping categories to domain concepts is possible
58
USER MANAGEMENT
• Manages user detail information.• Assign each user different group privilege and languages.
59
GROUP MANAGEMENT
• Manages different groups, their permission and members.• Assign allowed actions for each group.
60
CONSISTENCY CHECK
• Checks if there is any inconsistency in the model data.
61
SYSTEM PREFERENCES
• Manage user information/preferences• VocBench interface
62
63
RSS
• All the changes on the VocBench content is published through RSS.
• Helps people who regularly use VocBench to stay informed about the changes.
• Shows
– all the concept/terms/relationships created/edited/deleted
– Users/Groups created/edited/deleted
64
WEB SERVICES
VocBench maintains access
response
uses
SKOS
TripleStore
Other Applications
65
WEB SERVICES
• AGROVOC Concept Server is accessible via web services, which can be called from any client application.
• The web services are realized on Apache Axis running on Tomcat.
• Using web services, –helps third party application for easy access to WB content–changes on the content of AGROVOC Concept Server can be
accessed immediately–reduces the time and effort necessary to download and
incorporate the latest version of the AGROVOC Concept Server content into applications
66
WEB SERVICES
67
CURRENT AGROVOC RELEASES
• AGROVOC Home Page
– http://aims.fao.org/agrovoc
• Latest stable release version 1.0:
– http://agrovoc.mimos.my
• Web Services Test Client
– http://202.73.13.50:55381/ACSWWebserviceV1Client/sampleACSWWebServiceProxy/TestClient.jsp
TECHNICAL DETAILS
68
SYSTEM ARCHITECTUREVersion 1.3
69
Ontology repository (OWL)System Data Repository
Protégé OWL APIJDBC (MYSQL)
Validation
Stati
stics
Use
r M
anag
emen
t
Gro
up
Man
agem
ent
Syst
em
Pref
eren
ce
GWT
Conc
ept
Man
agem
ent
Rela
tions
hip
Man
agem
ent
Sear
ch
Sche
me
Man
agem
ent
Impo
rt
Expo
rt
Cons
iste
ncy
Chec
k
VOCBENCH INTERFACE
SYSTEM OVERVIEWVersion 1.3
70
Front end Back end
Administrative Database(MySQL)
Protégé Triple Store
Middleware
Hibernate Layer
ProtégéOWL API
Gilead
Intermediate Layer
Google Web Toolkit(GWT)
Graph Visualization
GWT Incubator
Web services
VocBench 1.0 2.0
Version 1.x
Version 2.0
71
SYSTEM OVERVIEW• Front End
– GWT/GWT Incubator• Google Web Toolkit
– Graph Visualization• Java Applet
• Middle ware– Service wrapper layer
• separates client and service layer• allows to switch between old Protégé implementation and Semantic Turkey
– Semantic Turkey/OWL Art API– Gilead
• integrating GWT and Hibernate
• Back end– Administrative DB : MySQL– Triple Store: Sesame/Jena/Allegrograph,..
72
Semantic Turkey/OWLArt API• Semantic Turkey provides
– business logic including OWL and SKOS support, – import for OWL and SKOS, – triple-store agnostic design and – an OSGi-compliant plug-in architecture.
• Semantic Turkey uses the OWLArt API which is already partially integrated within VocBench.
• OWLArt API offers– a basic RDF triple query and SPARQL access layer, which need to be
implemented for any specific triple store– methods more closely connected with operations related to the standard
vocabularies of the RDF family (from RDF to RDFS, OWL, SKOS and SKOS-XL).
73
Semantic Turkey Architecture
74
75
TOOLS/TECHNOLOGIES
Tools Description
EclipseEclipse an open source community, whose projects are focused on building an open development platform comprised of extensible frameworks, tools and runtimes for building, deploying and managing software across the lifecycle.
GWT Google Web Toolkit (GWT) is an open source Java software development framework that makes writing ajax web application.
Protégé OWL APIProtégé-OWL API is an open-source Java library for the Web Ontology Language and RDF(S) providing classes and methods to load and save OWL files, to query and manipulate OWL data models, and to perform reasoning.
OWL ART API OWL ART API offer an abstraction layer over different RDF triple store technologies. Currently available for Sesame or Jena Implementation.
Semantic TurkeySemantic Turkey is a platform for Semantic Bookmarking and Ontology Development realized by the ART Research Group at the University of Rome, Tor Vergata
MySQL MySQL is an open source relational database system
Hibernate Hibernate is a powerful, high performance object/relational persistence and query service.
Apache Tomcat Apache Tomcat is the servlet container that is used in the official Reference Implementation for the Java Servlet and JSP technologies.
76
TOOLS/TECHNOLOGIES VersionTool/Technology Version Reference
JAVA 1.5 or abovehttp://java.sun.com/javase/
downloads/index.jsp
Mysql 5 or above http://dev.mysql.com/downloads/
Hibernate 3.1 https://www.hibernate.org/
Gilead 1.2.3 http://noon.gilead.free.fr/gilead/
GWT 1.7http://code.google.com/webtoolkit/
versions.html
GWT Incubator gwt-incubator drop for gwt 1.7 Revision 1686
http://code.google.com/p/google-web-toolkit-incubator/
Protégé OWL API 3.4 Build 130http://protege.cim3.net/download/old-releases/3.4%20betas/build-130/full/
OWL ART API 2.0.4 http://art.uniroma2.it/owlart/
Semantic Turkey 0.8.5.x http://semanticturkey.uniroma2.it/
Apache Tomcat 6 or abovehttp://tomcat.apache.org/download-
55.cgi