35
Ontology-Based Mediation in the Amine System Project Pisa June 2007 Prof. Dr. José F. Aldana Montes ([email protected]) Prof. Dr. Francisca Sánchez-Jiménez Ismael Navas Delgado Raúl Montañez Almudena Pino-Ángeles Aurelio A. Moya-García José Luis Urdiales http://asp.uma.es

SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Ontology-Based Mediation in the Amine System Project

Pisa June 2007

Prof. Dr. José F. Aldana Montes ([email protected]) Prof. Dr. Francisca Sánchez-Jiménez

Ismael Navas DelgadoRaúl Montañez

Almudena Pino-ÁngelesAurelio A. Moya-García

José Luis Urdiales

http://asp.uma.es

Page 2: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 3: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 4: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Family of Problems

- Systems biology is the study of an organism, viewed as an integrated and interacting network of genes, proteins and biochemical reactions which give rise to life. (Institute of Systems Biology)

- Instead of focusing on individual parts, the focus is on a complete system

Need of integrated access

to different data sources

to enable the study of the system

Page 5: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 6: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Proposed Solution

-Requirements:

- Easily extensible with new resources

- Reusable elements

- Possibility of developing different kind of applications (not only data integration)

- Decisions:

-Take advantage of the Semantic Web

- Annotate data sources with respect to ontologies

- Reuse previous works

Page 7: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Proposed Solution

-Requirements:

- Easily extensible with new databases

- Reusable elements

- Possibility of developing different kind of applications (not only data integration)

- Decisions:

-Take advantage of the Semantic Web

- Annotate data sources with respect to ontologies

- Reuse previous works

Page 8: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 9: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Semantic Directories

Page 10: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

SD-Core Metadata

OMV

SDMO

OMV Instance

OMV Instance

OMV Instance

OMV Instance

OMV InstanceOMV

InstanceOMV

InstanceOMV

InstanceOMV

InstanceSDMO

Instance

Semantic RegisterResou

rce M

etada

ta Rep

ositor

yOntology Metadata Repository

Reasoner Componen

t

Semantic Directory

Page 11: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

OMV

OMV (ONTOLOGY METADATA VOCABULARY) is described with OWL, and each instance of the class OntologyImplementation represents an ontology registered in the Semantic Directory. It is possible to describe some relationships between ontologies

Page 12: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

SDMO

- OMV: link resources with registered ontologies.

- Resource: store information (query capabilities, schema, query interface, name and URI) about resources.

- Mapping: set the relationships between resources and ontologies. Each mapping is related with a similarity instance that establishes the similarity between ontology concepts and resource elements. The mapping class is related with OMV, Resource and Similarity class.

- Similarity: contains three properties (concept1, concept2 and similarityValue) to establish the similarity between an ontology concept and a resource element.

- User: deal with users in the applications.

Page 13: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Interfaces Semantic Directory+listResources(out vector)+listResources(in url, out vector)+listResources(in relatedOntology, out vector)+listResources(in relatedOntology, in relevantConcepts, out vector)+listMappings(in resourceName, out vector)+listMappings(in url, out vector)+getSchema(in resourceName, out vector)+getSchema(in url, out file)+getRelatedEelements(in ontologyName, in concept, out vector)«interface»Semantic Directory::Resource Metadata Repository+listOntologies(out vector)+listOntologies(in conceptName, out vector)+listOntologies(in conceptNameVector, out vector)+getOntology(in ontologyName, out vector)+getOntology(in url, out ontologyFile)«interface»Ontology Metadata Repository+registerOntology(in url, in name)+registerOntology(in ontologyFile, in name)+registerResource(in resourceName, in queryMethod, in schemaMethod, in queryCapabilities)+registerResource(in resourceName, in queryMethod, in schemaMethod, in queryCapabilities, in ontologyName)+registerResource(in resourceName, in queryMethod, in schemaMethod, in queryCapabilities, in ontologyName, in mappings)«interface»Semantic Register

Our goal is to provide applications which will make the semantics of the resources explicit through their commitment with an ontology registered in the Semantic Directory. The applications that can be developed using the Semantic Directory components depend on the extension of the infrastructure by means of new components (built on top of the Semantic Directory)

Page 14: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

SD Conclusions

- Generic Infrastructure

- Basic Functionality

- Extended Functionalities requires Core Extensions: new metadata, interfaces, …

- Fully implemented:

- V 0.9.A: Java, BD MySQL, Racer (Concepts Classification), Web Services

- V 0.9.B: Java, Metadata Files, Jena, Web Services

- V 0.9.C: Java, Metadata Files, Jena, Corba CCM

Page 15: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 16: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Mediator (Data Integration)

Discover availabledata sources

Select datasources & tools

Interact with eachdata source

Combineresults

Users manually interact with each data source getting partial results

Users must select which data sources include relevant information for themselves

Users must locate data sources that are available online and can solve their problems

Users combine partial results to get a partial solution of the problem

Building the full solution

Page 17: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Approach

- GAV (BioBroker):

+ Easy query rewriting

- Extension Global Schema Changes

- LAV

+ Easy addition of new data sources

- Complex Rewriting Process simpler components will allow partial improvements

Page 18: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Approach

- GAV (BioBroker):

+ Easy query rewriting

- Extension Global Schema Changes

- LAV

+ Easy addition of new data sources

- Complex Rewriting Process simpler components will allow partial improvements

Page 19: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Main CharacteristicsUser Query, Ontology (Q,O)

Result (ontology Instances )

Ontology Search /reasoning

Mappings Search

Resource Search

Q, O, QP, {R1, …, Rn}Q, O, Query Plan (QP)

(Q,O)

Controller

Query Planner Q, O, Query Plan (QP)

Reasoner ComponentSemanticsSemantic RegisterResource metadata repositoryOntology Metadata Repository

Data Service

Q, O, QP, {R1, …, Rn}

Q, O, Result (Ontology Instances)

Query Plan Solver/

EvaluatorIntegrator

Data Service

Data Service

Data Service

Mediator

Wrappers developed as Web Services

Resource Semantic Descriptions

Technical Users or Software Developers

Page 20: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Component DivisionUser Query, Ontology (Q,O)

Result (ontology Instances )

Ontology Search /reasoning

Mappings Search

Resource Search

Q, O, QP, {R1, …, Rn}Q, O, Query Plan (QP)

(Q,O)

Controller

Query Planner Q, O, Query Plan (QP)

Reasoner ComponentSemanticsSemantic RegisterResource metadata repositoryOntology Metadata Repository

Data Service

Q, O, QP, {R1, …, Rn}

Q, O, Result (Ontology Instances)

Query Plan Solver/

EvaluatorIntegrator

Data Service

Data Service

Data Service

Mediator

Page 21: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Partial ImprovementsUser Query, Ontology (Q,O)

Result (ontology Instances )

Ontology Search /reasoning

Mappings Search

Resource Search

Q, O, QP, {R1, …, Rn}Q, O, Query Plan (QP)

(Q,O)

Controller

Query Planner Q, O, Query Plan (QP)

Reasoner ComponentSemanticsSemantic RegisterResource metadata repositoryOntology Metadata Repository

Data Service

Q, O, QP, {R1, …, Rn}

Q, O, Result (Ontology Instances)

Query Plan Solver/

EvaluatorIntegrator

Data Service

Data Service

Data Service

Page 22: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Mediator

Reuse Elements

Share Semantics

Reuse sources

User Query, Ontology (Q,O)Result (ontology Instances)Ontology Search/reasoningMappings SearchResource SearchQ, O, QP, {R1, …, Rn}Q, O, Query Plan (QP)(Q,O)ControllerQuery PlannerQ, O, Query Plan (QP)

Reasoner ComponentSemanticsSemantic RegisterResource metadata repositoryOntology Metadata Repository

Data ServiceQ, O, QP, {R1, …, Rn}Q, O, Result (Ontology Instances)Query Plan Solver/EvaluatorIntegratorData ServiceData ServiceData Service

Web Services

XQuery and XML

Ontology Instances

LAV

Mappings

interact with the

user interfacefind a

query plan (QP)

for the user

query

performs the corresponding call to the data

services involved in the sub-queries

Results from data services

(R1, ..., Rn) are composed

Page 23: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Mediator Conclusions

- LAV

- Ontology Based

- Enabled partial improvements

- Limited Reasoning

- First beta version implemented (testing phase):

- Test 1: Bioinformatics Resources

- Test 2: Second Hand Car Resources

Page 24: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 25: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Use CaseASP:

Amine System Project

http://asp.uma.esA common and useful strategy to find the 3D structure of a protein, which cannot be obtained by its crystallization, is to

apply comparative modeling techniques. These work from the primary sequence of

the unsolved protein and predict its 3D structure by comparing it to those of

solved homologous proteins

Page 26: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Domain Ontology

OrganismMetabolic_processCatalytic_activityProtein_bindingProtein_domain_specific_bindingAmino_acid_sequenceHomologue_sequencePolypeptideMacromoleculeMolecular_entityMolecular_3D_StructureFunctional_domainbelong_tobelong_tobelong_tobelong_tois_formed_byMetabolitehasis_characterizedis_formed_byamino_acid_sequencehomologue_sequence_aashomologue_sequence_polypeptidedevelops

SWISS-ProtPDB

BlastP Modeller

JMol

PubChemKegg

BrendaProsite

Page 27: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Domain OntologyASP:

Amine System Project

http://asp.uma.esSWISS-ProtPDB

BlastP Modeller

JMol

PubChemKegg

BrendaProsite

OrganismMetabolic_processCatalytic_activityProtein_bindingProtein_domain_specific_bindingAmino_acid_sequenceHomologue_sequencePolypeptideMacromoleculeMolecular_entityMolecular_3D_StructureFunctional_domainbelong_tobelong_tobelong_tobelong_tois_formed_byMetabolitehasis_characterizedis_formed_byamino_acid_sequencehomologue_sequence_aashomologue_sequence_polypeptidedevelops

Page 28: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Next StepUser Query, Ontology (Q,O)

Result (ontology Instances )

Ontology Search /reasoning

Mappings Search

Resource Search

Q, O, QP, {R1, …, Rn}Q, O, Query Plan (QP)

(Q,O)

Controller

Query Planner Q, O, Query Plan (QP)

Reasoner ComponentSemanticsSemantic RegisterResource metadata repositoryOntology Metadata Repository

Data Service

Q, O, QP, {R1, …, Rn}

Q, O, Result (Ontology Instances)

Query Plan Solver/

EvaluatorIntegrator

Data Service

Data Service

Data Service

Page 29: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Application

Page 30: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Outline

- Family of problems to be solved

- Proposed solution: from Semantics to Data Integration

- Semantic Directories

- Ontology-Based Mediator

- Specific Problem: Use Case

- Conclusions

Page 31: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Conclusions

- SD: Generic Infrastructure

- Two Ontologies to manage metadata

- SDMO: needs improvements and extensions

-Mediator

- Needs testing and improvement

- Study the addition of reasoning in the integration

Page 32: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Conclusions

Use Case: Protein structures contain fundamental information regarding their

function, location and interactions, which is most of the information in their biological

missions. Combining information integration with prediction techniques (as an automatic

process) results in efficient information retrieval and expands the applicability spectrum of structural bioinformatics techniques to non experienced users.

Page 33: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Conclusions

- The problem presented is important in our context, as performing this process automatically will reduce the effort required to solve it.

- Genome Projects have exponentially increased the number of known polypeptide sequences. Thus, any effort to improve efficiency for the extraction of structural information at its highest level should help the advance of many on-going Systems Biology projects.

Page 34: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Conclusions

The main limitation found is the maintenance of the data services, because the developed ones make use of public databases that are not under our control.

The long term success of this and similar proposals rely on the collaboration of data and tool owners.

Page 35: SOFT SYSTEM METHODOLOGY Criticism - NETTAB · 2007. 6. 29. · Title: SOFT SYSTEM METHODOLOGY Criticism Author: student Created Date: 6/29/2007 2:03:12 PM

Contact

José F. [email protected]

http://asp.uma.es

Thank YOU !Thank YOU !.