A Virtual Distributed Database Model for Creating a Database Federation Jaspreet Singh Pahwa PhD Year 2 School of Computer Science Cardiff University

Page 1: A Virtual Distributed Database Model for Creating a Database Federation

A Virtual Distributed Database Model for Creating a Database Federation

Jaspreet Singh Pahwa

PhD Year 2

School of Computer Science

Cardiff University

Page 2: A Virtual Distributed Database Model for Creating a Database Federation

Overview

• Motivation

• Hypothesis

• Product Class

• Product Class Database

• Grid Enabled Multiple Database Search Service (MDSS)

• MDSS Federation

• Conclusions and further work

Page 3: A Virtual Distributed Database Model for Creating a Database Federation

Motivation

• Information held by a large number of data sources in a Wide Area Network.

• Access to such information held by a large number of organisations.

• Information processing with the aid of independent mechanisms, and

• Sharing of information with those who need it.

Page 4: A Virtual Distributed Database Model for Creating a Database Federation

Hypothesis

• Enabling product definition infrastructure.

• Creating standard product definition using the infrastructure.

• Subscription of product definitions and product creation by product suppliers.

• Building a virtual distributed database using the Grid infrastructure.

• Participation of product suppliers in the virtual distributed database as part of a federation.

• Advertisement of products in the virtual distributed database by Product Suppliers.

• Product search using product definition and the Grid.

Page 5: A Virtual Distributed Database Model for Creating a Database Federation

The Product Class

• A standard product definition for providing product information.

• Acts as a template and provides meta information for creating actual product information.

• Is made up of a number of specifications conforming to different specification types.

• Can be used by Product Suppliers for product advertisement in the federation.

Page 6: A Virtual Distributed Database Model for Creating a Database Federation

[Figure: The Product Class and its various specification types. Diagram labels: Product Class; Product Class Specification; Sub Product Class Specification; Specification Group; Sub-Specification Group; Specification (Value); List Specification (Value, Value); Table Specification (a list of Column Specifications, where each column specification has a list of values relative to the other column specification values).]
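The specification types named on this slide can be pictured as a small set of nested records. The following is only a sketch under assumptions: the class and field names (`Specification`, `ListSpecification`, `TableSpecification`, `SpecificationGroup`, `ProductClass`) are chosen here for illustration and the nesting is one plausible reading of the diagram, not the schema used in the actual system.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Specification:
    """A single named attribute holding one value."""
    name: str
    value: str = ""


@dataclass
class ListSpecification:
    """A named attribute holding several values."""
    name: str
    values: List[str] = field(default_factory=list)


@dataclass
class TableSpecification:
    """Named columns whose values line up row by row."""
    name: str
    columns: Dict[str, List[str]] = field(default_factory=dict)

    def rows(self) -> List[Dict[str, str]]:
        # Pair up the i-th value of every column to form the i-th row.
        names = list(self.columns)
        return [dict(zip(names, row)) for row in zip(*self.columns.values())]


@dataclass
class SpecificationGroup:
    """Groups specifications; may contain sub-groups (assumed nesting)."""
    name: str
    specifications: List[Specification] = field(default_factory=list)
    lists: List[ListSpecification] = field(default_factory=list)
    tables: List[TableSpecification] = field(default_factory=list)
    sub_groups: List["SpecificationGroup"] = field(default_factory=list)


@dataclass
class ProductClass:
    """Template for product information (hypothetical layout)."""
    name: str
    version: int = 1
    groups: List[SpecificationGroup] = field(default_factory=list)
    sub_product_classes: List["ProductClass"] = field(default_factory=list)


# Example: a table specification relating sizes to prices.
pricing = TableSpecification("Pricing", {"Size": ["S", "M"], "Price": ["10", "12"]})
laptop = ProductClass("Laptop", groups=[SpecificationGroup("Commercial", tables=[pricing])])
```

Keeping the table as aligned columns mirrors the slide's description of column specifications whose values are relative to one another; `rows()` simply reads them back as records.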

Page 7: A Virtual Distributed Database Model for Creating a Database Federation

Product Class Versioning

• Versioning of Product Class/Specifications.

• New products emerge.

• Existing products evolve and are assigned more attributes.

• New versions created and assigned.

• Product suppliers notified.

• New classes downloaded.
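The versioning cycle on this slide (a new version is created, suppliers are notified, the new class is downloaded) could be sketched as follows. This is a minimal illustration only: `ProductClassRegistry` and its methods are hypothetical names, and the in-memory callbacks stand in for whatever notification mechanism the real system uses.

```python
from typing import Callable, Dict, List, Optional


class ProductClassRegistry:
    """Hypothetical in-memory registry of versioned product class definitions."""

    def __init__(self) -> None:
        self._versions: Dict[str, List[dict]] = {}
        self._subscribers: Dict[str, List[Callable[[str, int], None]]] = {}

    def subscribe(self, class_name: str, callback: Callable[[str, int], None]) -> None:
        # A supplier registers interest in a product class.
        self._subscribers.setdefault(class_name, []).append(callback)

    def release(self, class_name: str, definition: dict) -> int:
        # Store the definition as the next version and notify every subscriber.
        versions = self._versions.setdefault(class_name, [])
        versions.append(dict(definition))
        version = len(versions)
        for notify in self._subscribers.get(class_name, []):
            notify(class_name, version)
        return version

    def download(self, class_name: str, version: Optional[int] = None) -> dict:
        # Return the requested version, or the latest one by default.
        versions = self._versions[class_name]
        return versions[-1] if version is None else versions[version - 1]


registry = ProductClassRegistry()
notices = []
registry.subscribe("Laptop", lambda name, version: notices.append((name, version)))
registry.release("Laptop", {"attributes": ["ram"]})
registry.release("Laptop", {"attributes": ["ram", "battery"]})
latest = registry.download("Laptop")
```

Older versions remain downloadable, so a supplier that has not yet migrated can keep working against the version it subscribed to.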

Page 8: A Virtual Distributed Database Model for Creating a Database Federation

Product Class Subscription

[Figure: Product Class Subscription. Diagram labels: Product Class Database; Table (×4); XmlParse Service; <xml> … </xml>; Transport Layer (×2); Product Supplier Database.]
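The subscription path suggested by the diagram labels (tables serialised to XML, transported, then parsed into the supplier's database) might look roughly like this. It is a sketch under stated assumptions: the element names `ProductClass` and `Specification`, the `specification` table and both function names are invented for illustration, and SQLite stands in for the supplier database.

```python
import sqlite3
import xml.etree.ElementTree as ET


def export_product_class(name: str, version: int, spec_names: list) -> str:
    """Serialise a product class definition to an XML string (assumed layout)."""
    root = ET.Element("ProductClass", {"name": name, "version": str(version)})
    for spec in spec_names:
        ET.SubElement(root, "Specification").text = spec
    return ET.tostring(root, encoding="unicode")


def import_product_class(xml_text: str, conn: sqlite3.Connection) -> int:
    """Parse the XML and store one row per specification; return the row count."""
    root = ET.fromstring(xml_text)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS specification "
        "(product_class TEXT, version INTEGER, name TEXT)"
    )
    rows = [
        (root.get("name"), int(root.get("version")), spec.text)
        for spec in root.findall("Specification")
    ]
    conn.executemany("INSERT INTO specification VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


# The string returned by export_product_class is what would cross the transport layer.
payload = export_product_class("Laptop", 2, ["ram", "battery"])
supplier_db = sqlite3.connect(":memory:")
stored = import_product_class(payload, supplier_db)
```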

Page 9: A Virtual Distributed Database Model for Creating a Database Federation

The Product Class Database

• A tool for creating Product Classes and Product Class Specifications.

• Enables Product Class versioning.

• Releases Product Class for subscription by Product Suppliers.

• Provides meta information for building product search criteria.

Page 10: A Virtual Distributed Database Model for Creating a Database Federation

Product Class Database Structure

[Figure: Product Class Database Structure. Diagram labels: Product Class; Product Category; Product Class Version; Product Class Definition; Category Hierarchy; Category; Specification List Version; Specification Group Version; Table Version; List Specification; Sub Product Class Version; Specification Group; Table Specification; Sub Product Class Definition; List Definition; Specification Group Definition; Table Definition; Sub Product Class.]

Page 11: A Virtual Distributed Database Model for Creating a Database Federation

The MDSS

• Multiple Database Search Service.

• A database-centric Grid service for accessing and processing information from a large number of data sources.

• Searches supplier data sources based on search criteria.

• A channel for product suppliers to advertise products to a large user base.

• Built using Globus Toolkit 3.0.2 (Core).

Page 12: A Virtual Distributed Database Model for Creating a Database Federation

The Search

• What to search?

- the search criteria submitted by the user.

• Where to search?

- data sources that match the user’s request.

• What are the available resources to propagate the search?

- the available Database Search Services in the MDSS.

Page 13: A Virtual Distributed Database Model for Creating a Database Federation

Search Criteria

Where to Search?

<NewDataSet>
  <Supplier>
    <IDSupplier>1</IDSupplier>
    <SupplierWS>http://131.251.42.40/test/categories.asmx</SupplierWS>
  </Supplier>
  …
</NewDataSet>

Available GSHs

<NewDataSet>
  <GridServiceHandle>
    <IDGsh>1</IDGsh>
    <GSH>http://131.251.42.40:8080/ogsa/services/DatabaseSearchService</GSH>
  </GridServiceHandle>
  <GridServiceHandle>
    <IDGsh>2</IDGsh>
    <GSH>http://131.251.47.110:18080/ogsa/services/DatabaseSearchService</GSH>
  </GridServiceHandle>
  …
</NewDataSet>
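Both documents on this slide share the same shape: a `NewDataSet` root holding records with an identifier and a URL. A reader could extract them with the Python standard library as sketched below. The helper name `parse_records` is an assumption, and the sample strings contain only the records shown on the slide; the elided entries are left out rather than guessed.

```python
import xml.etree.ElementTree as ET

# Only the records visible on the slide; the elided ones are omitted.
SUPPLIERS_XML = """<NewDataSet>
  <Supplier>
    <IDSupplier>1</IDSupplier>
    <SupplierWS>http://131.251.42.40/test/categories.asmx</SupplierWS>
  </Supplier>
</NewDataSet>"""

GSH_XML = """<NewDataSet>
  <GridServiceHandle>
    <IDGsh>1</IDGsh>
    <GSH>http://131.251.42.40:8080/ogsa/services/DatabaseSearchService</GSH>
  </GridServiceHandle>
  <GridServiceHandle>
    <IDGsh>2</IDGsh>
    <GSH>http://131.251.47.110:18080/ogsa/services/DatabaseSearchService</GSH>
  </GridServiceHandle>
</NewDataSet>"""


def parse_records(xml_text: str, record_tag: str, id_tag: str, url_tag: str) -> dict:
    """Map each record's integer id to its URL (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    return {
        int(record.findtext(id_tag)): record.findtext(url_tag)
        for record in root.findall(record_tag)
    }


suppliers = parse_records(SUPPLIERS_XML, "Supplier", "IDSupplier", "SupplierWS")
handles = parse_records(GSH_XML, "GridServiceHandle", "IDGsh", "GSH")
```

The first mapping answers "where to search"; the second lists the Database Search Service instances available to carry the search out.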

Page 14: A Virtual Distributed Database Model for Creating a Database Federation

The Architecture

Two major components:

• Master Grid Service: identify resources available, register new resources, job allocation, data parsing and data aggregation.

• Database Search Service: data search, data aggregation, data parsing, collaborative working.

Page 15: A Virtual Distributed Database Model for Creating a Database Federation

[Figure: MDSS Architecture. Diagram labels: Single user / VO; Portal (search criteria specified here); Master Grid Service (metadata query, job allocation, collation of datasets); Metadata (Product Classes); Grid Engine (based on the OGSA model); Apache Axis SOAP Server; SOAP messages; Grid DB Search Service (×3); WS Layer (×6); Supplier databases.]

Page 16: A Virtual Distributed Database Model for Creating a Database Federation

Design Features

• Ability to register new data sources.

• Ability to enhance MDSS capability by adding more machines in the Grid cluster.

• Supplier databases free of Grid complexities.

• Ability to identify data sources to be searched in real time.

• Distribute search equally among available GSHs.

• New instance of Database Search Service for each operation.
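One simple way to distribute a search equally among the available GSHs is round-robin assignment of supplier data sources to handles. The slides do not say which allocation policy the Master Grid Service uses, so the function below is an assumed illustration and `allocate` is a hypothetical name.

```python
from itertools import cycle
from typing import Dict, List


def allocate(suppliers: List[str], handles: List[str]) -> Dict[str, List[str]]:
    """Assign supplier data sources to Grid Service Handles in round-robin order."""
    if not handles:
        raise ValueError("no Database Search Service handles available")
    allocation: Dict[str, List[str]] = {handle: [] for handle in handles}
    # cycle() repeats the handles so every supplier gets the next handle in turn.
    for supplier, handle in zip(suppliers, cycle(handles)):
        allocation[handle].append(supplier)
    return allocation


jobs = allocate(["s1", "s2", "s3", "s4", "s5"], ["gsh1", "gsh2"])
```

With this policy, no handle receives more than one supplier above any other, and adding a machine to the Grid cluster only means adding its handle to the list.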

Page 17: A Virtual Distributed Database Model for Creating a Database Federation

[Figure: Master Grid Service Modules. Diagram labels: GSH Document Parser; Supplier Document Parser; Xml Converter; Database Search Service; Job Execution; Grid Service Manager; Data Aggregation; Master Grid Service; OGSA Container.]

Page 18: A Virtual Distributed Database Model for Creating a Database Federation

MDSS Federation

• An information retrieval model for retrieving product information without the need to create a federated schema.

• Encompasses the MDSS System and all participating SD Systems.

• Cooperation with the centralised MDSS System for advertisement of products to VOs.

• Provides an environment for data models to compete with each other.

• The database access mechanism is Grid enabled, not the data sources.

• Component data models participate in the federation by providing a web service interface.
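The idea of a federation without a federated schema can be illustrated as a fan-out search: every supplier answers the same criteria through its own interface, and the results are simply collated. This is a sketch under assumptions only. The in-memory `make_supplier` function stands in for a supplier web service, and `federated_search` is a hypothetical name, not part of the MDSS.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

Product = Dict[str, str]
SupplierSearch = Callable[[Dict[str, str]], List[Product]]


def make_supplier(products: List[Product]) -> SupplierSearch:
    """Stand-in for a supplier web service; its storage stays private to it."""
    def search(criteria: Dict[str, str]) -> List[Product]:
        return [
            product for product in products
            if all(product.get(key) == value for key, value in criteria.items())
        ]
    return search


def federated_search(suppliers: Dict[str, SupplierSearch],
                     criteria: Dict[str, str]) -> List[Product]:
    """Send the same criteria to every supplier and collate the answers."""
    with ThreadPoolExecutor(max_workers=max(1, len(suppliers))) as pool:
        futures = {name: pool.submit(search, criteria)
                   for name, search in suppliers.items()}
        results: List[Product] = []
        # Collect in registration order so the collated output is deterministic.
        for name, future in futures.items():
            for product in future.result():
                results.append({**product, "supplier": name})
    return results


federation = {
    "A": make_supplier([{"type": "laptop", "ram": "8GB"}, {"type": "phone", "ram": "4GB"}]),
    "B": make_supplier([{"type": "laptop", "ram": "16GB"}]),
}
hits = federated_search(federation, {"type": "laptop"})
```

Because each supplier only has to understand the shared criteria, its internal data model never needs to be merged with anyone else's.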

Page 19: A Virtual Distributed Database Model for Creating a Database Federation

The MDSS Federation

[Figure: The MDSS Federation. Diagram labels: Single user / VO; The MDSS System (Database Search, Job Allocation, Collation of Datasets); Supplier 1, Supplier 2 and Supplier n, each shown with a Web Service Layer, a Component Schema (Product Class) and a Component DBS.]

Page 20: A Virtual Distributed Database Model for Creating a Database Federation

Design Features

• Component DBS retain local autonomy.

• Schema integration or creation of federated or external schemas not required.

• Uniform data model addresses the issue of semantic heterogeneity.

• Supports different kinds of autonomies.

Page 21: A Virtual Distributed Database Model for Creating a Database Federation

Conclusions & Further Work

• MDSS Federation: a Grid enabled virtual distributed database model for product advertisement.

• Creation of downloadable product classes.

• Product class subscription.

• Multiple instances of Master Grid Services.

• Data Streaming.

• Memory Resident databases.

Page 22: A Virtual Distributed Database Model for Creating a Database Federation

Questions?