17
PoolParty PoolParty Search Server 2.0 Alexander Kreiser

PoolParty Search Server

Embed Size (px)

DESCRIPTION

PoolParty Semantic Search Server is described technologically. How to use SKOS thesauri to map data from different sources and how to generate a semantic index. How to build precise faceted search.

Citation preview

Page 1: PoolParty Search Server

PoolParty

PoolParty Search Server 2.0

Alexander Kreiser

Page 2: PoolParty Search Server

Agenda

• System Architecture

• Indexing and Mapping

• Search API

2

Page 3: PoolParty Search Server

System Architecture

3

poolparty product family

Document Index

Search Services

Search Application

metadata

mapping

Page 4: PoolParty Search Server

System Architecture

4

components

Search Services

Search Application

Collector<xml>

Semantic Indexer

Document Index

Cartridge

Page 5: PoolParty Search Server

System Architecture

• Built upon robust, flexible and well documented frameworks and technologies:

– Apache Tomcat 6.0

– Spring Framework 3.0.5

– Spring Security 3.0.3

– ExtJS 3.3.1

• Using SKOS (RDF) as metadata format – standardized by the W3C

5

technologies

Page 6: PoolParty Search Server

Indexing and Mapping

• Rich metadata in a standardized, extensible format (SKOS / RDF)

• Central metadata repository

• Thesaurus is managed with PoolParty

• Document metadata is mapped to concepts in the thesaurus

• Semantic information improves search experience

6

concepts

Page 7: PoolParty Search Server

Indexing and Mapping

7

mapping

Title: Document TitleCreated: 2010-03-15M_Countries: AustriaM_Vendors: Atlassian, ApacheAbstract: …

Document PoolParty Project

Concept Scheme

Concept Scheme

Concepts

Concepts

Page 8: PoolParty Search Server

Indexing and Mapping

8

mapping

Title: Document TitleCreated: 2010-03-15M_Countries: AustriaM_Vendors: Atlassian, ApacheAbstract: …

Document PoolParty Project

Concept Scheme

Concept Scheme

Concepts

Concepts

Page 9: PoolParty Search Server

Indexing and Mapping

9

Collector

Semantic Indexer

Document Index

<xml>

M_Countries: AustriaM_Vendors: Atlassian, Apache

process

Page 10: PoolParty Search Server

Indexing and Mapping

10

Collector

Semantic Indexer

Document Index

M_Countries: AustriaM_Vendors: Atlassian, Apache

<rdf>

process

Page 11: PoolParty Search Server

Indexing and Mapping

11

Collector

Semantic Indexer

Document Index

M_Countries: http://company.com/thesaurus/AT M_Vendors: http://company.com/thesaurus/Atlassian, http://company.com/thesaurus/Apache

enriched

<rdf>

process

Page 12: PoolParty Search Server

Indexing and Mapping

12

Document Index

Search Services

Search Application

metadata

Language: englishLanguage: german

"Austria""Österreich"

localized metadata

Page 13: PoolParty Search Server

Indexing and Mapping

• New metadata values are

– automatically created in the thesaurus in a special container(Free Concepts from PPX)

– put into the mapped concept schemee.g. "Regions"

• Thesaurus manager can edit and/or merge the new concepts

13

new metadata

Page 14: PoolParty Search Server

Search API

• Available web services:

– Search Service

– Suggest Service

– Similarity Service

• Supported formats:

– JSON

– XML

– RSS

14

services and formats

Page 15: PoolParty Search Server

Search API

• Search Service

– 10 parallel clients (5000 total requests)

– Time: 31 ms – 2000 ms; average: 303 ms (network delay: min. 25 ms)

15

performance tests

Page 16: PoolParty Search Server

Search API

• Suggest Service

– 10 parallel clients (5000 total requests)

– Time: 31 ms – 1094 ms (average: 118 ms) (network delay: min. 25 ms)

16

performance tests

Page 17: PoolParty Search Server

Contact

Andreas BlumauerManaging [email protected]+43-1-8974122-27

Alexander KreiserSystem [email protected] +43-1-8974122-38

17

punkt. netServices GmbH

Lerchenfelder Guertel 43A—1160 Wien / Austria

http://www.punkt.at/ http://poolparty.punkt.at/

http://twitter.com/PoolParty_Teamhttp://www.facebook.com/pages/

PoolParty/134189476633941