17
Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel , Victor Antonov Advisor: Maxim Gurevich

Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

  • View
    236

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Company

LOGO

Peer To Peer Full Text Search

Peer To Peer Full Text Search

Authors :

Tanya Finkel , Victor Antonov

Advisor: Maxim GurevichAdvisor: Maxim Gurevich

Page 2: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Problem DescriptionProblem Description

Sometimes we need to find some document.

We solve this problem by using peer to peer.

It’s good to search by looking through it, not only looking at its name.

Here comes the Full Text Search,but it runs locally only. Not Enough

Results!!!

Page 3: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local SearchLocal Search

Definition - this is the name for the field of search tools which search the contents of a user's own computer files.

Full Text Search – is a search, which engine examines all of the words in every stored document as it tries to match search words supplied by the user.

Let’s see what Full Text Searches we have for today

Page 4: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

1.Egothor - Impressive ,and it’s worth a look. Key features include: HTML, PDF, PS….

2.Carrot2 - is a research framework for experimenting with automated querying of various data sources, processing search results and their visualization.

3. MG4J - build compressed full-text indices for large collections of documents using sophisticated techniques such as interpolative coding.

4. OpenGrok - It can understand various program file formats and version control histories like SCCS, RCS, CVS and Subversion.

Local SearchLocal Search

5. Lucene - The de-facto open source search index used almost everywhere. Features include Ranked searching,

boolean and phrase queries, fielded searching and date-range searching .

Page 5: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Peer To peer Peer To peer

A peer to peer (or "P2P") computer network uses diverse connectivity between participants in a network and the cumulative bandwidth of network.

• Rather than conventional centralized resources where a relatively low number of servers provide the core value to a service or application. P2P networks are typically used for connecting nodes via largely ad hoc connections.

Page 6: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Project Distribution Project Distribution

1. GUI1. GUI

3.a. Local Full Text Search

3.a. Local Full Text Search

2. System Management2. System Management

3. b. Network Extension

3. b. Network Extension

Page 7: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local Full Text Search And Network Extension

Local Full Text Search And Network Extension

• Lucene-Indexation and Search on local hardware

• JXTA –Dynamic group management of subscribes and communication between them.

1. GUI1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 8: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Use case Diagram Use case Diagram 1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 9: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local Search-merging algorithmLocal Search-merging algorithm1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Lucene scores are depend on the local host document,

In the two different host you will return two different score for the same document.

1. For each document count number of words referred to query .

2. If number of words in Local search document bigger than

Number of words in network search document

Then:

local search document will before Network

search document in the list .

Otherwise: back ward.

Page 10: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local Search-snippet algorithmLocal Search-snippet algorithm1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

1. Look for whole query

2. If not enough sentences was found search for each word separatly.

If number of maxSentence <numberOffound sentences

or number of max Words<number of found word

Then stop the search

If not found sentences then

“no snippet available”

Page 11: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local Search-Class DiagramLocal Search-Class Diagram1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 12: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Local Search Class DiagramLocal Search Class Diagram1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 13: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Network search algorithmNetwork search algorithm1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

a)Searching the query in local history, if there’re no results in history cash, the search is as local search (see local search).

b) After searching the filtration is performed (removes results that

not for public view).

c) In this step we have a Vector of Results that we transform into Vector of NetworkResults. To each NetworkResult we add current peer ID, so in the future we can download from this peer a result file.

d) Now the peer (program) connects to the NRS from where the order came and sends the Vector of NetworkResults when connection is established.

Page 14: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Network search class diagramNetwork search class diagram1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 15: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Network search class diagramNetwork search class diagram1. GUI

1. GUI

2. System Management2. System Management

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 16: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich

Demonstration Demonstration 1. GUI

1. GUI

2. System Management2. System Management

P2PFTS

Tesla

Local Search

Network Search

File View Options AboutFile View Options About

C:\People\NikolaTeslaBiography.pdfTesla is best known for many revolutionary contributions in the field of electricity and magnetism in the late 19th and early 20th centuries. …

C:\music\Tesla.peaceOfTime.mp3Remote: WardenclyffeProject.docThe Tesla Wardenclyffe Project -- Established to preserve Wardenclyffe, the century-old laboratory of electrical pioneer Nikola Tesla, located in Shoreham, ...

3.a. Local Full Text Search

3.a. Local Full Text Search 3. b. Network

Extension

3. b. Network Extension

Page 17: Company LOGO Peer To Peer Full Text Search Authors : Tanya Finkel, Victor Antonov Advisor: Maxim Gurevich