Jongwook Woo
HiPICHiPIC
CSULA
e-Business Architecture with Enterprise Search Engine
KOCSEA 2008
Jongwook Woo, PhDJongwook Woo, PhD
High-Performance Internet Computing Center
Computer Information Systems Department
California State University, Los Angeles
HiPICHiPIC
Jongwook WooCSULA
Abstract e-Business architecture has been updated dramatically for the
past 10 years. The architecture moved from the traditional client-server architecture to n-tier architecture since Internet and Web came out to the world. Internet and web even create new product and revenue as companies have invested in Information Technology seriously not as an option. Content is one of the examples in this new era. Content has become intellectual property and a company or organization that has content can make profit by providing the content to the customers and the users. As the content increases high, there is a need for search engine in order to provide the proper content to the user quickly. Thus, search engine has received highlight in the content industry, mostly portal sites. The paper introduces search engine - especially internal search engine not web search engine - and its fundamentals. Then, e-Business architecture with search engine is illustrated.
HiPICHiPIC
Jongwook WooCSULA
Search Engine
Information retrieval system
Help users find what they need– In Internet or Intranet
Minimize time required to find information
search engines
Web search engineEnterprise search engineEx: Apache Lucene, Solr, FAST
HiPICHiPIC
Jongwook WooCSULA
Search Engine Basics
Indexing
Querying
Ranking
Lemmatization and Phrase search
Geo search
Search Navigation
HiPICHiPIC
Jongwook WooCSULA
Querying
Basic of all search engine
The user uses query to search With search terms AND, OR, NOT, NEAR
The process of specify criteria about an item on interest and have the engine find the matching items.
Syntax of querying
Formalized Natural Language Search
HiPICHiPIC
Jongwook WooCSULA
Indexing
How data is collected and stored then retrieve once a user do a search.Optimize the speed and performance of
finding relevant documents.
Need to determine ranking by Relevance and similarityDetermines the order in which it is presented
to the user
HiPICHiPIC
Jongwook WooCSULA
Lemmatized/Phrase Search
LemmatizedA search which allows the user to enter
a lemma and return the inflected formsEx: hamburger vs hamburgers
PhraseA query that matches documents
containing a particular sequence of terms.
Ex: “The Great Wall”
HiPICHiPIC
Jongwook WooCSULA
Others
Geo-search Parse text and identify the coordinates of the
geographic entities mentioned in the query Ex: East Palo Alto CA -> Latitude: 37.47 N,
Longitude: 122.14 W
Facet BrowsingSearch result can be categorized and displays the counts of each search results
categorized
HiPICHiPIC
Jongwook WooCSULA
Local (Enterprise) Search
Used by corporation to search in their own intranets to retrieve information.
Either used in public or not
Portal sites provides and sells contents
Contents is digital info: – text, multimedia data etc
Ex: citysearch.com, insiderpages.com, superpages.com etc
HiPICHiPIC
Jongwook WooCSULA
Web Search
Used to search the World Wide Web
Ex:www.Google.comwww.msn.comwww.Yahoo.com
Probably no more market to compete with these dinosaurs
HiPICHiPIC
Jongwook WooCSULA
Legacy N-Tier Architecture
Presentation LogicOnly for the user
Business Logic
Data Access LogicNormally DBSearch query communicates with DB
HiPICHiPICn-Tier Architecture
Jongwook WooCSULA
N-Tier Web Architecture
DB Server
DB Access Library
Business Logic
Presentation Logic
The user searches the site
HiPICHiPIC
Jongwook WooCSULA
Latest N-Tier Architecture
Presentation Logic
Both for the user and the author
Business Logic
Data Access Logic
Both for DB
Search Access Logic
Connects Search Engine that search query communicates with
HiPICHiPICn-Tier Architecture
Jongwook WooCSULA
N-Tier Web Architecture with Enterprise Search Engine
DB Server
DB Access LibrarySearch Access Library
Business Logic
Presentation Logic
Search Engine
Search DB
The user searches the site
Load/index DB to Search DB
HiPICHiPIC
Jongwook WooCSULA
Summary
Search Engine should not be optional for Internet Business
Enterprise Search has been popular by providing contents to the user and the third parties
as contents are productDon’t need to compete with Web Search
Business such as Google, Yahoo, MSN etc