17
Introduction to erik.hatcher @ lucidimagination.com 1 Sunday, October 16, 2011

Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Introductionto

erik.hatcher@

lucidimagination.com

1Sunday, October 16, 2011

Page 2: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

AbstractApache Solr serves search requests at the enterprises and the largest companies around the world. Built on top of the top-

notch Apache Lucene library, Solr makes indexing and searching integration into your applications straightforward.

Solr provides faceted navigation, spell checking, highlighting, clustering, grouping, and other search features. Solr also scales query volume with replication and collection

size with distributed capabilities. Solr can index rich documents such as PDF, Word, HTML, and other file types.

Come learn how you can get your content into Solr and integrate it into your applications!

2Sunday, October 16, 2011

Page 3: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

About me...

3Sunday, October 16, 2011

Page 4: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

http://lucene.apache.org/solr/

4Sunday, October 16, 2011

Page 5: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

http://lucene.apache.org/

5Sunday, October 16, 2011

Page 6: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Solr

• Simple: easy to use

• Powerful: feature rich and scales

• Open Source

• from the "Lucene people"

• encapsulates Lucene best practices

6Sunday, October 16, 2011

Page 7: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Fire It Up

• cd example

• java -jar start.jar

• [cd example/exampledocs;

• java -jar post.jar *.xml]

7Sunday, October 16, 2011

Page 8: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Indexing

• /update[/csv|/json|/extract]

• stream from local, remote, or POST data

• tutorial:

• cd example/exampledocs

• java -jar post.jar *.xml

• Tip: java -jar post.jar -help

8Sunday, October 16, 2011

Page 9: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Indexing JSON

POST to /update/json[ {"id" : "1", "title" : "Doc One"}, {"id" : "2", "title" : "Doc Two"}]

9Sunday, October 16, 2011

Page 10: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Indexing CSV

curl http://localhost:8983/solr/update/csv --data-binary @data.csv -H 'Content-type:text/plain; charset=utf-8’

10Sunday, October 16, 2011

Page 11: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Indexing Rich Documents

http://localhost:8983/solr/update/extract ?stream.file=/path/to/file.doc &stream.contentType=application/msword &literal.id=ds1-file.doc"

11Sunday, October 16, 2011

Page 12: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Other conduits

• DataImportHandler (DIH)

• API's: SolrJ, RSolr, (py)solr(.py), etc

• It's just data over HTTP

• "Enterprise"

• LucidWorks: SharePoint, (split) crawling, S3, HDFS, etc; including access control

12Sunday, October 16, 2011

Page 13: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Searching• http://localhost:8983/solr/select?q=*:*

• Typical looking Solr request - http://localhost:8983/solr/select +

• ?q=ipod

• &facet=on

• &facet.field=cat

• &fq=cat:electronics

• [&rows=10&start=20]

• [&fl=id,name,price&sort=price asc]

• [&wt=xml|json|csv|ruby|python|php|xslt|velocity&indent=on]

• [&debugQuery=true]

13Sunday, October 16, 2011

Page 14: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

/browse

14Sunday, October 16, 2011

Page 15: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

http://www.apache.org/

"Heavy Committing"

15Sunday, October 16, 2011

Page 16: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

... works

search platform

www.lucidimagination.com

16Sunday, October 16, 2011

Page 17: Introduction toIntroduction to erik.hatcher @ lucidimagination.com Sunday, October 16, 2011 1 Abstract Apache Solr serves search requests at the enterprises and the largest companies

Events

17Sunday, October 16, 2011