Davis mark advanced search analytics in 20 minutes

DESCRIPTION

Kitenga's ZettaVox and ZettaSearch products support the SOLR and Lucene ecosystems both at the ingestion point and for the search user. In this talk, I will show how ZettaVox, our professional content mining platform on Hadoop, can be used to index content and rich metadata into a LucidWorks Enterprise installation. Because it is built on Hadoop, ZettaVox scales up by scaling out. I will then create an end-user search and analytics experience with our ZettaSearch solution, which uses the faceted metadata to enhance information discovery and analysis. All in about 20 minutes.

Page 2:

Kitenga reinventing information

Mark Davis Founder/CTO

Page 3:

Advanced Search and Analytics in 20 minutes

Page 4:

Scalable Big Data

Analytics

Advanced Search

Built on an open source foundation

Page 5:

Conquer "Big Data"

Overcome information overload

Transform data into actionable intelligence

Find the needle in the haystack

Page 6:

Big Data

Enormous transactional data
Enormous unstructured information
Too big for databases
New tools are needed

Page 7:

Get Document

Extract Information

Index
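The three stages on this slide can be sketched as a tiny in-memory pipeline. This is only an illustration of the get/extract/index flow, not ZettaVox code: the function names, the `store` dictionary, and the "capitalized word" stand-in for entity extraction are all assumptions made for the example.

```python
import re
from collections import defaultdict

def get_document(doc_id, store):
    """Stage 1: fetch raw text for a document id (here, from an in-memory dict)."""
    return store[doc_id]

def extract_information(text):
    """Stage 2: pull simple metadata out of the text.
    Capitalized words are a crude stand-in for real entity extraction."""
    tokens = re.findall(r"[A-Za-z]+", text)
    return {
        "terms": [t.lower() for t in tokens],
        "entities": sorted({t for t in tokens if t[0].isupper()}),
    }

def index(doc_id, info, inverted_index):
    """Stage 3: add the document's terms to an inverted index (term -> doc ids)."""
    for term in info["terms"]:
        inverted_index[term].add(doc_id)

def run_pipeline(store):
    """Run all three stages over every document in the store."""
    inverted_index = defaultdict(set)
    metadata = {}
    for doc_id in store:
        text = get_document(doc_id, store)
        info = extract_information(text)
        index(doc_id, info, inverted_index)
        metadata[doc_id] = info["entities"]
    return inverted_index, metadata

# Hypothetical sample documents.
store = {
    "d1": "Hadoop scales out across many machines",
    "d2": "Solr and Lucene power faceted search",
}
inverted_index, metadata = run_pipeline(store)
print(sorted(inverted_index["search"]))
print(metadata["d2"])
```

The extracted metadata is kept next to the term index because that metadata is what later drives faceted navigation on the search side.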

Page 9:

• Scalable
• Fault-tolerant
• Network/rack aware

• Parallel programming model: MapReduce

• Cottage industry
• Complex MapReduce model

• Stability
• Command-line tools
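For readers new to the MapReduce programming model named above, here is a minimal single-process sketch of its three steps (map, shuffle, reduce) applied to word counting. It does not use Hadoop; the function names and sample documents are assumptions chosen for illustration, and a real cluster would run the map and reduce calls in parallel across machines.

```python
from collections import defaultdict
from itertools import chain

def map_phase(doc_id, text):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in text.split()]

def shuffle(pairs):
    """Shuffle: group emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)

# Hypothetical input documents.
docs = {"d1": "big data needs new tools", "d2": "new tools for big data"}

mapped = chain.from_iterable(map_phase(d, t) for d, t in docs.items())
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
print(counts)
```

Each map call sees only one document and each reduce call sees only one key, which is what lets the framework spread the work across nodes.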

Page 10:

• The voice of Big Data

• Out of the box MapReduce components for content mining

• Reduce time-to-action

• Integrated visualization and analytics

ZettaVox

Page 12:

• Faceted search for complex metadata

• Analytics and search together

• Revolutionize enterprise search

• Built on open source success

ZettaSearch
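As a rough illustration of what faceted search over metadata means, the sketch below counts the values of a metadata field across a result set and then drills down on one value. It is plain Python over in-memory records, not ZettaSearch or Solr code; the field names (`author`, `topic`) and the sample records are hypothetical.

```python
from collections import Counter

def facet_counts(docs, field):
    """Count how many documents carry each value of a metadata field.
    Multi-valued fields (lists) contribute one count per value."""
    counts = Counter()
    for doc in docs:
        value = doc.get(field)
        values = value if isinstance(value, list) else [value]
        for v in values:
            if v is not None:
                counts[v] += 1
    return counts

def drill_down(docs, field, value):
    """Keep only the documents whose field contains the selected facet value."""
    def matches(doc):
        v = doc.get(field)
        return value in v if isinstance(v, list) else v == value
    return [d for d in docs if matches(d)]

# Hypothetical indexed records with metadata.
docs = [
    {"id": 1, "author": "smith", "topic": ["hadoop", "search"]},
    {"id": 2, "author": "jones", "topic": ["search"]},
    {"id": 3, "author": "smith", "topic": ["analytics"]},
]

topic_facets = facet_counts(docs, "topic")
smith_docs = drill_down(docs, "author", "smith")
smith_topics = facet_counts(smith_docs, "topic")
print(topic_facets)
print(smith_topics)
```

Recomputing the counts after each drill-down is what ties the analytics view (charts over facet counts) to the current search results.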

Page 14:

START THE TIMER

Advanced Search and Analytics in 20 minutes

Page 15:

DID IT WORK?

Advanced Search and Analytics in 20 minutes

Page 16:

ZettaVox 1.4
• Drag-and-drop Hadoop analysis
• Natural Language Processing
• Cluster monitoring
• HDFS-aware analysis tools
• Integrated information visualization

ZettaSearch 1.0
• JSP Custom Taglib search designer
• Charts and tools tied to metadata
• Available for free (soon)

ZettaSearch 2.0
• Drag-and-drop user search designer
• Personalization
• Rich visualization options

Page 17:

Questions?
