solr 24 x 7 from development to productionpeople.apache.org/~sgoeschl/presentations/solr...solr...

34
SOLR 24 x 7 From Development To Production Sonntag, 14. April 13

Upload: others

Post on 20-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

SOLR 24 x 7From DevelopmentTo Production

Sonntag, 14. April 13

Page 2: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Source : http://www.oewa.at/index.php?id=2

Sonntag, 14. April 13

Page 3: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

More Facts & Numbers

• 1,61 million classified ads

• 12,9 million page impression every day

• 24,4 full-text search requests every sec

• 20,0 new classified ads every minute

Sonntag, 14. April 13

Page 4: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

The Business Case

Sonntag, 14. April 13

Page 5: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

The Business Case

• Linux is no longer supported

• Current license limited to 200 req / sec

• Overall FAST performance decreased

• No FAST experts available

• Provide better user experience

Sonntag, 14. April 13

Page 6: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 7: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Where isfull-text search used?!

Sonntag, 14. April 13

Page 8: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Where isfull-text search used?!

Everywhere

Sonntag, 14. April 13

Page 9: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 10: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 11: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 12: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

• A Java open source search engine

• Indexes content sources, processes query requests, returns search results.

• Based on Lucene but adds full enterprise search server features and capabilities.

• A web-based application that processes HTTP requests and returns HTTP responses.

What Is Apache SOLR?

Sonntag, 14. April 13

Page 13: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

The Road Ahead

• Create SOLR prototype

• Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

Sonntag, 14. April 13

Page 14: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

SOLR Prototype

• Based on Solritas

• Uses Velocity template engine

• Gives you a head start

• Easy to customize

Sonntag, 14. April 13

Page 15: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 16: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

The Road Ahead

• Create SOLR prototype

•Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

Sonntag, 14. April 13

Page 17: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Willhaben FAST Integration

• FAST Search Server tightly integrated

• Configuration & metadata in database

• Hard to integrate SOLR in parallel

• FAST to SOLR query translation

‣ Sort of hack

‣ Custom recursive parser

Sonntag, 14. April 13

Page 18: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

FAST2SOLR Translation

Sonntag, 14. April 13

Page 19: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

SOLR Performance

• FAST Search Server tightly integrated

• Configuration & metadata in database

• Hard to integrate SOLR in parallel

• FAST to SOLR query translation

‣ Sort of hack

‣ Custom recursive parser

Sonntag, 14. April 13

Page 20: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

The Road Ahead

• Create SOLR prototype

• Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

Sonntag, 14. April 13

Page 21: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Implicit Wildcard Search

• Classified Ads are very diverse and short

• The German language is hard

• Willhaben users are not computer geeks

• Using “Implicit Wildcard Search”

‣ DictionaryCompoundWordTokenFilter

Sonntag, 14. April 13

Page 22: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Sonntag, 14. April 13

Page 23: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

SOLRCustom Components

Sonntag, 14. April 13

Page 24: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

System Monitoring

Sonntag, 14. April 13

Page 25: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Slow Query Log

Sonntag, 14. April 13

Page 26: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Frequent Search Terms

Sonntag, 14. April 13

Page 27: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

Sonntag, 14. April 13

Page 28: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

Sonntag, 14. April 13

Page 29: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

• Auti Q5

Sonntag, 14. April 13

Page 30: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

• Auti Q5

• Mitzbuichi

Sonntag, 14. April 13

Page 31: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

• Auti Q5

• Mitzbuichi

• Star Woars

Sonntag, 14. April 13

Page 32: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

• Auti Q5

• Mitzbuichi

• Star Woars

• kopjuter

Sonntag, 14. April 13

Page 33: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Best User SearchTerms

• gümnastikband

• Auti Q5

• Mitzbuichi

• Star Woars

• kopjuter

• kinderdreirad mit 3 breiten gummireifen aus den 60er jahren

Sonntag, 14. April 13

Page 34: SOLR 24 x 7 From Development To Productionpeople.apache.org/~sgoeschl/presentations/solr...SOLR Performance • FAST Search Server tightly integrated • Configuration & metadata

Questions & Answers

Sonntag, 14. April 13