better search engine testing for solr

25
BETTER SEARCH ENGINE TESTING FOR SOLR CVILLE MEETUP | [email protected] | @DEP4B 1

Upload: opensource-connections

Post on 27-Jan-2015

113 views

Category:

Technology


0 download

DESCRIPTION

A refactoring of my earlier presentation targeted more towards Solr, and less on testing. I'm basing my presentation for Lucene EuroCon on this. Would love some more feedback!

TRANSCRIPT

Page 1: Better search engine testing for solr

BETTER SEARCH ENGINE TESTING

FOR SOLR

CVILLE MEETUP | [email protected] | @DEP4B

1

Page 2: Better search engine testing for solr

WHY AM I QUALIFIED TO BE UP HERE?

• Principal of OpenSource Connections - Solr Search Consultancy

•Member of Apache Software Foundation

• Presenter at conferences (OSCON, ApacheCON, UberConf, ExpoQA, STPcon 2009/2011!)

• Fascinated by the art of software development

2

Page 3: Better search engine testing for solr

AUTHOR

3.X Coming

Soon

3

Page 4: Better search engine testing for solr

FATHER

4

Page 5: Better search engine testing for solr

AGENDA

Why Test Search?

Two General Approaches

Test Tools for Solr

Wrap Up

5

Page 6: Better search engine testing for solr

REDUCE HUMAN ERROR

6

Page 7: Better search engine testing for solr

photo credit to Jay Grieves @ www.bigswingingdeveloper.com

7

Page 8: Better search engine testing for solr

TWO APPROACHES TO TEST

Humans

Automated Tests

8

Page 9: Better search engine testing for solr

HUMAN TESTING

• Better deals with ambiguity

9

Page 10: Better search engine testing for solr

RELEVANCY IS LIKE PORNOGRAPHY....

10

Page 11: Better search engine testing for solr

I KNOW IT WHEN I SEE IT!

http://en.wikipedia.org/wiki/Les_Amants

11

Page 12: Better search engine testing for solr

AUTOMATED TESTING12

Page 13: Better search engine testing for solr

SWEET SPOT

• Repeat Testing for Relevancy Tuning

• Performance Testing: hitting SLA?

• Regression Testing (Typical use case for automated testing)

13

Page 14: Better search engine testing for solr

“ONE TRUE ANSWER”

• Absolute Truth / Matrix / Grid / TREC / Relevancy Assertions

• The correct answers for each search are known ahead of time

• Humans judges often decide these correct answers, stored as Relevancy Assertions

• Can be labor intensive to setup

• A “Numerical Grade” is produced for comparision

14

Page 15: Better search engine testing for solr

SO WHAT IS A JUDGEMENT LIST?

15

Page 16: Better search engine testing for solr

16

Page 17: Better search engine testing for solr

COMPARABLE RESULTSOriginal AjaxSolr WebSolr

gata 4HIF 1 alpha

HIF-1 alpha

HIF1 alpha

HIF1AiNOSIRE-1 alpha

T7 TagCollagen I

Collagen 1

Collagen-1

40.0 80.0 80.090.0 100.0 30

100.0 100.0 30

70.0 100.0 100.0

100.0 100.0 100.060.0 60.0 60.030 90.0 90.0

0.0 100.0 90.050.0 90.0 90.0

40.0 90.0 40.0

10.0 50.0 40.0

17

Page 18: Better search engine testing for solr

WRAP UP

18

Page 19: Better search engine testing for solr

User  Interface

Search  Engine

Informa3on

Search  Stack

19

Page 20: Better search engine testing for solr

20

Page 21: Better search engine testing for solr

21

Page 22: Better search engine testing for solr

A HYBRID APPROACH?

Humans

Automated Tests

22

Page 23: Better search engine testing for solr

A HYBRID APPROACH?

Humans

Automated Tests

Tool SupportedTesting

Judgements23

Page 24: Better search engine testing for solr

James Bach,satisfice.com

TESTING IS A HUMAN ACTIVITY

24

Page 25: Better search engine testing for solr

25

• twitter: dep4b

• speakerrate: http://www.speakerrate.com/epugh/

• email: [email protected]

THANK YOU!

25