knowledge base tuning

50
© 2008 RightNow Technologies, Inc. © 2008 RightNow Technologies, Inc. Knowledge Base Tuning September 24, 2008 Zuzana Gedeon – Research Labs

Upload: angus

Post on 03-Feb-2016

57 views

Category:

Documents


0 download

DESCRIPTION

Knowledge Base Tuning. September 24, 2008 Zuzana Gedeon – Research Labs. Overview. Key Ideas KB – what do we/you mean by KB Searching Background on Search Engine Technology KB Configuration & Tuning. K B in general terms. Knowledge base Information available to the user - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Knowledge Base Tuning

September 24, 2008Zuzana Gedeon – Research Labs

Page 2: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Overview

Key Ideas– KB – what do we/you mean by KB

Searching– Background on Search Engine Technology

KB Configuration & Tuning

Page 3: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

KB in general terms

Knowledge base – Information available to the user

Data mining – making KB available – Applications helping user to access this

information

Page 4: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Who uses the Knowledge Base?

End users

Marketers

Managers

Subject experts

CSRs

External documents

Answer database

Community Forum

Internal database

Page 5: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Types of search - architectureFilter based Direct database query – built into views engine

• product/category filtering• Date, customer email address, …

Most runtime selectable filters in Reports

Text/Index based“Google style” searchDocuments -> index– Boosting and weight calculation

KB Browse – Navigational, exploratory search

No Search !!– Get what you need without need for search

Page 6: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Mashup

Report filters with index based search

Incident searchAnswer search pages

Filter > search_thread (search_xxx) Sort by match_wt!!!

Page 7: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Types of search - architecture

Filter based Direct database query – built into views engine

• product/category filtering• Date, customer email address, …

Most runtime selectable filters in Reports

Text/Index based“Google style” searchDocuments -> index– Boosting and weight calculation

KB Browse – Navigational, exploratory search

No Search !!– Get what you need without need for search

Page 8: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

EU Knowledge sources and delivery

External documents

Answer database

Community Forum

VoiceKB Search

Syndication widget

KB Browse

Pro Services integration

Page 9: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

No Search !!

Users find what they are looking for without any search just by showing them the right stuff as soon as they access page.

Fact: A large percentage of user sessions do NOT do a search

Page 10: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

How do we do that?Good content – Well-chosen category and product organization– Good descriptive titles– Concise information (generic vs. specific)– Consistency

Administrator– Topic/Add words– User specifiable content tags to start/stop indexing for

searching– Answer as a file attachment or URL versus just Q&A pair– SmartGuide to create branching (script-like) Answers– Publish-on and review-on dates– Place on top (“fix on top” really sparingly)– Answer access level conditional sections

Users– Users ranking helpfulness - explicitly– Ants leaving pheromone trail – implicit ranking

Page 11: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Find information where they search

Sitemap: exporting KB to search engines

What are Sitemaps?Sitemaps are an easy way for webmasters to inform search engines such as

Google and Yahoo about pages on their sites that are available for crawling.

Sitemap Feature Description:Facilitates Google’s (and other search engine’s) spidering of your public

RightNow knowledgebase content.

Benefits:– Allows you to control how search engine spiders visit and consume your

knowledgebase content.– If you desire, this can help your content go to the front of the line in

Google/Yahoo web spiders.

Page 12: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Information placement

Knowledge Syndication Widgetwith Product filter

Page 13: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

How do we do that?

Good content – Well-chosen category and product

organization– Good descriptive titles– Concise information (generic vs. specific)– Consistency

Page 14: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

How do we do that?

Administrator– Topic/Add words– User specifiable content tags to start/stop

indexing for searching– Answer as a file attachment or URL versus

just Q&A pair– Publish-on and review-on dates– Answer access level conditional sections– Place on top (“fix on top” really sparingly)

Page 15: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Allows KB administrator to associate either a WWW document or KB Answer to a specific single search term The given document appears first in the list of search results Document can be set to always be shown Useful for directed information presentation, advertising,

notices, announcements, etc.

Topic Words for Search

Page 16: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Page 17: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

How do we do that?

Administrator– Topic/Add words– User specifiable content tags to start/stop

indexing for searching– Answer as a file attachment or URL versus

just Q&A pair– Publish-on and review-on dates– Answer access level conditional sections– Place on top (“fix on top” really sparingly)

Page 18: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Stop/start index

This text is being indexed <!--stopindex--> this text is not being indexed <!--startindex--> And this text is again indexed

Page 19: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

How do we do that?

Administrator– Topic/Add words– User specifiable content tags to start/stop

indexing for searching– Answer as a file attachment or URL versus

just Q&A pair– Publish-on and review-on dates– Answer access level conditional sections– Place on top (“fix on top” really sparingly)

Page 20: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Users– Users ranking helpfulness - explicitly– Ants leaving pheromone trail – implicit ranking

AI – aging of the information – agedatabase

Administrator– Promoting new answers

How do we do that?

Page 21: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

No Search !!

Users find what they are looking for without any search just by showing them the right stuff as soon as they access page.

Page 22: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Users + AI

Common-> knowledge base -> Answer search:SA_SOLVED_WEIGH_PREF – long term or short term preference

Page 23: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Smart Assistant

Page 24: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Smart Assistant

Page 25: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Relationships Between Answers

Sibling Answers section must be

enabled from workspace property

Can manually relate answers together

Page 26: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Use Smart Assistant

Set up Smart Assistant RulesTry to answer the question without admin interaction

Help in populating KB – respond to customer inquiries – propose new answers

Page 27: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Smart Assistant tuning

Limit by matching Browse topics RNT UI → Support → SA_NL_MATCH_THRESHOLD

Enables the ability to restrict SmartAssistant suggested answers to answers that have the same or closely matching locations in the browse tree. The accepted values are: 0 - do not restrict, 1 - use answers from any closely matching clusters, and 2 - use only best matching clusters. If SA_DM_FREQ is set to 0, the value of SA_NL_MATCH_THRESHOLD will be forced to 0 regardless of the value set here. Default is 1.

Page 28: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Suggested Searches

Using history of end-user searches we use a data-mining technique to establish relationships between similar search phrases

EU_SUGGESTED_SEARCHES_ENABLE – Each search phrase suggested to an end-user must pass these tests

• Each word spelled correctly• Positive SmartSense value• No words in blacklist• Be complementary to current search

SEARCH_SUGGESTIONS_DISPLAY – 0 no recommendations– 1 turn on recommended products– 2 turn on recommended categories– 4 turn on recommended Browse topics

MAX_SEARCH_SUGGESTIONS

Page 29: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Web Like SearchTraditional keyword searching on the internet or within an operating system.

User’s mental model (Google, Yahoo, MSN)

Attributes of SearchIndexes the ‘entire’ corpus of information.Almost never results in a zero matches.User Testing in Jan 08 showed that Google is expected

behavior whenever the term ‘Search’ is paced next to a text box on the web.

Page 30: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Answer Search

Page 31: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

External documents search

Web pagesAnswers

Page 32: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

What’s an Index?

The index is where all the information about what is searchable is stored

Indexes are used to speed finding search results by not requiring each document to be scanned during the search process

Most search engines (including ours) use an ‘inverted index’ which means that they map words to documents, or words to locations within documents- Similar to the index in the back of a book- Vs “find a word with your finger”

Indexes are pre-computed when documents are created/edited

Page 33: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Example of an Index

Liberty:

The condition of being free from restriction or control.

Four score and seven years ago our fathers brought forth on this continent, conceived in Liberty, and

Score:

A group of 20 items. Hence, four score is 4x20, or 80.

The North American Continent consists of the countries: the United States of America, Canada, Mexico,years

united

states

seven

score

restriction

north

mexico

liberty

LocationWord

Index

Page 34: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Stopwords and Word Stemming

Stopwords are human-language connector words that are not

generally useful in information retrieval

– a, an , the, or, on , for, …

– “To be or not to be”– RightNow Feature: multiple editable stop word lists

• Incidents • Answers

Word Stemming

– Standard natural language processing technique

– Unique stemmer for each language

– CONNECT CONNECTED CONNECTING CONNECTION CONNECTIONS

=> CONNECT

- Generalizes searches (exact matches not considered)

Page 35: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Query Processing and Result Ranking

How does a search query work?– Query is processed via word stemming and removal of

stopwords

– Aliases are added to the search terms (non stopwords, original form)

– Search terms are looked up in the index

– The total hits are gathered and sorted by document via weighting formula(s)

– The documents’ attributes (title, link, etc.) are fetched and returned to the browser

– postprocessing algorithm may be used before display

Page 36: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Answer Search

Page 37: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Word Bias Configuration

Some words are relatively more important than others based upon

location

– Words in the Subject & Keywords field are more important than words in the body of a document or the attachments

RightNow Configuration OptionsSRCH_KEY_WEIGHT 50 KeywordsSRCH_PROD_WEIGHT 50 Product WordsSRCH_CAT_WEIGHT 50 Category WordsSRCH_SUBJ_WEIGHT 45 Subject/Title WordsSRCH_DESC_WEIGHT 30 Question WordsSRCH_BODY_WEIGHT 4 Answer WordsSRCH_ATTACH_WEIGHT 4 File-Attach. Words

Set these to be the same across interfaces!

Page 38: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

AND vs. OR Query Processing

Do the search results contain ALL words in the search text or just SOME words?

– All major Internet search engines use AND

– We use OR by default with a heavy multi-word weight bias .. “AND-like ordering”

– Why do we use OR? AND does not work well for small documents sets (under 10,000 answers).

– Why does AND perform badly on small document sets? It’s too easy for a user to construct a query with no search results.

Page 39: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Result Focusing and Truncation

Dynamic Truncation Bias (Answers)– Truncate Search Results to those most scoring best

• RNT UI: SEARCH_RESULT_LIMITING – natural breaks• RNT UI: ANS_SRCH_THRESHOLD – break by weight

• RNT UI: ANS_SRCH_SUB_THRESHOLD – avoid 0 results

Concept-biased Search– Focus Search results based upon matching of query to existing

KB learned topics

• RNT UI: SEARCH_RELEVANCE_FOCUS (Answers)

• RNT UI: SA_NL_MATCH_THRESHOLD (SmartAssistant)

Page 40: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

External documents search

Web pagesAnswers

Page 41: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

External documents and tuning No much of content control– spider– uses only externally available content

Not much tuning control– Title and body weight

• SRCH_KEY_WEIGHT Meta+ products, categories• SRCH_SUBJ_WEIGHT Title• SRCH_DESC_WEIGHT Text

– HtDig with Clucene

File Attachment Size– FATTACH_MAX_SIZE Core Engine

Search Pulldowns – Kill them– ANS_SEARCH_BY_ENABLED – ANS_SORT_BY_ENABLED

Page 42: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Important Files in the File Manager

exclude_answers.txt

End-user Stopwords

exclude_incidents.txt

Incident Stopwords

aliases.txt Always-On Search Thesaurus

thesaurus.txt Thesaurus for similar search

smartsense.txt Emotional Word Ratings

blacklist.txt No-Show words for Sugg. Searches

userdic.tlx Custom Dictionary for Spellchecker

Page 43: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Wizard exclude_answers.txt

Page 44: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Aliases

Establishes a link between two words to treat them as synonyms for every search type

– FBI = Federal Bureau of Investigation

– Whiskey = Scotch

Page 45: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Analytics

Keyword Searches report– Frequent searches (important content)– Searches with no answers (missing content)– Searches with too many answers (configuration

and tuning needed)

Gap report

Page 46: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Keyword Searches Report

Page 47: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Information Gap Report

Use the Gap Report to identify ‘holes’ in the end-user KB.

Compares recent incidents to existing Answers.

Gap Report Config Options: GAP_FREQUENCY & GAP_TIME_PERIOD – default 7 days for both.

Page 48: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Information Gap Report Screenshot

Page 49: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Other Customization

 EU_BROWSER_SEARCH_PLUGIN - Enables the Answer and External Document search pages to provide an interface for web browsers to query them directly from their built-in search bars, such as those provided by Google or Yahoo!. Default is disabled (No).

 EU_SYNDICATION_ENABLE – widgets

 ANS_SORT_BY_ENABLED Enables the Sort By drop-down menu on the Find Answers page. This setting overrides any view settings. Default is disabled (No). – this is the reason to have limited results set!!!!

 SEARCH_WITH_OPERATORS Enables processing of +, - and ~ operators while searching for answers. Default is enabled (Yes).

Page 50: Knowledge Base Tuning

© 2008 RightNow Technologies, Inc.© 2008 RightNow Technologies, Inc.

Thank You

Questions?