best practices for search
DESCRIPTION
Best Practices for Search. for the Federal Government Marti Hearst Web Manager University November 10, 2009. The Importance of Search for Govt. OMB memorandum, Dec 2005: “When disseminating information to the public-at-large, publish your information directly to the internet.” - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/1.jpg)
Best Practices for Search
for the Federal Government
Marti HearstWeb Manager University
November 10, 2009
![Page 2: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/2.jpg)
The Importance of Search for Govt
• OMB memorandum, Dec 2005:“When disseminating information to the public-at-large, publish your information directly to the internet.”
• Pres. Obama’s memorandum, Jan 21, 2009:“Information maintained by the Federal Government is a national asset. My Administration will take appropriate action, consistent with law and policy, to disclose information rapidly in forms that the public can readily find and use. ”
![Page 3: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/3.jpg)
A bit about me• Professor at the School of Information
at University of California, Berkeley. Teach masters students User Interface Design, Search
Engines, Computational Linguistics, Visualization.
• Search User Interfaces• Visiting government for 1 year
Updating usasearch.gov Looking at site search alternatives. Generally kibbitzing
![Page 4: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/4.jpg)
Two Focus Areas
• Web search engines The quality and form of your content How your results are viewed in
search engine listings How your site is crawled
• Site search The search interface What is crawled How results are presented
![Page 5: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/5.jpg)
Outline
• Designing your site for effective search
• Site search interfaces• Special considerations for web
search engines• An example of what not to do.
![Page 6: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/6.jpg)
• both content and tech people• have to be focused on it together
• mention why using (free) book example• add an exercise towards the start
• top 3 things to do right away• don’t force h1n1, be sure swine flu too• seo integrated into process
![Page 7: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/7.jpg)
Use Proven Interface Techniques
Use modern search UI ideas that are known to have good usability.
Apply the principle: recognition over recall.• Related query suggestions• Auto-suggest as the user types• Use faceted navigation where appropriate.
![Page 8: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/8.jpg)
Search-as-you-Type (SAYT)
• As the user types, shows other peoples’ queries with the same word stems.
• Helps people think of additional words (recognition over recall)
• Proven to improve search results.
![Page 9: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/9.jpg)
Evidence-based Decision Making
DATA TRUMPS INTUITIONS (Kohavi)
![Page 10: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/10.jpg)
Use Evidence-based Decision Making
User behavior determines if an idea is retained.
A/B testing is a standard way to do this.1) Make small changes to an interface.2) Show the changed interface to a
significant sample of the user population, show everyone else the original version.
3) Do this over time (~ two weeks) and for (tens of) thousands of users.
4) Compare what the two groups do over time.
5) Based on this, decide whether to keep or reject the feature.
![Page 11: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/11.jpg)
Evidence-based Decision Making
• Example: Dan Siroker on Obama for
America’s website and video design decisions
Easy to measure the outcome: it is in money donated.
http://www.siroker.com/archives/2009/05/14/obama_lessons_learned_talk_at_google.html
![Page 12: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/12.jpg)
Vote: Which Button is Best?
count down counter
![Page 13: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/13.jpg)
Which Button is Best?
![Page 14: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/14.jpg)
Which Button is Best?
![Page 15: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/15.jpg)
Ease of Use: Summary
USE PROVEN UI TECHNIQUES
REDUCE EXTRA STEPS
USE CLEAR LANGUAGE
MAKE EVIDENCE-BASED UI DECISIONS
![Page 16: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/16.jpg)
How Web Search Engines Work
![Page 17: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/17.jpg)
How Search Engines Work
i. Gather the contents of all web pages (crawling)
ii. Organize the contents of the pages in a way that allows efficient retrieval (indexing)
iii. Take in a query, determine which pages match, and show the results (ranking and display of results)
Three main parts:
![Page 18: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/18.jpg)
Standard Web Search Engine Architecture
userquery
![Page 19: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/19.jpg)
Standard Web Search Engine Architecture
crawl theweb
Create an inverted
index
Check for duplicates,store the
documents
Inverted index
Search engine servers
DocIdsCrawlermachines
![Page 20: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/20.jpg)
Standard Web Search Engine Architecture
crawl theweb
Create an inverted
index
Check for duplicates,store the
documents
Inverted index
Search engine servers
userquery
Show results To user
DocIdsCrawlermachines
![Page 21: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/21.jpg)
i. Spiders or crawlers
• How to find web pages to visit and copy? Can start with a list of domain names,
visit the home pages there. Look at the hyperlink on the home
page, and follow those links to more pages.
Keep a list of urls visited, and those still to be visited.
Each time the program loads in a new HTML page, add the links in that page to the list to be crawled.
![Page 22: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/22.jpg)
Spider behaviour varies
• Parts of a web page that are indexed
• How deeply a site is indexed • Types of files indexed• How frequently the site is
spidered
![Page 23: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/23.jpg)
Four Laws of Crawling
• A Crawler must show identification
• A Crawler must obey the robots exclusion standardhttp://www.robotstxt.org/wc/norobots.html
• A Crawler must not hog resources
• A Crawler must report errors
![Page 24: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/24.jpg)
Lots of tricky aspects• Servers are often down or slow• Hyperlinks can get the crawler into
cycles• Some websites have junk in the web
pages• Now many pages have dynamic
content• The web is HUGE
![Page 25: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/25.jpg)
The Internet Is Enormous
Image from http://www.nature.com/nature/webmatters/tomog/tomfigs/fig1.html
![Page 26: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/26.jpg)
“Freshness”
• Need to keep checking pages Pages change
• At different frequencies
• Pages are removed
Many search engines cache the pages (store a copy on their own servers)
![Page 27: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/27.jpg)
What really gets crawled?
• A small fraction of the Web that search engines know about; no search engine is exhaustive
• Not the “live” Web, but the search engine’s index
• Not the “Deep Web”• Mostly HTML pages but other file
types too: PDF, Word, PPT, etc.
![Page 28: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/28.jpg)
ii. Index (the database)
Record information about each page• List of words
In the title? How far down in the page? Was the word in boldface?
• URLs of pages pointing to this one• Anchor text on pages pointing to
this one
![Page 29: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/29.jpg)
Inverted Index• How to store the words for fast lookup• Basic steps:
Make a “dictionary” of all the words in all of the web pages
For each word, list all the documents it occurs in.
Often omit very common words• “stop words”
Sometimes stem the words • (also called morphological analysis)• cats -> cat• running -> run
![Page 30: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/30.jpg)
Inverted Index Example
Image from http://developer.apple.com/documentation/UserExperience/Conceptual/SearchKitConcepts/searchKit_basics/chapter_2_section_2.html
![Page 31: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/31.jpg)
Inverted Index
• In reality, this index is HUGE• Need to store the contents
across many machines• Need to do optimization tricks to
make lookup fast.
![Page 32: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/32.jpg)
Query Serving Architecture
• Index divided into segments each served by a node
• Each row of nodes replicated for query load
• Query integrator distributes query and merges results
• Front end creates a HTML page with the query results
Load Balancer
FE1
QI1
Node1,1 Node1,2 Node1,3 Node1,N
Node2,1 Node2,2 Node2,3 Node2,N
Node4,1 Node4,2 Node4,3 Node4,N
Node3,1 Node3,2 Node3,3 Node3,N
QI2 QI8
FE2 FE8
“travel”
“travel”
“travel”
“travel”
“travel”
…
…
…………
…
…
![Page 33: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/33.jpg)
iii. Results ranking
• Search engine receives a query, then• Looks up the words in the index,
retrieves many documents, then• Rank orders the pages and extracts
“snippets” or summaries containing query words.
Most web search engines assume the user wants all of the words
• These are complex and highly guarded algorithms unique to each search engine.
![Page 34: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/34.jpg)
Some ranking criteria • For a given candidate result page, use:
Number of matching query words in the page Proximity of matching words to one another Location of terms within the page Location of terms within tags e.g. <title>, <h1>,
link text, body text Anchor text on pages pointing to this one Frequency of terms on the page and in general Link analysis of which pages point to this one (Sometimes) Click-through analysis: how often the
page is clicked on How “fresh” is the page
• Complex formulae combine these together.
![Page 35: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/35.jpg)
Measuring Importance of Linking
• PageRank Algorithm Idea: important pages are pointed
to by other important pages
Method:• Each link from one page to another is counted as
a “vote” for the destination page• But the importance of the starting page also
influences the importance of the destination page.• And those pages scores, in turn, depend on those
linking to them.
Image and explanation from http://www.economist.com/science/tq/displayStory.cfm?story_id=3172188
![Page 36: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/36.jpg)
CRAFT SITES FOR FINDABILITY
(SEO)
![Page 37: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/37.jpg)
Making Web Sites Attractive to Search
Engines• Called “Search Engine
Optimization” (SEO)• There is a LOT of information
about this on the web Most is about how to improve your
site Some is about “cheating”; avoid this
• There are many tools to help you too.
![Page 38: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/38.jpg)
The Most Important Principle:
Good, unique content trumps everything else.
![Page 39: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/39.jpg)
Content is Key
• Web sites that are primarily high-quality, unique content will be ranked highly. Not just links to other content Not re-packaging of other content
• Example: My online book was top ranked for “search
user interfaces” within one day of site launch.
It is also top ranked for many related queries.
![Page 40: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/40.jpg)
![Page 41: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/41.jpg)
Web Site Characteristics
• These can lead to high search engine ranking (but no guarantees): High-quality, unique content. Linked to by high-quality sites. Been around a long time with
consistent content.
![Page 42: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/42.jpg)
Keyword Placement
• Search engines place “weight” on words according to where they are used
• Place important words in Title tags Headings (H1 is key) and emphasized
text Visible body text Description metadata – often used in
search results snippets. Alt text in images
![Page 43: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/43.jpg)
Keyword Variation
• Describe the same concepts using different words within the relevant pages. Compare “search interfaces” with “search user
interfaces” in the next slide. 1 hit versus 4 hits in the top 6 I need to use more variation for the key concepts
• But it must make sense in your page; Don’t hide dictionaries of words ! Can include them in the description
metadata.
![Page 44: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/44.jpg)
![Page 45: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/45.jpg)
• put in aw-stat logs
![Page 46: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/46.jpg)
The Importance of URLs
• Meaningful, short urls improve search engine ranking and usability
• Urls that consist of computer-generated database queries can hurt rankings.
• Urls with lots of redirects also hurt.
![Page 47: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/47.jpg)
The Importance of Titles
• The title tag determines what words show up in the search results title. Make them descriptive of the site Vary them to differentiate them.
• Example (next page) Consistently varies the title to show
how they differ. But makes a mistake in the metadata
description by putting the part that varies too far from the start, so it all looks the same.
![Page 48: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/48.jpg)
![Page 49: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/49.jpg)
Robots Exclusion
• It is important to check your robots.txt files to be sure they are allowing crawling.
• If your server can’t handle a lot of traffic, use the site map file to slow crawlers down.
![Page 50: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/50.jpg)
Site Maps
• There are two kinds of site maps:1. A navigation structure visible to
users2. An XML file visible only to
search engines• The latter is important to help ensure the
pages on your site are crawled.• You can also specify the frequency with which
you hope the pages will be crawled.• There are free tools to help you do this.
![Page 51: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/51.jpg)
![Page 52: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/52.jpg)
![Page 53: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/53.jpg)
![Page 54: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/54.jpg)
Examples of What Not To Do
For both site design and SEO.
Or … don’t mess with my dog!
![Page 55: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/55.jpg)
Photo of Emmi
![Page 56: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/56.jpg)
What happens when you search for recalls
?
![Page 57: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/57.jpg)
![Page 58: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/58.jpg)
What happens when you type
http://recalls.gov
?
![Page 59: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/59.jpg)
![Page 60: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/60.jpg)
The lesson: make your url (web address) easy
to find There should at least be a
redirect from recalls.gov to www.recalls.gov
Also, the url should match its description in the site title
field!
![Page 61: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/61.jpg)
![Page 62: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/62.jpg)
Where is search on this site?
![Page 63: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/63.jpg)
![Page 64: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/64.jpg)
The point: the search entry form should be highly visible and in a
standard position.
Usually wide and centered towards the top or else
shorter and on the upper right.
![Page 65: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/65.jpg)
Where do I search for that recent dog food
recall?
![Page 66: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/66.jpg)
![Page 67: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/67.jpg)
The point: do not make the user guess how your information is
structured.There should be one search engine for all government recall
information.
![Page 68: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/68.jpg)
The point: do not require users to fill out
structured search forms.
This can be an option but should not be required.
Showing categories with previews of how many hits are associate with each is better than lots of
entry forms.
![Page 69: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/69.jpg)
What happens after I search?
![Page 70: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/70.jpg)
![Page 71: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/71.jpg)
The point: use standard layout
(unless there is a good reason not to)
This site puts too much text at the top before showing search
results.
Also, searchers frequently modify their query
It is standard to show the search form with the previous query at
the top.
![Page 72: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/72.jpg)
![Page 73: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/73.jpg)
The point: do promote commonly requested
information to the top of the results.
This site uses “best bets” to promote popular
content to the top; the user finds what they want.
![Page 74: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/74.jpg)
What happens if I search for recalls at
searchusa.gov?
![Page 75: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/75.jpg)
![Page 76: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/76.jpg)
The point: use descriptive titles.
It is important to put the distinguishing information first so the repeated part does not
dominate. For example:
Home page: Recalls.govRecent Recalls
Food Safety RecallsAutomotive Recalls
![Page 77: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/77.jpg)
What happens if I search for car recalls at major search engines?
![Page 78: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/78.jpg)
![Page 79: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/79.jpg)
![Page 80: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/80.jpg)
What happens if I search for car recalls at major search engines?
Answer: I don’t see recalls.gov
![Page 81: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/81.jpg)
What is on the page for car recalls?
![Page 82: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/82.jpg)
![Page 83: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/83.jpg)
The point: use words that your users use.
Notice that the main page for cars at recalls.gov does not appear
towards the top. The word “car” does not play an important role on
relevant page.
![Page 84: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/84.jpg)
Tools for Improving Web Sites for Search Engines
![Page 85: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/85.jpg)
![Page 86: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/86.jpg)
![Page 87: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/87.jpg)
![Page 88: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/88.jpg)
![Page 89: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/89.jpg)
Search Engine Information
• SEO http://www.ninebyblue.com/
• Keep current with industry http://www.searchengineland.com http://battellemedia.com
• Search Interface Principles http://searchuserinterfaces.com
• Search Design Patterns (Peter Morville) http://
www.flickr.com/photos/morville/collections/72157603785835882/
![Page 90: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/90.jpg)
Faceted Navigation
For Structured Web Site Search
![Page 91: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/91.jpg)
The Idea of Facets• Facets are a way of labeling data
A kind of Metadata (data about data) Can be thought of as properties of
items• Facets vs. Categories
Items are placed INTO a category system
Multiple facet labels are ASSIGNED TO items
![Page 92: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/92.jpg)
The Idea of Facets• Create INDEPENDENT categories
(facets) Each facet has labels (sometimes
arranged in a hierarchy)• Assign labels from the facets to
every item Example: recipe collection
Course
Main Course
CookingMethod
Stir-fry
Cuisine
Thai
Ingredient
Bell Pepper
Curry
Chicken
![Page 93: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/93.jpg)
The Idea of Facets• Break out all the important
concepts into their own facets• Sometimes the facets are
hierarchical Assign labels to items from any
level of the hierarchy
Preparation Method Fry Saute Boil Bake Broil Freeze
Desserts Cakes Cookies Dairy Ice Cream Sorbet Flan
Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple
![Page 94: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/94.jpg)
Using Facets
• Now there are multiple ways to get to each item
Preparation Method Fry Saute Boil Bake Broil Freeze
Desserts Cakes Cookies Dairy Ice Cream Sherbet Flan
Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple
Fruit > PineappleDessert > Cake
Preparation > Bake
Dessert > Dairy > SherbetFruit > Berries > Strawberries
Preparation > Freeze
![Page 95: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/95.jpg)
Advantages of Faceted Navigation
• Systematically integrates search results: reflect the structure of the info
architecture retain the context of previous
interactions• Gives users control and flexibility
Over order of metadata use Over when to navigate vs. when to
search
![Page 96: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/96.jpg)
Faceted Categories vs. Hierarchies
Stickers vs. Folders
vs
![Page 97: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/97.jpg)
Example: Medicare Prescription Drug Plan
Scam If you have folders, have to place the item into multiple folders:
Health
Elderly
Drugs
Fraud
Safety
![Page 98: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/98.jpg)
Alternative: assign stickers to the item:
Medicare Prescription Drug Plan Scam Assign categories to the item,
rather than put the item into categories
Health
Safety
Elderly
Drugs
Scams
Physicians
![Page 99: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/99.jpg)
Faceted Navigation
• User can start with any category, and see the results grouped by the other categories.
• Example: Start with Health
• See results grouped by subcategories of Health, such as Drugs, Nutrition
Alternatively, user can group results by other categories:
• Click on Financial, see Insurance, Payments, etc• Click on Teens, see results relevant to teens
![Page 100: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/100.jpg)
![Page 101: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/101.jpg)
![Page 102: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/102.jpg)
![Page 103: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/103.jpg)
Examples of Faceted Layouts
![Page 104: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/104.jpg)
![Page 105: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/105.jpg)
![Page 106: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/106.jpg)
Examples of Faceted Layouts
![Page 107: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/107.jpg)
Best Practices for SearchThank you!
Marti HearstWeb Manager University
November 10, 2009
![Page 108: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/108.jpg)
Ease of Use
REDUCE STEPS
![Page 109: Best Practices for Search](https://reader035.vdocument.in/reader035/viewer/2022062807/56814fff550346895dbdc83b/html5/thumbnails/109.jpg)
USE CLEAR LANGUAGE