leveraging your taxonomy to increase user productivity maiquery and tm navtree
TRANSCRIPT
2
Taxonomies aid site organization
Taxonomy provides: Framework for content
organization Hierarchical outline of your
content by subject categories
Basis for effective browsing
3
Integrated taxonomy enhances findability
Browsable categories of a directory
Smart search for term equivalents Taxonomy terms (original or
modified) as labels Navigation aids incorporate
taxonomy terms and relationships
4
Example Search: body growth
Complete database (60,000 + titles) Free text search
8 hits — some irrelevant Free text search on titles
6 hits — limited recall Search by taxonomy descriptor (AKA
subject term or category) 470 hits
100% relevant 100% recall
5
Increasing User Productivity Items in an information
collection can be retrieved with better precision (relevance) and better recall by using a controlled vocabulary to assign subject terms (key words) to them How do you connect your
users to the controlled vocabulary?
6
Connecting Users
1. Use the rulebase you’ve developed for machine aided indexing (MAIQuery)
2. Use the controlled vocabulary itself (TM Navtree)
7
MAI’s talents MAI (Machine Aided Indexer)
helps authors and editors assign effective subject terms
automates the assignment of subject terms to items in legacy collections
8
M.A.I. suggests the correct terms from the taxonomy as descriptors
M.A.I. rulebase recognizes term equivalents germs Microorganisms vaccin* Pharmaceutical drugs
Recognizing term equivalents enables enhanced search
Taxonomy terms on documents help sort and organize the content
9
MAI’s “hidden talents” MAI can also:
Provide for the appropriate preferred term when given a word or phrase
Return preferred terms for uses of the word in different contexts
10
More “hidden talents” MAIQuery can:
Show related terms from the thesaurus to broaden a search
Show the rules and preferred term’s scope notes to clarify how the preferred term relates to others in the thesaurus
11
Presenting: MAIQuery™ Web page presents a search
box that will use the MAI rulebaseCan be in addition to full text
search and advanced search User enters a word or phrase
in the search box MAI searches the rulebase for
any occurrences of the word(s)
13
the MAIQuery demo Uses web pages and php
coding:Passes the search words to
“dosearch.php”dosearch.php passes the term
to MAI’s concept extractorMAI returns a list of suggested
terms from the controlled vocabulary
14
Suggested terms
The term Music is suggested by the rule for music*(1)Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule
The term Instrumental Music is suggested by the rule for music*(1)Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule
15
Options Thesaurus Master can be
queried to show the term recordBroader termNarrower termsUse For terms (“synonyms”)Related termsScope notes
19
Options, continued Your database/index of items is
then queried to bring back the records in your collection that are indexed with the preferred termFor our demo, we wrote an
xquery request into the gettitles.php file
Our 1100-title demo records are maintained by a MarkLogic server
21
Choose the item Your user clicks on the
item(s) appropriate to their queryThe document details (or the
item itself) is returned
23
How’s it working?
What words and phrases do your users search for?
a search log can record “misses”
a user focus group can suggest additions
subject matter experts can help in their area of expertise
24
Fine tuningModify your taxonomy to
respond to more words add common misspellings to
rules add alternate words as Use
For terms (synonyms) in the thesaurus (or as additions to the rules)
consider terms for addition to the thesaurus (candidates)
25
The advantages MAIQuery connects your user
with the controlled vocabulary
Your user can review term records and rulebase rules to learn more about your taxonomy
Your user becomes more productive
26
Another way to connect users
Category search used more than half the time for research
Also known as directory search, your user “drills down” from general to specific
27
Value of Category search Searchers find info 50%
faster using browsable categories than using list returned from free text searchResults even stronger when
results not in top 20 returns Searchers prefer browsable
category searchChen, H., and Dumais, S.
33
Thesauri OnLine Australian Governments' Interactive
Functions Thesaurus – AGIFThttp://www.naa.gov.au/recordkeeping/thesaurus/000482.htm
Transportation Research Thesaurus – TRT http://ntl.bts.gov/trt/trt_topterms.jsp
NBII (National Biological Information Infrastructure)http://thesaurus.nbii.gov/SearchNBIIThesaurus/about.faces
34
Presenting: TM Navtree Your thesaurus presented as
a navigation aid User “drill down” with all the
neighboring terms visible Each term indicates the
number of documents indexed with it
Terms are hyperlinks to a list of items
39
How it’s done We used PHP Levels, an open
source application from SourceForge to create the tree
An exported XML version of the thesaurus is parsed to produce the required text file to populate the tree
The content manager is queried for the document totals
40
How it’s done, continued When a term is selected, it is
passed to a gettitles.php A bit of php code connects to
the content manager and returns a string of data about each title
The web page displays the data in the format desired
41
The advantages TM Navtree Top Terms
describe the organization of your collection(s)
Narrower terms help your user hone in on the most appropriate term
Adjacent terms impart connotation
42
The advantages ALL the records indexed with
the chosen term are returned Your user finds what’s
needed more quickly and is more productive
43
Questions? Comments?
Try out the demo atwww.mediasleuth.com
See more details:Data Harmony Programmer
Interface for Web ApplicationsThank you.Mary Garcia