strategies llc taxonomy october 4, 2006copyright 2006 taxonomy strategies llc. all rights reserved....
TRANSCRIPT
Strategies LLCTaxonomy
October 4, 2006 Copyright 2006 Taxonomy Strategies LLC. All rights reserved.
Making the Business Case for Metadata:
U.S. Environmental Protection Agency
Case Study
Joseph A Busch, Principal
2Taxonomy Strategies LLC The business of organized information
Who I am: Joseph Busch
Over 25 years in the business of organized information. Founder, Taxonomy Strategies LLC Director, Solutions Architecture, Interwoven VP, Infoware, Metacode Technologies
– (acquired by Interwoven, November 2000)
Program Manager, Getty Foundation Manager, Pricewaterhouse
Metadata and taxonomies community leadership. President, American Society for Information Science & Technology Director, Dublin Core Metadata Initiative Adviser, National Research Council Computer Science and
Telecommunications Board Reviewer, National Science Foundation Division of Information and
Intelligent Systems Founder, Networked Knowledge Organization Systems/Services
3Taxonomy Strategies LLC The business of organized information
Recent & current projects
4Taxonomy Strategies LLC The business of organized information
Agenda
Metadata themes & issues Metadata costs & benefits
5Taxonomy Strategies LLC The business of organized information
Overall enterprise metadata goals
Provide a single methodology for categorizing information across EPA offices, programs, and regions.
Reduce the time it takes to successfully target and find cross-Program/Region information
Enable and enforce content linking across the agency Build common agency-wide terminology resources
Eliminate multiple, ambiguous taxonomies Eliminate multiple glossaries, abbreviations and acronyms
Group things differently depending on the context e.g., ground water with drinking water, or ground water with water
quality Get the right content to the right people in the right format
at the right time.
6Taxonomy Strategies LLC The business of organized information
Epa.gov site search
Put almost everything in the search index (PDF libraries, EIMS and other repositories, etc.)
Search across Programs & Regions on broad reaching topics.
Enable automatic synonym search (query expansion) Avoid retrieving too much stuff. Provide guidance about how to choose the best
keywords. Provide a single EPA-wide glossary, abbreviations and
acronyms resource. Find related information on-the-fly (more like this) Identify major new resources (new topics, new major
sites, documents of interest, what’s important today)
7Taxonomy Strategies LLC The business of organized information
Link information to & from Programs
Enforce linking to major national information. Identify & recommend related content when pages are
submitted to CMS (content management system). Provide reliable & efficient process to efficiently assemble
super topic sites, e.g., on mercury. Define process to assemble topical sites quickly, e.g., in
response to Hurricane Katrina. Relate things to a certain application that supports a
service being provided to certain end users. Scope linking beyond EPA including GPO, Dept of
Energy, & some commercial sites.
* GPO – Government Printing Office
8Taxonomy Strategies LLC The business of organized information
Issues in using information from other Programs
Can’t reliably search information from other programs. Terminology often means different things in different
contexts, e.g., “risk assessment.” Information from other programs may be contradictory,
e.g., research information may conflict with regulations. Links send people to another site. Different look & feel. Links are not necessarily reciprocal.
9Taxonomy Strategies LLC The business of organized information
Metadata
Require metadata. Make it easy to add & change metadata.
Incorporate metadata into content workflow. Pre-populate metadata.
Provide incentives for quality user-contributed metadata Provide almost instantaneous feedback using that metadata.
Make it easy to change website organization without having to re-index or re-code pages.
10Taxonomy Strategies LLC The business of organized information
Agenda
Metadata themes & issues Metadata costs & benefits
11Taxonomy Strategies LLC The business of organized information
Success measures: Usage metrics
Reduce FOIA requests/costs. Expand use to include different types of people (new audiences) Improve customer satisfaction survey results
Score higher on American Customer Satisfaction Index (ACSI) government-wide survey
Improve OMB Performance and Accountability Reports (PARS) Show cause and effect especially between regulation & measured
outcome, e.g, arsenic removed from water and health. Provide more visibility for research pages.
Reduce cost per unique user (UU) Increase Web statistics (page hits)
Increase number of successful website searches. Others?
* FOIA – Freedom of Information Act.
** OMB – Office of Management & Budget.
12Taxonomy Strategies LLC The business of organized information
Success measures: IA Metrics
Increased number of links. Increased number of internal cross-cutting links. Reduced time to build super topic website. Increased number of metadata fields filled-in. Others?
13Taxonomy Strategies LLC The business of organized information
Benefit: Reduce FOIA* Requests
Number of FOIA requests 5,000
Average cost per FOIA response $ 500
FOIA response cost per year $2,500,000
Percentage reduction of FOIA requests per year 5%
FOIA cost savings per year $ 125,000
Potential benefit
* FOIA – Freedom of Information Act.
14Taxonomy Strategies LLC The business of organized information
Benefit: Reduce Other Information Requests
Number of information requests 50,000
Average cost per response $ 30
Info response cost per year $1,500,000
Percentage reduction of info requests per year 50%
Info requests cost savings per year $ 750,000
Potential benefit
15Taxonomy Strategies LLC The business of organized information
Benefit: Reduce cost per unique user (UU) 2005 2006 Total IT/Data management budget* $13,020,000 $12,657,400
Percentage epa.gov 1% 1%
Estimated epa.gov budget $ 1,320,000 $ 1,265,740
Epa.gov unique users per year 10,000,000 10,500,000
Cost per user $ .13 $ .12
Percentage increase in users per year 5%
Potential benefit
* Summary of EPA 2006 Budget.
16Taxonomy Strategies LLC The business of organized information
Benefit: Expand to new audiences %
Identified target audiences Industry Advocacy General Public
33%
33%
33%
Online survey self-identified audiences Business & Industry Advocacy General Public Competitors/Collaborators State & local governments Tribal councils Academic K-12
15%
10%
20%
10%
15%
5%
10%
5%
Potential benefit
17Taxonomy Strategies LLC The business of organized information
Benefit: Improve customer satisfaction
ACSI* score
Average clicks per UU session 10
Reduce clicks by 10%
Potential benefit
* ACSI – American Customer Satisfaction Index.
18Taxonomy Strategies LLC The business of organized information
Benefit:Improve OMB* Performance and Accountability Reports (PARS) 2006 2007
Research scores amber green
No links to/from research pages 1,000 2,000
% increase in links to/from Research pages 100%
Potential benefit
* OMB – Office of Management & Budget.
19Taxonomy Strategies LLC The business of organized information
Benefit:Increase number of successful website searches 2006 2007
No of searches with 0 hits 100,000 20,000
% decrease in searches with 0 hits 500%
Potential benefit
20Taxonomy Strategies LLC The business of organized information
Benefit:Increase number of links (internal cross-cutting links) 2006 2007
No. of links to/from epa.gov pages 200,000 400,000
% increase in links to/from epa.gov pages 100%
Potential benefit
21Taxonomy Strategies LLC The business of organized information
Benefit:Reduce time to build topic websites
Time to build Hurricane Katrina site 1 month
Cost to build Hurricane Katrina site $ 250,000
Time to build Mercury site 1 year
Cost to build Mercury site $ 750,000
Potential benefit
22Taxonomy Strategies LLC The business of organized information
Benefit:Increased metadata consistency & quality 2006 2007
Total epa.gov webpages 500,000 500,000
No of webpages with metatags 50,000 100,000
Percentage pages with metatags 10% 20%
Potential benefit
23Taxonomy Strategies LLC The business of organized information
Benefit: Decrease time to regulation
Cost per regulation $ 150,000
Percentage decrease 10%
Regulations per year 100
Total savings $ 1,500,000
Potential benefit
24Taxonomy Strategies LLC The business of organized information
Benefit: Improve e-government (one-stop-shop)
Current avg cost to obtain permit $ 1,500
Total number of permits 100,000
Percentage decrease 5%
Total savings $ 7,500,000
Potential benefit
25Taxonomy Strategies LLC The business of organized information
Benefit: Decrease grant cycle
Get information out faster
Improve quality of grant proposals
More effective use of grant money
Potential benefit
26Taxonomy Strategies LLC The business of organized information
Other potential beneits?
27Taxonomy Strategies LLC The business of organized information
Estimated tagging costs
Taxonomy Facet Hier?TypicalCV Size
Time/ Value (min)*
Avg # values /
Item $ / MinCost/
Element
Audiences N 10 .25 2 $ 0.42 $ 0 .21
Substances N 20,000 .50 3 $ 0.42 $ 0.63
Functions Y 500 .50 1 $ 0.42 $ 0.21
Industries Y 2,500 .50 2 $ 0.42 $ 0.42
Information Types N 20 .25 1 $ 0.42 $ 0.11
Locations Y 5,000 .50 2 $ 0.42 $ 0.42
Organizations Y 50 .50 2 $ 0.42 $ 0.42
Topics Y 100 .50 2 $ 0.42 $ 0.42
TOTALS 28,180 3.50 15 $ 2.84
* Assumption: All values are automatically populated using business rules, clues, or training sets. So the time per value is editorial quality checking, not indexing from scratch.
28Taxonomy Strategies LLC The business of organized information
Cost assumptions
ASSUMPTIONS
Enterprise SW License* $ 250,000
Annual Maintenance/Support 15%
SW Implementation 100%
Annual Application Tech Support $ 50,000
Legacy Content Items 100,000
Content Growth Rate 15%
Tagging/Item $ 2.84
Enterprise Taxonomy $ 100,000
* Assumption: Includes one-time Documentum, Northern Lights, and other SW license costs
29Taxonomy Strategies LLC The business of organized information
Total cost of ownership (TCO)
Description Year 1 Year 2 Year 3 Year 4 Year 5
SW
Licenses $ 250,000
Maintenance $ 37,500 $ 37,500 $ 37,500 $ 37,500
Implementation $ 250,000
Appl Support $ 37,500 $ 37,500 $ 37,500 $ 37,500
Tagging
Legacy Content $ 284,000
Ongoing $ 42,600 $ 42,600 $ 42,600 $ 42,600
Taxonomy
Creation $ 100,000
Maintenance $ 15,000 $ 15,000 $ 15,000 $ 15,000
TOTAL $ 884,000 $ 132,600 $ 132,600 $ 132,600 $ 132,600
30Taxonomy Strategies LLC The business of organized information
Benefits
Service efficiency increase
Reduce FOIA Requests 10% $ 250,000
Reduce Other Information Requests 50% $ 1,250,000
Reduce cost per UU (unique user) 8% $ 108,333
Reduce time/cost to build super topic website 50% $ 250,000
Increase no. of webpages with metatags 100% $ 142,000
Decrease time/cost-to-regulation cycle 10% $ 1,500,000
Decrease time/cost to obtain permit 5% $ 7,500,000
TOTAL $ 11,000,333
31Taxonomy Strategies LLC The business of organized information
ROI Calculations
Description Year 1 Year 2 Year 3 Year 4 Year 5
Costs
SW Licenses/Maintenance $ 250,000 $ 37,500 $ 37,500 $ 37,500 $ 37,500
Implementation/Support $ 250,000 $ 37,500 $ 37,500 $ 37,500 $ 37,500
Build/Maintain Taxonomy $ 100,000 $ 15,000 $ 15,000 $ 15,000 $ 15,000
Legacy/Ongoing Tagging $ 284,000 $ 42,600 $ 42,600 $ 42,600 $ 42,600
Benefits*
Reduce info requests $ 1,500,000 $ 1,500,000 $ 1,500,000 $ 1,500,000
Improve website efficiency $ 500,333 $ 500,333 $ 500,333 $ 500,333
Service efficiency gains $ 9,000,000 $ 9,000,000 $ 9,000,000 $ 9,000,000
Yearly Net Benefits $ (884,000) $ 10,867,733 $ 10,867,733 $ 10,867,733 $ 10,867,733
Payback period 1.9 Years until Benefits = Costs
Strategies LLCTaxonomy
October 4, 2006 Copyright 2006 Taxonomy Strategies LLC. All rights reserved.
Questions?
Joseph A. Busch
415-377-7912,
For more information about the EPA Taxonomy Project please see:
http://taxonomy.jkshirey.us/doku.php
Logon: EPA-Taxonomy
Passcode: taxonomy