Presented to the
XML Community of Practice, Architecture and Infrastructure Committee,
U.S. CIO Council
Washington, DC
September 21, 2005
FOR DISCUSSION PURPOSES ONLY
Multifaceted Information Search, Discovery, and Retrieval for the
ET.gov Site and Process
Outline
The problem
1. Objectives of demo—and use of multifaceted search at ET.gov
2. Objectives of ET.gov—and impetus *
3. Value and expected impact of ET.gov *
The context
4. ET.gov—and its relationship with the AIC and USCIOC *
5. Eight stages of ET.gov *
6. Evolution of ET.gov—from vision to execution *
7. ET.gov today and tomorrow *
The solution—and value
8. Process used to develop the ET.gov demo
9. Defining the facets and categories for ET.gov
10. ET.gov demo and test drive
11. Conclusion 1: Value of MSDR at ET.gov—and beneficiaries
12. Conclusion 2: Next steps
* Source: Adapted from xmlCoP and ET/SC documents.
1. Objectives of demo, and use of multifaceted search at ET.gov
• Illustrate how data on ETs can be organized and accessed by multiple facets (axes) and hierarchical taxonomies (categories) within
• Show how end-users within and outside the Federal government can then search, discover, and retrieve information from the ET.gov site in a human-friendly way
• Highlight the social, economic, and intelligence value that different stakeholders can derive by using multifaceted search-discovery-and-retrieval (MSDR) technology as part of the ET.gov site and process.
2. Objectives of ET.gov—and impetus
• Help facilitate the discovery of various types of components that may be beneficial to the Federal government, thereby providing a formal channel by which agencies can evaluate ETs
• Foster communication among CIOs, government decision makers, submitters, and other integrators by taking a standards-based approach that capitalizes on the benefits of XML and maximizes the benefits of the Web
• Aid the discovery of new technologies by providing a standard XML schema for component description and a submission point from which components can be evaluated, i.e., a single point of search, discovery, and retrieval (SDR).
3. Value and expected impact of ET.gov
When a government agency, technology company, or other organization registers with ET.gov, it will:
• Provide the opportunity for its emerging technology to “get discovered” by the Federal community
• Foster the maturity of its technology in the Federal market
• Shorten the “time to market” for its ET in the Federal civilian and non-civilian sectors
• Separate the “wheat from the chaff,” i.e., weed out “intergalactic technology solutions!”
3A. What we mean here is that…
“Better, faster, smarter” search will give the technology company (seller) the opportunity to “fairly” increase the exposure and visibility of its ET. It ensures that the Federal agency (searcher and buyer) looking for that ET or similar ETs will in fact find what it is looking for, with no chance of missing that ET, so long as the company registered and entered its data correctly at ET.gov.
4. ET.gov—and its relationship with the AIC and USCIOC
• The AIC is pursuing seven funded tasks in FY 2005, on top of other existing efforts, e.g., fostering communities of practice on XML and semantic interoperability
• These tasks will result in the greatest ROI in terms of the time and dedication put forth by AIC members and staff
• Task 6 is to “develop identification and validation processes for emerging technologies,” i.e., ET.gov
• Lead staff for this task under the ET S/C are: John McManus (NASA), Susan Turnbull (GSA), and Owen Ambur (DOI).
5. Eight stages of ET.gov
1. Identify + register
2. Subscribe + indicate LOI
3. Accept stewardship
4. Graduate + transition to CORE.gov
5. Budget (President + Congress)
6. Acquire for use
7. Maintain over life cycle
8. Retire + replace
6. Evolution of ET.gov: from vision to execution
[Timeline figure: 2003 > 2004 > 2005 > 2006 >]
Labels: ET S/C vision; S/C mandate; Scope; Objectives; XML schema; Other tech specs; Booz design; Feedback from S/C + CoPs; Vision; ET.gov; BLS Usability Lab testing; Stage 1 exec.; MSDR design; MSDR demo; Stage 2 exec.; Stage N exec.; Instance docs; XSDs
7. ET.gov today and tomorrow
[Figure labels]
Scope of data:
• ET = IT data: USA
• ET = IT data: USA + global
• ET = IT + XT data: USA
• ET = IT + XT data: USA + global
Scale:
• 100s of records, users, and queries
• 1,000s of records, users, and queries
• 10,000s of records, users, and queries
• 100,000s of records, users, and queries
http://et.gov/component_search.aspx
8. Process used to develop the ET.gov demo
[Architecture diagram. Labeled elements:]
• Customer Data Store
• XML Document Generation
• XML Documents, Category Hierarchy, Records
• XML Data Transformation and Index Generation
• i411 Search Server
• XML Search Request
• XML Search Response
• Customer Web Server
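The "XML Document Generation" and "XML Search Request" elements of the diagram can be illustrated with a minimal Python sketch. It is not the i411 or ET.gov implementation: the record fields, the element names (`component`, `searchRequest`, `filter`), and the `facet` attribute are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as etree

# Hypothetical record, standing in for one row of the customer data store.
record = {
    "name": "Example Component",
    "organization": "Example Org",
    "type": "Software",
}

def record_to_xml(rec):
    """Build a <component> element from a flat record (element names are illustrative)."""
    root = etree.Element("component")
    for field, value in rec.items():
        child = etree.SubElement(root, field)
        child.text = value
    return root

def build_search_request(facet_filters):
    """Build an XML search request with one <filter> element per selected facet."""
    request = etree.Element("searchRequest")
    for facet, category in facet_filters.items():
        etree.SubElement(request, "filter", facet=facet).text = category
    return request

# Serialize both structures to strings, as they might be passed between tiers.
doc_xml = etree.tostring(record_to_xml(record), encoding="unicode")
request_xml = etree.tostring(build_search_request({"type": "Software"}), encoding="unicode")
print(doc_xml)
print(request_xml)
```

In a real deployment the element names would come from the ET.gov XML schema rather than from the record keys, and the request format would be whatever the search server defines.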
9. Defining the facets + categories for ET.gov
Existing facets                   Use          Categories (examples)
1. Component name                 WHAT?        A, B, C, D…Z
2. Organization name              WHO?         A, B, C, D…Z
3. Component type                 WHAT?        Software, hardware, data
4. Relation to FEA: SRM           WHAT?        Analysis/stats, assets, BI…visualization
5. Relation to FEA: TRM           WHAT?        Component framework, service access…

Possible future facets            Use          Categories (examples)
6. Component lifecycle stage      WHAT?        ET Stage 1, ET Stage 2…ET Stage 8
7. Component cost                 HOW MUCH?    <$10,000, $10-$50,000…>$1 million
8. Organization location          WHERE?       Country, state, city, zip code
9. Organization industry          WHO?         NAICS, SIC
10. Agency name                   WHO?         A, B, C, D…Z
11. USCIOC level of interest      HOW MUCH?    Low, medium, high
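To make the idea of facets and categories concrete, here is a minimal Python sketch of multifaceted filtering and per-category counts. It is an illustration under stated assumptions, not the demo's search engine: the sample records, the field names (`component_name`, `organization_name`, `component_type`), and both helper functions are hypothetical.

```python
from collections import Counter

# Hypothetical records; the facet names loosely follow the existing facets listed above.
records = [
    {"component_name": "Alpha", "organization_name": "Org A", "component_type": "Software"},
    {"component_name": "Beta", "organization_name": "Org B", "component_type": "Hardware"},
    {"component_name": "Gamma", "organization_name": "Org A", "component_type": "Software"},
]

def filter_by_facets(items, selections):
    """Keep items whose value matches every selected facet (logical AND)."""
    return [
        item for item in items
        if all(item.get(facet) == value for facet, value in selections.items())
    ]

def facet_counts(items, facet):
    """Count how many items fall under each category of one facet."""
    return Counter(item[facet] for item in items if facet in item)

# Narrow by two facets at once, then show the counts a UI could display per category.
hits = filter_by_facets(
    records, {"component_type": "Software", "organization_name": "Org A"}
)
print([hit["component_name"] for hit in hits])
print(facet_counts(records, "component_type"))
```

Combining selections with AND is one common design choice; hierarchical categories (for example country, state, city) would need a prefix or path match instead of simple equality.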
11. Conclusion 1: Value of MSDR at ET.gov, and beneficiaries
i411 “value pillars” (rows) by beneficiaries/stakeholders (columns)
Columns: ET.gov content; USCIOC as a content owner; ET.gov orgzs.: comm, gov; ET.gov end-users
1. Search and discovery + + + +
2. Virtual aggregation + + + +
3. Virtual syndication + +
4. Content exposure (traffic) + + +
5. Content visibility (depth) + + +
6. Content access (security) + +
7. RT cstmr. responsiveness + +
8. Business agility +
9. Cost efficiencies + savings $ $
10. Revenue generation $ $
12. Conclusion 2: Next steps
1. Integrate feedback on the demo from the xmlCoP and ET S/C
2. "Beef up" the ET.gov data and instance documents
3. Refine the XML schema
4. Publish a basic XSL style sheet that can be used to render the XML instance documents
5. Make the data set crawlable, i.e., expose the database of instance documents in a simple HTML/XML interface that can be crawled by an automated Web spider
6. Integrate stemming/synonyms
7. Subscribe via RSS to a search or a particular component
8. Add, delete or refine the facets and categories
9. Refine the UI and general "look and feel" of the ET.gov site.
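As one illustration of the "crawlable" step above, the following Python sketch generates a static HTML page with one plain link per instance document, which is the kind of page a Web spider can follow. The document titles, the relative URLs, and the `build_index_page` helper are hypothetical; nothing here reflects how ET.gov actually exposes its data.

```python
from html import escape

# Hypothetical instance documents: (title, relative URL) pairs.
documents = [
    ("Example Component A", "components/a.xml"),
    ("Example <B> Component", "components/b.xml"),
]

def build_index_page(docs, title="Component index (illustrative)"):
    """Return a static HTML page with one plain link per instance document."""
    items = "\n".join(
        f'    <li><a href="{escape(url, quote=True)}">{escape(name)}</a></li>'
        for name, url in docs
    )
    return (
        "<!DOCTYPE html>\n"
        f"<html>\n<head><title>{escape(title)}</title></head>\n"
        f"<body>\n  <ul>\n{items}\n  </ul>\n</body>\n</html>\n"
    )

page = build_index_page(documents)
print(page)
```

Escaping titles and URLs keeps the page well-formed even when a component name contains markup characters.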
Thank you…and contacts
Amin Hassam [email protected]
President, Government Strategy + Solutions
i411, Inc. www.i411.com
Herndon, Virginia
703.793.3270 x140

Paul Woods [email protected]
President & CEO
Business Technology Source LLC www.biztechsource.com
Shepherdstown, West Virginia
304.876.9242

Owen Ambur [email protected], xmlCoP http://xml.gov/
Project Manager, ET.gov http://et.gov/
Program Manager, DEAR
Chief XML Strategist, U.S. DOI
U.S. Department of the Interior
202.208.5439