Presented to the XML Community of Practice, Architecture and Infrastructure Committee, U.S. CIO Council Washington, DC September 21, 2005 FOR DISCUSSION PURPOSES ONLY Multifaceted Information Search, Discovery, and Retrieval for the ET.gov Site and Process


Page 1: Presented to the XML Community of Practice, Architecture and Infrastructure Committee,

Presented to the

XML Community of Practice, Architecture and Infrastructure Committee,

U.S. CIO Council

Washington, DC

September 21, 2005

FOR DISCUSSION PURPOSES ONLY

Multifaceted Information Search, Discovery, and Retrieval for the ET.gov Site and Process

Page 2

Outline

The problem

1. Objectives of demo—and use of multifaceted search at ET.gov

2. Objectives of ET.gov—and impetus *

3. Value and expected impact of ET.gov *

The context

4. ET.gov—and its relationship with the AIC and USCIOC *

5. Eight stages of ET.gov *

6. Evolution of ET.gov—from vision to execution *

7. ET.gov today and tomorrow *

The solution—and value

8. Process used to develop the ET.gov demo

9. Defining the facets and categories for ET.gov

10. ET.gov demo and test drive

11. Conclusion 1: Value of MSDR at ET.gov—and beneficiaries

12. Conclusion 2: Next steps

* Source: Adapted from xmlCoP and ET/SC documents.

Page 3

1. Objectives of demo—and use of multifaceted search at ET.gov

• Illustrate how data on ETs can be organized and accessed by multiple facets (axes) and hierarchical taxonomies (categories) within

• Show how end-users within and outside the Federal government can then search, discover, and retrieve information from the ET.gov site in a human-friendly way

• Highlight the social, economic, and intelligence value that different stakeholders can derive by using multifaceted search-discovery-and-retrieval (MSDR) technology as part of the ET.gov site and process.

Page 4

2. Objectives of ET.gov—and impetus

• Help facilitate the discovery of various types of components that may be beneficial to the Federal government, thereby providing a formal channel by which agencies can evaluate ETs

• Foster communication among CIOs, governmental decision makers, submitters, and other integrators by taking a standards-based approach to capitalize on the benefits of XML and maximize the benefits of the Web

• Aid the discovery of new technologies by providing a standard XML schema for component description and a submission point from which components can be evaluated, i.e., a single point of SDR.

Page 5

3. Value and expected impact of ET.gov

When a government agency, technology company, or other organization registers with ET.gov, ET.gov will:

• Give the organization's emerging technology the opportunity to “get discovered” by the Federal community

• Foster the maturity of that technology in the Federal market

• Shorten the “time to market” for the ET in the Federal civilian and non-civilian sectors

• Separate the “wheat from the chaff,” i.e., weed out “intergalactic technology solutions!”

Page 6

3A. What we mean here is that…

“Better, faster, smarter” search will give the technology company (seller) the opportunity to “fairly” increase the exposure and visibility of its ET, making sure that the Federal agency (searcher and buyer) looking for that ET or similar ETs will, in fact, find what it is looking for, with NO chance of not finding that ET, so long as the company registered and input its data correctly at ET.gov.


Page 7

4. ET.gov—and its relationship with the AIC and USCIOC

• The AIC is pursuing seven funded tasks in FY 2005, on top of other existing efforts, e.g., fostering communities of practice on XML and semantic interoperability

• These tasks will result in the greatest ROI in terms of the time and dedication put forth by AIC members and staff

• Task 6 is to “develop identification and validation processes for emerging technologies,” i.e., ET.gov

• Lead staff for this task under the ET S/C are: John McManus (NASA), Susan Turnbull (GSA), and Owen Ambur (DOI).

Page 8

5. Eight stages of ET.gov

1. Identify + register

2. Subscribe + indicate LOI

3. Accept stewardship

4. Graduate + transition to CORE.gov

5. Budget (President + Congress)

6. Acquire for use

7. Maintain over life cycle

8. Retire + replace

Page 9

6. Evolution of ET.gov: from vision to execution

[Timeline figure, 2003 > 2004 > 2005 > 2006, from Vision to ET.gov. Labels shown: ET S/C vision; S/C mandate; Scope; Objectives; XML schema; Other tech specs; Booz design; Feedback fr S/C + CoPs; BLS Usability Lab testing; Stage 1 exec.; MSDR design; MSDR demo; Stage 2 exec.; Stage N exec.; Instance docs; XSDs]

Page 10

7. ET.gov today and tomorrow

ET = IT data: USA

ET = IT data: USA + global

ET = IT + XT data: USA

ET = IT + XT data: USA + global

100s of records, users, and queries

1,000s of records, users, and queries

10,000s of records, users, and queries

100,000s of records, users, and queries

http://et.gov/component_search.aspx

Page 11

8. Process used to develop the ET.gov demo

[Process diagram. Components shown: Customer Data Store; XML Document Generation; XML Documents (Category Hierarchy, Records); XML Data Transformation and Index Generation; i411 Search Server; Customer Web Server; XML Search Request; XML Search Response]
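
The components named in this process diagram (document generation from a data store, index generation, and an XML search request and response) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the sample records, the element names `component`, `searchRequest`, `term`, `searchResponse`, and `hit`, and the word-level index are hypothetical and do not represent the i411 Search Server interface or the ET.gov schema.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical records standing in for the customer data store.
RECORDS = [
    {"id": "c1", "name": "Alpha Parser", "type": "Software"},
    {"id": "c2", "name": "Beta Sensor", "type": "Hardware"},
    {"id": "c3", "name": "Gamma Parser", "type": "Software"},
]


def record_to_xml(record):
    """XML document generation: build one <component> element per record."""
    elem = ET.Element("component", id=record["id"])
    for field in ("name", "type"):
        ET.SubElement(elem, field).text = record[field]
    return elem


def build_index(documents):
    """Index generation: map each lowercase word in <name> to component ids."""
    index = defaultdict(set)
    for doc in documents:
        for word in doc.findtext("name").lower().split():
            index[word].add(doc.get("id"))
    return index


def handle_search(request_xml, index):
    """Answer an XML search request with an XML search response."""
    term = ET.fromstring(request_xml).findtext("term").lower()
    response = ET.Element("searchResponse")
    for component_id in sorted(index.get(term, ())):
        ET.SubElement(response, "hit", id=component_id)
    return ET.tostring(response, encoding="unicode")


documents = [record_to_xml(r) for r in RECORDS]
index = build_index(documents)
print(handle_search("<searchRequest><term>Parser</term></searchRequest>", index))
```

The sketch keeps the data store, the generated documents, and the index as separate steps so that each could be replaced independently, which is one plausible reading of the diagram rather than a description of the actual deployment.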

Page 12

9. Defining the facets + categories for ET.gov

Existing facets                    Use          Categories (examples)
1. Component name                  WHAT?        A, B, C, D…Z
2. Organization name               WHO?         A, B, C, D…Z
3. Component type                  WHAT?        Software, hardware, data
4. Relation to FEA: SRM            WHAT?        Analysis/stats, assets, BI…visualization
5. Relation to FEA: TRM            WHAT?        Component framework, service access…

Possible future facets             Use          Categories (examples)
6. Component lifecycle stage       WHAT?        ET Stage 1, ET Stage 2…ET Stage 8
7. Component cost                  HOW MUCH?    <$10,000, $10,000-$50,000…>$1 million
8. Organization location           WHERE?       Country, state, city, zip code
9. Organization industry           WHO?         NAICS, SIC
10. Agency name                    WHO?         A, B, C, D…Z
11. USCIOC level of interest       HOW MUCH?    Low, medium, high
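
To make the idea of facets and categories concrete, here is a minimal sketch of multifaceted refinement over a handful of records. The facet keys mirror three of the existing facets listed on this slide, but the records, organization names, and function names are invented for illustration and are not taken from the ET.gov data or the demo.

```python
from collections import Counter

# Hypothetical component records; facet keys follow the slide, values are invented.
COMPONENTS = [
    {"component_name": "Alpha Parser", "organization_name": "Acme", "component_type": "Software"},
    {"component_name": "Beta Sensor", "organization_name": "Acme", "component_type": "Hardware"},
    {"component_name": "Gamma Parser", "organization_name": "Borealis", "component_type": "Software"},
]


def refine(records, selections):
    """Keep only the records that match every selected facet category."""
    return [
        r for r in records
        if all(r.get(facet) == category for facet, category in selections.items())
    ]


def facet_counts(records, facet):
    """Count how many records fall under each category of one facet."""
    return Counter(r[facet] for r in records)


# Drill down by one facet, then see how the remainder splits on another.
software = refine(COMPONENTS, {"component_type": "Software"})
print([r["component_name"] for r in software])
print(facet_counts(software, "organization_name"))
```

Showing the per-category counts after each selection is what lets a user discover what is available rather than guess at keywords, which is the behavior the facet list is meant to support.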

Page 13

10. ET.gov demo and test drive

http://etgov.i411.com

Page 14

11. Conclusion 1: Value of MSDR at ET.gov— and beneficiaries

i411 “value pillars”

Beneficiaries/stakeholders: ET.gov content; USCIOC as a content owner; ET.gov orgzs. (comm, gov); ET.gov end-users

1. Search and discovery + + + +

2. Virtual aggregation + + + +

3. Virtual syndication + +

4. Content exposure (traffic) + + +

5. Content visibility (depth) + + +

6. Content access (security) + +

7. RT cstmr. responsiveness + +

8. Business agility +

9. Cost efficiencies + savings $ $

10. Revenue generation $ $

Page 15

12. Conclusion 2: Next steps

1. Integrate feedback on the demo from the xmlCoP and ET S/C

2. "Beef up" the ET.gov data and instance documents

3. Refine the XML schema

4. Publish a basic XSL style sheet that can be used to render the XML instance documents

5. Make the data set crawlable, i.e., expose the database of instance documents in a simple HTML/XML interface that can be crawled by an automated Web spider

6. Integrate stemming/synonyms

7. Subscribe via RSS to a search or a particular component

8. Add, delete or refine the facets and categories

9. Refine the UI and general "look and feel" of the ET.gov site.
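
As an illustration of what step 6 (stemming/synonyms) could involve, the following sketch expands a query with synonyms and reduces words to a crude stem before matching. The synonym table and the suffix-stripping rule are assumptions made for this example; a production system would more likely use an established stemming algorithm and a curated vocabulary.

```python
# Hypothetical synonym table; a real deployment would curate its own.
SYNONYMS = {"software": {"application", "program"}}


def naive_stem(word):
    """Crude suffix stripping for illustration; not a real stemming algorithm."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def expand_query(query):
    """Return the set of stemmed terms for a query, including synonyms."""
    terms = set()
    for word in query.lower().split():
        terms.add(naive_stem(word))
        for synonym in SYNONYMS.get(word, ()):
            terms.add(naive_stem(synonym))
    return terms


print(sorted(expand_query("Software parsers")))
```

Applying the same `naive_stem` function to indexed text and to queries is what makes a search for "parsers" match a record that says "parser".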

Page 16

Thank you…and contacts

Amin Hassam [email protected]
President, Government Strategy + Solutions
i411, Inc. www.i411.com
Herndon, Virginia
703.793.3270 x140

Paul Woods [email protected]
President & CEO
Business Technology Source LLC www.biztechsource.com
Shepherdstown, West Virginia
304.876.9242

Owen Ambur [email protected], xmlCoP http://xml.gov/
Project Manager, ET.gov http://et.gov/
Program Manager, DEAR
Chief XML Strategist, U.S. DOI
U.S. Department of the Interior
202.208.5439