taxonomy tools: collaboration, creation & integration dow jones … · 2007. 9. 14. ·...
TRANSCRIPT
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy Tools:
Collaboration, Creation &
Integration
Dave Clarke
Global Taxonomy Director
Dow Jones & Company
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Introduction
1. Collaboration 2. Creation 3. Integration
Software Tools for Taxonomy
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
– Handle massive volumes of data 24x7 every day:
• Over 500,000 documents per day
• 10,000+ Sources
• 22 languages
– Expertise to create and maintain a robust taxonomy including:
• 310,000+ company codes
• 820+ industries
• 520+ subjects
• 340+ regions
60
Terra Byte
Content Server
3.6 Million
Documents/Month
700
Feeds
152
Countries
Dow Jones
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy Tools
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy Tools
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Collaboration
• WHO needs to get involved?
• WHAT do they need to do?
• HOW do they work together?
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
WHO
• Categorization
• Content Management
• Information Technology
• Knowledge Management
• Knowledge Workers
• Library
• Metadata
• Ontology
• Search
• Subject Matter Expertise
• Taxonomy
Cross-functional
Team
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
WHAT
Assess
Business Goals
Content
IT
Metadata
Taxonomy
Standards & Best
Practices
Users
Design
Audience
Segmentation &
Definition
Facet Analysis
Information
Architecture
Editorial
Guidelines &
Workflow
Build
Entity Extraction
(machine and/or
human)
Content Tagging
Rules (machine
and/or human)
Taxonomy
Construction &
Mapping
Maintain
Continuous
Work-in-progress
Engage end-
users (query log
analysis, focus
groups,
folksonomy)
Governance
Process
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts
Location independent
access for in-house
stakeholders and very
often external
consultants and SMEs
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alertsWork-oriented views for
teams of people
performing different
tasks…
…the flip side of the
collaboration coin is
compartmentalization
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts
Multiple levels of
functional permission for
fine-tuning what users
can do to particular sets
of terms
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts
New Candidates
Deactivated / Deleted
Rejected for Rework
Withdrawn &
Replaced By
Primary Review
Secondary
QC
Approved &
Published
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts
Design “need-to-know”
reports for each
stakeholder group /
stage in the workflow
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
HOW
• Web workspace
• Task-oriented
• Role-based
• Workflow
• Governance alerts Schedule the reports to
be generated
automatically
And to email alerts to
designated recipients
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Creation(models, methods and trends for building taxonomies)
• Folksonomies
• Taxonomies
• Semantic webs
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Classic Taxonomy
Classification based
Web portals
Navigation aids
File-folder metaphor
Ad-hoc groupings
2-dimensional
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Faceted Taxonomy
Separate taxonomies for
individual attributes
Content tagged
to facets
separately
not
pre-
-coordinated
Used as orthogonal
search filters
n-Dimensional
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Faceted Taxonomy in Action
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Faceted Taxonomy in Action
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Folksonomy
Web 2.0Folksonomy
Social tagging
Wikis
Blogs
User participation
Un-controlled Un-structured
Tag
Clouds
Collaboration
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Pros & Cons
Folksonomy lets
users create (and
adopt) terminology
that is meaningful to
themselves but does
so at the expense
of precision and
recall for the
general user (meta
noise).
Controlled
vocabularies
solve the
precision-recall
trade off but their
insistence on “preferred
terminology” imposes one-
size-fits-all order on a
heterogeneous user community.
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
A Middle Path
Audience-Centric Taxonomy
1. Segment a user community into “Audiences”
2. Develop a core-taxonomy but append
extensions to it which store the terminology
and hierarchy preferences of each audience
3. Leverage folksonomy and social tagging
systems to help inform the evolution of the
audience-centric taxonomies
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Audience-Centric ViewsThe world of your content
Conceptual
representation of
the content as a
semantic web
Audience-centric
views provide
access and
navigation
orientated for
different user
perspectives
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Semantic Webs & Ontologies
sem
antic
web
Formally defined
relationships
Concept-oriented rather
than terminology-
oriented
Extensible concept
types & extensible
relationship types
Resource Description
Framework (RDF)
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Integration
• Components
• Talking to each other
• RDF Files & Web Service
Calls
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Components
Taxonomy
Content
Categorization
Search
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Taxonomy
Categorization
Content Search
Open
Standards
• n Components
• 1 Common integration
W3C RDF-based Open
Standards (SKOS & OWL)
Web Services – ad hoc
transactions and small
data sets
XML File Libraries –
published versions and
large data sets
Talking to Each Other
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
EXAMPLE
From Idea to
Published Output
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
John Doe
ABC Corporation
New York
Employed
By
Employer
Of
Located
In
Location
Of
Manufacturer
Of
Vendor
To
WidgetsManufactured
By
PQR Corporation
XYZ Corporation
Client
Of
Client
Of
Vendor
To
whiteboard your entities and
relationships
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Step 1
Design the
conceptual
structure
•Concept types
•Data elements
•Relationship types
•Semantic rules
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Step 2
Input entities and
build relationships
•Key data via GUI
•Import Excel files
•Import XML files
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Step 3
Publish
•HTML Browser
•CSV Download
•XML/RDF Export
HTML
CSV
XML
From
whiteboard to
published RDF
in 30 minutes
Taxonomy Tools: Collaboration, Creation & Integration – iKMS Workshop, Singapore
Thank
YouQuestions
Comments