Search Technology, Inc. Tech Mining: Concept, Methods and Applications in Science Policy & Technology Management Alan Porter Director of R&D, Search Technology, Inc. & Co-Director, Technology Policy & Assessment Center Georgia Tech [email protected]


Page 1: Tech Mining: Concept, Methods and Applications in Science Policy & Technology Management

Search Technology, Inc.

Alan Porter
Director of R&D, Search Technology, Inc.
&
Co-Director, Technology Policy & Assessment Center, Georgia Tech

[email protected]

Page 2: Agenda

1. Competitive World
2. Information Resources + Software Tools
3. Case Examples
4. Discussion: How to access and use research knowledge better?

Page 3: SciSIP

• President’s Science Advisor (Marburger): Science of Science Policy
• US National Science Foundation Program: Science of Science & Innovation Policy
• Empirical grounding for R&D policy: Data + Analytics + Visualizations

Page 4: Innovation Challenges

• Radical Innovation
  - Discontinuous change – implies unfamiliar realms
  - Increasingly science-based technologies (challenge to predict breakthroughs)
• Open Innovation
  - Leveraging the knowledge economy
  - Seek external R&D knowledge & collaboration
  - Case example: 35% of Procter & Gamble’s recent innovation based on major external contributions

Page 5: The Need: A way to formulate wise ST&I Policy and Technology Management

• Information is key
• Internal R&D information
+
• External R&D intelligence

Page 6: The Problem

• Information overload

Page 7: The Solution

• Access the best ST&I information resources
• Apply powerful Tech Mining (software) tools (e.g., TDA – Thomson Data Analyzer)

Page 8: Six information types

Technical Information
• Science, Technology & Innovation (“ST&I”) Databases (e.g., Web of Science; CSCD, Thomson Innovation)
• Internet Sources (e.g., Googling)
• Technical Expertise

Contextual Information
• Business, competition, customer, policy, popular content Databases (e.g., Thomson One)
• Internet Sources (e.g., blogs, website profiling)
• Business Expertise

Page 9: How do you deal with all this information?

Tech Mining
Alan L. Porter and Scott W. Cunningham
John Wiley & Sons Inc., 2005

Page 10: The Tech Mining Process

[Diagram: Tech Mining]

Page 11: MOT Issues, Questions, and Indicators

13 MOT Issues
• R&D Portfolio Mgt
• R&D Project Initiation
• Engr Project Initiation
• New Product Development
• Strategic Planning
• Track/forecast emerging or breakthrough technologies
• etc.

39 MOT Questions
What?
• What’s hot?
• Fit into tech landscape?
• New frontiers at fringe?
• Drivers?
• Competing technologies?
• Likely development paths?
Who?
• Who are available experts?
• Which universities or labs lead?

~200 Innovation Indicators
• Mapping of topic clusters within the technology
• 3-D trend charts for topic clusters
• Ratio of conference to journal papers (benchmarked)
• Scorecard rate-of-change metrics for topic clusters
• Time slices to show evolution of topical emphases
• Topic growth modeling (S-curve) fit & extrapolation
• Profile table of main players
• Pie chart: Company vs. Academic vs. Government publishing
• Spreading (or constricting) # of players by topic
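
Illustration of the "Topic growth modeling (S-curve) fit & extrapolation" indicator: a minimal sketch, assuming a logistic curve with a known ceiling, fitted by regressing the logit of cumulative counts on time. The function names, the ceiling and the synthetic counts are hypothetical; the deck does not say which growth model or fitting method TDA uses.

```python
import math

def predict(t, a, b, ceiling):
    """Logistic S-curve: N(t) = ceiling / (1 + exp(-(a + b*t)))."""
    return ceiling / (1.0 + math.exp(-(a + b * t)))

def fit_logistic(times, cumulative, ceiling):
    """Least-squares fit of a and b on the logit scale.

    Assumes every cumulative count lies strictly between 0 and ceiling,
    so that ln(N / (ceiling - N)) = a + b*t is defined.
    """
    ys = [math.log(count / (ceiling - count)) for count in cumulative]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(ys) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, ys))
    b = sxy / sxx
    a = mean_y - b * mean_t
    return a, b

# Synthetic cumulative publication counts (NOT real data), generated from a
# known curve so the fit can be checked. Times are years since the first record.
true_a, true_b, ceiling = -4.0, 0.8, 1000.0
times = list(range(10))
cumulative = [predict(t, true_a, true_b, ceiling) for t in times]

a, b = fit_logistic(times, cumulative, ceiling)
forecast = predict(12, a, b, ceiling)  # extrapolate two years past the data
print(f"a={a:.3f}, b={b:.3f}, forecast at t=12: {forecast:.1f}")
```

In practice the ceiling is itself uncertain, so an analyst would typically try several plausible ceilings and compare the fits.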

Page 12: Application Examples

• R&D Portfolio Management
• Research Evaluation
• Research Profiling
• Tracking R&D over time
• Research Network Analyses
• Monitoring Research Knowledge Flows
• Geo-mapping
• ST&I Indicators

Page 13: NSF Proposal Assignment System: First Stage

Page 14: Research Program Evaluation: EPA STAR

• Endocrine Disruptors (1 of 2 programs assessed, for NAS Committee evaluation)
• EPA provided reports on papers resulting from the projects (funding started in 1996)
• We searched databases (especially Web of Science) for research in the target domains, and for citations generally and to the EPA project papers
• Compared co-authoring patterns before & after the EPA funding [effect of funding on teaming]
• Found major disjunction – only ~5% of the project-based papers appeared in the target research domains!
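
The "disjunction" figure above is a set-overlap share. A minimal sketch of that arithmetic, assuming each paper carries a unique record identifier; the identifiers and counts below are made up for illustration and are not the EPA STAR data.

```python
def share_in_domain(project_ids, domain_ids):
    """Fraction of project papers that also appear in the domain search results."""
    project = set(project_ids)
    if not project:
        return 0.0
    return len(project & set(domain_ids)) / len(project)

# Hypothetical identifiers: 40 project papers, of which 2 show up in the
# target-domain search set alongside unrelated domain papers.
project_papers = [f"P{i}" for i in range(1, 41)]
domain_papers = ["P3", "P17", "D100", "D101", "D102"]

share = share_in_domain(project_papers, domain_papers)
print(f"{share:.0%} of project papers fall in the target domain")
```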

Page 15: Organizational Self-Profiling

• Georgia Tech
  - For 6 years; >20,000 research abstracts from Web of Science, INSPEC, Compendex, MEDLINE
  - Also NSF awards; research projects database
• Research Locator – Quickly, on demand
• Research Profiler – A unit: change over time
• Marriage Broker – Identify capabilities to bolster a proposal
• Story Teller – Help make the case for your capabilities and teaming (networking)

Page 16: Case Examples

Nanopatenting: Life Cycle Analyses [Simone Alencar, Adelaide Antunes, et al.]

• Nanopatenting search (Derwent)
• Combine two sources of information
  - Patent sub-classes
  - Text mining on “uses”
• Categorize technology targets into 3 life stages
  - Nano raw materials
  - Nano intermediates
  - Nano products
• Use to examine organizational and national patenting emphases
• Identify strategic differences among US, Japan & Germany
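
One way to picture the "text mining on uses" step is a keyword lookup that assigns each patent's use statement to a life stage and then tallies stages by country. This is only a sketch under stated assumptions: the keyword table is illustrative (loosely following uses named elsewhere in this deck), the records are invented, and the authors' actual classification procedure is not described on the slide.

```python
from collections import Counter

# Hypothetical keyword-to-stage table; a real study would need a curated thesaurus.
STAGE_KEYWORDS = {
    "carbon nanotube": "Nano raw materials",
    "electron device": "Nano intermediates",
    "semiconductor device": "Nano intermediates",
    "catalyst": "Nano intermediates",
    "solar cell": "Nano products",
    "fuel cell": "Nano products",
    "cosmetic": "Nano products",
}

def classify_use(use_text):
    """Return the sorted life stages whose keywords occur in the use statement."""
    text = use_text.lower()
    stages = {stage for keyword, stage in STAGE_KEYWORDS.items() if keyword in text}
    return sorted(stages) or ["Unclassified"]

def stage_profile(records):
    """Tally life stages per country from (country, use_text) pairs."""
    profile = {}
    for country, use_text in records:
        profile.setdefault(country, Counter()).update(classify_use(use_text))
    return profile

# Invented example records, not drawn from Derwent.
records = [
    ("US", "Useful as a catalyst support"),
    ("US", "Used in a solar cell"),
    ("JP", "Carbon nanotube production"),
    ("JP", "Electron device, e.g. field emission display"),
    ("DE", "Packaging film"),
]
profile = stage_profile(records)
for country, counts in profile.items():
    print(country, dict(counts))
```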

Page 17: Case Examples

Discerning Patent Aims along the Value Chain [by Alencar, Antunes & Porter]

For each Main IPC [# patents]: main uses description in the nanopatents, each followed by its position along the Nano Value Chain.

H01L-Semiconductor Devices; Electric Solid State Devices Not Otherwise Provided [2870]
• Electron device: Nanointermediate
• Semiconductor device: Nanointermediate
• Solar cell: Nano-products

C01B-Non-Metallic Elements; Compounds Thereof [2716]
• Carbon nanotube: Nano-raw material
• Fuel cell: Nano-products
• Catalyst: Nanointermediate

A61K-Preparations For Medical, Dental, Or Toilet Purposes [1863]
• Cancer (treatment, medication): Nano-products
• Cosmetics: Nano-products
• Drugs: Nano-products

B82B-Nano-Structures; Manufacture Or Treatment Thereof Chemistry [1615]
• Carbon nanotube: Nano-raw material
• Electron device: Nanointermediate
• Catalyst: Nanointermediate

Page 18: National Benchmarking: Relative Research Emphases (WOS)

Page 19: Biotechnology Research Institute Quick Profile: 750 Articles (MEDLINE)

4 Key Topics Breakout:
- Over time (3-D plot)
- Key Researchers (table)

[3-D trend plot, 1997-2006: Molecular Sequence Data; Models, Molecular; Biodegradation, Environmental; Cloning, Molecular]
[Trend chart, 1990-2006]

Key Topics | Top Authors | Related Topics | Prime Years
Molecular Sequence Data [237] | Cygler, Miroslaw [48]; Thomas, David Y [46]; Ni, Feng [25] | Intellectual Product [237]; Amino Acid, Peptide, or Protein [225]; Laboratory Procedure [130] | 1992-2002
Models, Molecular [94] | Cygler, Miroslaw [50]; Li, Yunge [19]; Schrag, Joseph D [18] | Intellectual Product [94]; Amino Acid, Peptide, or Protein [93]; Spatial Concept [73] | 1997-2003
Biodegradation, Environmental [68] | Hawari, Jalal [31]; Greer, Charles W [18]; Halasz, Annamaria [18] | Biodegradation, Environmental [68]; Organic Chemical [64]; Bacterium [48]; Hazardous or Poisonous Substance [32] | 1998-2005
Cloning, Molecular [56] | Thomas, David Y [18]; Cygler, Miroslaw [10]; Lau, Peter C K [10] | Laboratory Procedure [56]; Amino Acid, Peptide, or Protein [53]; Intellectual Product [46] | 1993-2001

Page 20: Science Citation Index Nano Publications in Environment & Energy: By Region

Regional Entities | Publication Year: % since 2004 | Subject Category: Top 5 Items
EU [2185] | 36% of 2185 | Environmental Sciences [801]; Energy & Fuels [592]; Engineering, Chemical [512]; Water Resources [469]; Chemistry, Physical [341]
USA [1528] | 37% of 1528 | Environmental Sciences [708]; Engineering, Environmental [377]; Energy & Fuels [369]; Engineering, Chemical [292]; Water Resources [214]
China [481] | 63% of 481 | Environmental Sciences [242]; Energy & Fuels [175]; Chemistry, Physical [116]; Engineering, Environmental [93]; Electrochemistry [78]
SE Asia Tigers [474] | 52% of 474 | Environmental Sciences [204]; Energy & Fuels [197]; Electrochemistry [140]; Engineering, Environmental [120]; Water Resources [104]

Page 21: One Pagers

Page 22: Science Overlay + Other Mapping

• Two base science maps
• Overlay various topical research sets over these
• Cohesion maps
• Network maps
  - Collaboration maps; Common interest maps; Factor maps (term consolidation)
  - Within TDA, so maps are fully interactive

Page 23: Base Science Map (Science, Social Sciences, & Arts & Humanities)

Based on co-citation of 244 Subject Categories by a sample of 30,261 USA-authored articles in Web of Science, clustered into 21 Macro-disciplines (Rafols/Porter, 2008)

[Map node labels: Cognitive Sci; Agri Sci; Biomed Sci; Chemistry; Physics; Engr Sci; Env Sci & Tech; Mtls Sci; Reproductive Sci; Math, Interdisciplinary; Health Sci; Soc/Psych & Rltd; Policy Sci; Literature & Arts; Clinical Med; Computer Sci; Ind Engr/Mgt Sci; Geosciences; Ecol Sci; Civil Engr; Ethical & Social Issues]
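
A base map "based on co-citation of Subject Categories" starts from a similarity matrix between categories. The sketch below counts how often two categories are cited together by the same article and then compares categories by the cosine of their co-citation profiles. Cosine normalization is an assumption here (a common choice for such maps); the slide does not state the similarity measure, and the four sample articles are invented.

```python
import math
from collections import defaultdict
from itertools import combinations

def cocitation_counts(articles):
    """articles: iterable of sets, each holding the Subject Categories one article cites."""
    counts = defaultdict(lambda: defaultdict(int))
    for cited in articles:
        for a, b in combinations(sorted(cited), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return counts

def cosine(counts, a, b):
    """Cosine similarity between the co-citation profiles of categories a and b."""
    keys = set(counts[a]) | set(counts[b])
    dot = sum(counts[a].get(k, 0) * counts[b].get(k, 0) for k in keys)
    norm_a = math.sqrt(sum(v * v for v in counts[a].values()))
    norm_b = math.sqrt(sum(v * v for v in counts[b].values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Invented sample: each set is the categories cited by one article.
articles = [
    {"Chemistry", "Physics", "Mtls Sci"},
    {"Chemistry", "Mtls Sci"},
    {"Physics", "Mtls Sci"},
    {"Clinical Med", "Health Sci"},
]
counts = cocitation_counts(articles)
print(round(cosine(counts, "Chemistry", "Physics"), 3))
print(cosine(counts, "Chemistry", "Clinical Med"))
```

Categories with high mutual similarity would then be placed close together by a layout tool and grouped into macro-disciplines by a clustering or factoring step, neither of which is shown here.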

Page 24: Synthetic Biology Overlay Map

Science Overlay Mapping: Which research communities to engage in a “Synthetic Biology” workshop?

[Map (Pajek) node labels: Medical Tech; Health Sci; Ethical & Social Issues; Literature & Arts; Social Sciences; Behav Sci; Clinical Med; Infectious Diseases; Agri Sci; Eco Sci; Math across Disciplines; Comp Sci; Engr Sci; Chemistry; Geo Sci; Env Sci; Mtls Sci; Physics; Biomed Sci; Ind Engr/Mgt Sci; Soc/Psych]

Page 25: Relative Nano in Social Sciences Citation Emphases: pre-2005

Nano in Social Sciences Articles’ Cited SCs (from SSCI + Scopus, pre-2005)
Overlay on the 244 Subject Category Web of Science map (normalized to 2005-07 total cites)

[Overlay map; node labels are the same 21 macro-disciplines as on the Base Science Map, from Cognitive Sci through Ethical & Social Issues]

Page 26: Relative Nano in Social Sciences Citation Emphases: 2005-07

Nano in Social Sciences Articles’ Cited SCs (from SSCI + Scopus for 2005-07) (Rafols/Porter, Aug 31, 2008).
NOTE: Enhanced research knowledge bases in the social sciences, compared to the earlier period.

[Overlay map; node labels are the same 21 macro-disciplines as on the Base Science Map, from Cognitive Sci through Ethical & Social Issues]

Page 27: Science Mapping

Co-citation Map of the most cited authors by 307 nano social science papers

[Map cluster labels: Governance; Visions; Evolutionary Economics; Perception; Ethics]

Page 28: Science Overlay Mapping – Base Map

GLOBAL MAP OF SCIENCE
Leydesdorff & Rafols (2007, submitted)

[Map (Pajek) node labels: Neurosciences; Computer Sciences; Geoscience; Agriculture; Ecology; Biological Sciences; Chemistry; Physics; Engineering; Environ. Sci.; Materials Sci; Infectious diseases; Clinical medicine; General medicine]

Page 29: Nanotech-related publications in the map of science (1991)

Nano Sample Profile (~900 records)

[Overlay map (Pajek) node labels: Neurosciences; Computer Sci.; Geoscience; Agriculture; Ecology; Biological Sci.; Chemistry; Physics; Engineering; Environ. Sci.; Materials Sci.; Infectious diseases; Clinical medicine; General medicine]

Page 30: Nanotech-related publications in the map of science (2005)

Nano Sample Profile (~900 records)

[Overlay map (Pajek) node labels: Neurosciences; Computer Sci.; Geoscience; Agriculture; Ecology; Biological Sci.; Chemistry; Physics; Engineering; Environ. Sci.; Materials Sci.; Infectious diseases; Clinical medicine; General medicine]

Page 31: Map of Science

Quantum Dot 1995
Size (area) of nodes is proportional to: Log (1+Number of citations per category)
Rafols

[Overlay map (Pajek)]
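
The node-sizing rule on this slide can be written out directly. A minimal sketch, assuming the natural logarithm (the slide says only "Log"; another base would rescale every area by the same constant) and a hypothetical `scale` factor for display units. The citation counts are invented.

```python
import math

def node_area(citations, scale=1.0):
    """Node area proportional to log(1 + citations); zero citations gives zero area."""
    return scale * math.log(1 + citations)

def node_radius(citations, scale=1.0):
    """Radius of a circular node with that area (area = pi * r**2)."""
    return math.sqrt(node_area(citations, scale) / math.pi)

# Invented citation counts per category, for illustration only.
citations_by_category = {"Physics": 120, "Chemistry": 45, "Ecology": 0}
areas = {cat: node_area(n) for cat, n in citations_by_category.items()}
radii = {cat: node_radius(n) for cat, n in citations_by_category.items()}
for cat in citations_by_category:
    print(f"{cat}: area={areas[cat]:.2f}, radius={radii[cat]:.2f}")
```

The logarithm compresses the range, so a category with a handful of citations stays visible next to one with hundreds.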

Page 32: Map of Science

Quantum Dot 2005
Rafols

[Overlay map (Pajek)]

Page 33: Tracking Change over Time

Polymer Biomaterials : fibrous structural proteins : skin
1991-1997 (68 patents)

Page 34:

Polymer Biomaterials : fibrous structural proteins : skin
1991-2005 (470 patents)

Page 35: Geo-coded Mapping

Georgia Tech TPAC / CNS-ASU Analysis of SCI Publications; refined nano definition; results subject to revision

Page 36: Tech Mining for ST&I Indicators Construction: Sao Paulo & Brazil [Leandro Faria et al., UFSC]

Page 38: An Increasingly Competitive World: Georgia Tech High Tech Indicators point to China ~#1

Change in Competitiveness 1993-2007

[Scatter plot; axes: INPUT-Average and Technological Standing. Countries plotted: China; United States; Japan; Germany; Ireland; Israel; South Korea; Singapore; Malaysia; Venezuela; Mexico; Czech Rep.; India; Poland; Brazil; Hungary; Argentina; Indonesia; Thailand; Philippines; Taiwan]

Note: Coverage for Venezuela added 1996; Poland added in 1996; Ireland added 1999; Israel added in 1999; Czech Republic added in 1999

Page 39: “Aged” Nano Citations in 2000 and 2004 relative to Nano Articles (1st Author)

Page 40: Summing Up & Looking Ahead

• Mine field-structured text like data – for patterns!
  - Using TDA + other software
• Patterns speak to ST&I policy intelligence: benchmarking, trends, assessing technology maturation, etc.
• Answer “who, what, where & when” questions for policy decision processes [Generate “One-Pagers”]
• Aim to standardize questions & answers
• Then semi-automate the analytical processes: QTIP – Quick Technology Intelligence Process
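
"Mining field-structured text like data" can be pictured as counting values per record field and reading the top entries as answers to who, what, where and when. This is a rough sketch only: the field names, the question-to-field mapping and the three records are hypothetical, and it does not represent TDA's import filters or the QTIP process itself.

```python
from collections import Counter

# Hypothetical mapping from policy question to record field.
QUESTION_FIELDS = {
    "who": "authors",
    "what": "keywords",
    "where": "countries",
    "when": "year",
}

def one_pager(records, top_n=3):
    """Return the top_n most frequent values per question as (value, count) pairs."""
    summary = {}
    for question, field in QUESTION_FIELDS.items():
        counter = Counter()
        for record in records:
            value = record.get(field, [])
            # Fields may hold one value (year) or a list (authors, keywords).
            counter.update(value if isinstance(value, list) else [value])
        summary[question] = counter.most_common(top_n)
    return summary

# Invented abstract records, for illustration only.
records = [
    {"authors": ["Author A", "Author B"], "keywords": ["quantum dot"],
     "countries": ["USA"], "year": 2005},
    {"authors": ["Author A"], "keywords": ["quantum dot", "solar cell"],
     "countries": ["China", "USA"], "year": 2006},
    {"authors": ["Author C"], "keywords": ["fuel cell"],
     "countries": ["China"], "year": 2006},
]
summary = one_pager(records)
for question, top in summary.items():
    print(question, top)
```

Standardizing the question-to-field mapping is what makes the step repeatable across topics, which is the point of the last two bullets.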

Page 41: Resources

• Thomson Reuters Information: http://www.thomsonreuters.com/
• Thomson Data Analyzer: //scientific.thomsonreuters.com/products/tda/
• Tech Mining by Alan Porter and Scott Cunningham, Wiley, 2005.
• Chesbrough, H. W. (2003), Open Innovation – The New Imperative for Creating and Profiting from Technology, Harvard Business School Press.
• "Connect and Develop: Inside Procter & Gamble's New Model for Innovation," Harvard Business Review, Vol. 84, No. 3, March 2006,