london online 2008

21
Unlocking Innnovation in a Sea of Information Joe Buzzanga Product Manager, Science and Technology Elsevier [email protected] www.illumin8.com Online Information 2008 Dec. 3, 2008 London, UK

Upload: joebuzz1

Post on 05-Dec-2014

655 views

Category:

Technology


2 download

DESCRIPTION

Presentation at London Online, Dec. 2008

TRANSCRIPT

Page 1: London Online 2008

Unlocking Innnovation in a Sea of Information

Joe BuzzangaProduct Manager, Science and [email protected]

Online Information 2008Dec. 3, 2008London, UK

Page 2: London Online 2008

Topics

• R&D Challenges• The “Sea of Information”• Limits of Keyword• Next Generation Search

Page 3: London Online 2008

Investment in R&D

Page 5: London Online 2008

Technology Moves Fast

~$200 Laptop (OLPC initiative)

Amazon “Kindle”

Winners? Losers?

Page 6: London Online 2008

Winners

Page 7: London Online 2008

Losers

Page 8: London Online 2008

Supporting R&D and the Innovation Process

– Successful innovation requires superior information retrieval systems

– Delivering critical information on trends, competitors, substitute technologies, experts, etc. ie., “technology intelligence”

•“Technology Intelligence is the activity that enables companies to identify the technological opportunities and threats that could affect the future growth and survival of their business.”•“It aims to capture and disseminate the technological information needed for strategic planning and decision making”

Centre for Technology Management, University of Cambridge

Critical in industries characterized by technology turbulence and rapid change

Page 9: London Online 2008

“Searching for meaning in the content of unstructured data like images, video clips, documents, and the numbers and characters in databases is the rocket science of the digital universe.” IDC

Source: IDC Whitepaper, The Diverse and Exploding Digital Universe, March 2008

The “Sea” of Information

Page 10: London Online 2008

The “Sea” of Information

Page 11: London Online 2008

Today’s Researcher?

Search for Meaning?

5.5 hours / week *Searching and gathering information

* Source: 2007 survey of 6,300 knowledge workers, Outsell, Inc.

4.7 hours / week *Organizing and analyzing and applying information

Page 12: London Online 2008

Challenge for Information Retrieval?

• Separate the Signal from Noise

• Signal processing

Page 13: London Online 2008

Current Search Has Reached Its Limit

A keyword search for “biodegradable film” will yield over 1,000,000 links to documents.

Page 14: London Online 2008

The “key” in Keyword

• Keyword is a misnomer in context of an index• Keyword is in the mind of the searcher• Every word is indexed, since the computer is not smart enough to know significant words (i.e., the “key” in “keyword”)

– Brute force approach, feasible with compute power

Page 15: London Online 2008

Mystery Equation

mystery clip

Page 16: London Online 2008

Search and Its Discontents

Page 17: London Online 2008

What is illumin8?

Research and discovery tool powered by the world’s largest natural language processing engine:

– Designed for Corporate R&D professionals

– One search across billions of web pages, premium scientific articles & patents

– Find organizations, products, experts, approaches, technical landscapes & more

– Growing customer base across leading Fortune – 500 companies

Natural Language Processing at Internet Scale!

Page 18: London Online 2008

How does illumin8 work?

Full Text

Abstracts

Internet

Patents

illumin8 index1.1 billion semantic extractions

7 Billion web pages, blogs and forums

3 Million full-text scientific and technical articles from 1,800 Elsevier journals

36 Million scientific records from 15,000 peer reviewed journals & more than 4,000 publishers

22 Million patents from 5 world-wide patent offices

Extract and Summarize Results

Search

illumin8 discovers and extracts “Results” rather than bibliographic citations

Page 19: London Online 2008

How does illumin8 work?

Content

• Premium Scientific• Patent• Web

-Crawl-Load

NLP Applied

SemanticIndex

Problems, Solutions, Benefits

Search

NLP Applied

Results

Fuse, Classify, Summarize

NLP Applied

NLP applied throughout the system: index, query, result set

Page 20: London Online 2008

Taking Search Beyond Keyword

Keyword Indexing

• Meaning is lost

Sentence processing

• Meaning is maintained

• Identify & classify problems,

solutions and benefits

Neural Network used in handwriting recognition

Solution Problem

Page 21: London Online 2008

“ We have found illumin8 to be a unique and effective way to mine the internet and premium content for solutions to technical problems and questions. The value for us is the unique search capability with the deep and broad content set. Further, doing the research without illumin8 would have taken weeks and I would have spent countless hours looking at data that was not relevant to my project”

“ The company… has signed up a long list of major innovators, including 3M, Proctor & Gamble, General Mills and a couple of dozen other Fortune 500 companies. By all accounts, Illumin8 is set to become the Google of the innovation space...”

The Economist - March 2008

A Proven Solution