ii-sdv 2014 patent intelligence with bibliographic, legal status and patent register data: how...
TRANSCRIPT
Patent Intelligence with Bibliographic, Legal Status and Patent Register Data:How patent statistical analyses can help to improve services
Geert BoedtChristian Soltmann 15 April 2014
© European Patent Office 2014
Outline
� PATSTAT product family� Using PATSTAT data with popular tools:
– Excel example– KNIME example
� Case studies:– patent activity of industry sectors– EPO-OHIM study
© European Patent Office 2014
PATSTAT product family
© European Patent Office 2014
EPO Worldwide Patent Statistical Database (PATSTAT):
� unique basis for conducting sophisticated statistical analyses of patent data
� bibliographic data on more than 80 million patent documents from leading industrialized and developing countries
� can be supplemented by:– legal event data of patent documents in many
countries worldwide– European patent register data: Bibliographic,
legal and procedural information on published European patent applications and on published EURO-PCT applications
EPO Worldwide
Patent Statistical Database
(PATSTAT)European Patent
Register for PATSTAT
EPO worldwide
legal status database for
PATSTAT
© European Patent Office 2014
� updated twice a year� possible fields of application include:
– sophisticated citation analyses– linking patent data to business and trademark data– identifying technological trends– identifying competitors and potential partners
EPO Worldwide
Patent Statistical Database
(PATSTAT)European Patent
Register for PATSTAT
EPO worldwide
legal status database for
PATSTAT
EPO Worldwide Patent Statistical Database (PATSTAT):
© European Patent Office 2014
Database implementation
PATSTATEuropean Patent
Register for PATSTAT
EPO worldwide legal status database for
PATSTAT
DBMS
[…]
user1 usern[…]Interface
Excel […] SPSSRKNIME
© European Patent Office 2014
Using PATSTAT data with popular tools
© European Patent Office 2014
Accessing PATSTAT data
� various options available to access PATSTAT data, including using:– a database management system (e.g.
MySQL, Microsoft SQL Server)– a statistical tool (e.g. SPSS, R)– a data mining tool, e.g. KNIME– office software, e.g. Microsoft Access,
Microsoft Excel
© European Patent Office 2014
Processing PATSTAT data: Excel example
PATSTATEuropean Patent
Register for PATSTAT
EPO worldwide legal status database for
PATSTAT
DBMS
ODBC
Excel� VBA scripting:
� technical features of procedure is hidden from end user
� procedure:a.creating ActiveX Data Objects
for accessing data objects on DBMS:– Connection object to
establish connection to DBMS via ODBC and to have SQL query run
– Recordset object to accept result set of SQL query
b.result set is processed with PivotTable and PivotChart object
c.graphs may be coordinated to corporate design
© European Patent Office 2014
Processing PATSTAT data: Excel example
PATSTATEuropean Patent
Register for PATSTAT
EPO worldwide legal status database for
PATSTAT
DBMS
ODBC
Excel� VBA scripting:
– technical features of procedure is hidden from end user
– procedure:a.creating ActiveX Data Objects
for accessing data objects on DBMS:• Connection object to
establish connection to DBMS via ODBC and to have SQL query run
• Recordset object to accept result set of SQL query
b.result set is processed with PivotTable and PivotChart object
c.graphs may be adjusted to corporate design
© European Patent Office 2014
Processing PATSTAT data: Excel example
� sample result for wind energy sector (defined by patent class domain Y02E10/70):
© European Patent Office 2014
Processing PATSTAT data: KNIME example
� open source data analytics, reporting and integration platform
� modular data pipelining concept� graphical user interface allowing to assembly nodes
for data processing� further information available at: http://www.knime.org/
© European Patent Office 2014
� sample dataflow: number of inventions in the wind energy sector:a. query PATSTAT databaseb. extract result tablec. process result table
– optionally combination with other datad. aggregate datae. display results
Processing PATSTAT data: KNIME example
© European Patent Office 2014
Processing PATSTAT data: KNIME example
© European Patent Office 2014
Processing PATSTAT data: KNIME example
© European Patent Office 2014
Processing PATSTAT data:PATSTAT database scheme
© European Patent Office 2014
Processing PATSTAT data: KNIME example
© European Patent Office 2014
Processing PATSTAT data: KNIME example
© European Patent Office 2014
Case studies: Patent activity of industry
sectors
© European Patent Office 2014
Patent activity of industry sectors: Motivation� interest in patent activity as a proxy for the input side of
innovation
� target groups:
– investors
– policy makers
� procedure: linking patent data and business data to identify industry sectors with high patent/innovation activity
EPO Worldwide
Patent Statistical Database (PATSTAT)European
Patent Register for PATSTAT
EPO worldwide
legal status database for
PATSTAT
NACEclassification
IPC/NACEconcordance
© European Patent Office 2014
Patent activity of industry sectors
� procedure:– categorisation of patent documents according to
so-called (manufacturing) sectoral fields [based on IPC/NACE concordance table (Schmoch et al. (2003)]
– determining the number of inventions per year of invention for the said sectoral fields
– data normalisation, to 1980
© European Patent Office 2014
Patent activity of industry sectors
Description
Food, beverages
Tobacco products
Textiles
Wearing apparel
Leather articles
Wood products
Paper
Petroleum products, nuclear fuel
Basic chemical
Pesticides, agro-chemical products
Paints, varnishes
PharmaceuticalsSoaps, detergents, toilet preparations
Other chemicals
Man-made fibres
Rubber and plastics products
Non-metallic mineral products
Basic metals
Fabricated metal products
Energy machinery
Non-specific purpose machinery
Agricultural and forestry machinery
Machine-tools
Special purpose machinery
Weapons and ammunitionDomestic appliances
Office machinery and computers
Electric motors,generators, transformers
Electric distribution, control, wire, cable
Accumulators, battery
Lightening equipment
Other electrical equipment
Electronic components
Signal transmission, telecommunications
Television and radio receivers, audiovisual electronics
Medical equipment
Measuring instruments
Industrial process control equipmentOptical instruments
Watches, clocks
Motor vehicles
Other transport equipment
Furniture, consumer goods
© European Patent Office 2014
Case studies: EPO-OHIM study
© European Patent Office 2014
Example: Linking patent data, trademark data and design data:
� Joint project of European Patent Office and the Office for Harmonization in the Internal Market (OHIM):– purpose: to examine economic characteristics of
IP-intensive industries in Europe– methodology:� determine which industries use IP rights more than
others� determine employment and value added generated in
those industries� determine weight of IP-intensive industries in Europe
© European Patent Office 2014
Example: Linking patent data, trademark data and design data:
� methodological challenge:– complexity of dealing with a large amount of data from
27 EU member states, contained in several databases– novel and sophisticated data-matching technique was
needed, including extensive data preparation/name harmonisation
� in order to determine which industries are IP-intensive:– matching EPO's Worldwide Patent Statistical Database
(PATSTAT) and OHIM's register database with commercial ORBIS database and EUROSTAT data
© European Patent Office 2014
� Linking patent data, trademark data and design data to determine IP intensive industries:
Patent applications
published and patents granted by
EPO
Community trademarks and
registered community
designs
EPO's PATSTAT database OHIM's register database
ORBIS database
PATSTAT-ORBIS
concordance
OHIM-ORBISconcordance
Concordance schema
Industry classification and other information
for more than 20 million European
companies
Example: Linking patent data, trademark data and design data:
EUROSTAT dataNACEcode
© European Patent Office 2014
Example: Linking patent data, trademark data and design data:
� Results include:– about half of European industries can be considered IP-
intensive– total number of IP-dependent jobs: about 77 million (35
per cent of all jobs)– IP-intensive industries pay significantly higher wages
than other industries– value added per employee is higher in IP-intensive
industries than elsewhere in European economy� further information available at:
http://ec.europa.eu/internal_market/intellectual-property/studies/index_en.htm