pratical deep dive into the semantic web - #smconnect

177
International Freelance SEO

Upload: jan-willem-bobbink-freelance-seo-consultant

Post on 11-Apr-2017

834 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Pratical Deep Dive into the Semantic Web - #smconnect

International Freelance SEO

Page 2: Pratical Deep Dive into the Semantic Web - #smconnect

What is

Page 3: Pratical Deep Dive into the Semantic Web - #smconnect
Page 4: Pratical Deep Dive into the Semantic Web - #smconnect

―The Semantic Web is a collaborative

movement led by international standards

body the World Wide Web

Consortium (W3C). The standard promotes

common data formats on the World Wide

Web‖

Page 5: Pratical Deep Dive into the Semantic Web - #smconnect

―The Semantic Web provides a common

framework that allows data to be shared

and reused across application, enterprise,

and community boundaries‖

Page 6: Pratical Deep Dive into the Semantic Web - #smconnect
Page 7: Pratical Deep Dive into the Semantic Web - #smconnect
Page 8: Pratical Deep Dive into the Semantic Web - #smconnect
Page 9: Pratical Deep Dive into the Semantic Web - #smconnect
Page 10: Pratical Deep Dive into the Semantic Web - #smconnect
Page 11: Pratical Deep Dive into the Semantic Web - #smconnect
Page 12: Pratical Deep Dive into the Semantic Web - #smconnect
Page 13: Pratical Deep Dive into the Semantic Web - #smconnect
Page 14: Pratical Deep Dive into the Semantic Web - #smconnect

Why are Google and other online giants interested

Page 15: Pratical Deep Dive into the Semantic Web - #smconnect

So…what is the main reason?

Page 16: Pratical Deep Dive into the Semantic Web - #smconnect
Page 17: Pratical Deep Dive into the Semantic Web - #smconnect
Page 18: Pratical Deep Dive into the Semantic Web - #smconnect
Page 19: Pratical Deep Dive into the Semantic Web - #smconnect

36%

24%

29%

46%

42%

36% 37%

0%

5%

10%

15%

20%

25%

30%

35%

40%

45%

50%

North America South America Europe Asia Africa Oceania Global

2014 average versus 2015 until date

Page 20: Pratical Deep Dive into the Semantic Web - #smconnect
Page 21: Pratical Deep Dive into the Semantic Web - #smconnect
Page 22: Pratical Deep Dive into the Semantic Web - #smconnect
Page 23: Pratical Deep Dive into the Semantic Web - #smconnect

So how does the

Page 24: Pratical Deep Dive into the Semantic Web - #smconnect
Page 25: Pratical Deep Dive into the Semantic Web - #smconnect
Page 26: Pratical Deep Dive into the Semantic Web - #smconnect
Page 27: Pratical Deep Dive into the Semantic Web - #smconnect
Page 28: Pratical Deep Dive into the Semantic Web - #smconnect
Page 29: Pratical Deep Dive into the Semantic Web - #smconnect
Page 30: Pratical Deep Dive into the Semantic Web - #smconnect
Page 31: Pratical Deep Dive into the Semantic Web - #smconnect
Page 32: Pratical Deep Dive into the Semantic Web - #smconnect
Page 33: Pratical Deep Dive into the Semantic Web - #smconnect
Page 34: Pratical Deep Dive into the Semantic Web - #smconnect
Page 35: Pratical Deep Dive into the Semantic Web - #smconnect
Page 36: Pratical Deep Dive into the Semantic Web - #smconnect
Page 37: Pratical Deep Dive into the Semantic Web - #smconnect
Page 38: Pratical Deep Dive into the Semantic Web - #smconnect
Page 39: Pratical Deep Dive into the Semantic Web - #smconnect
Page 40: Pratical Deep Dive into the Semantic Web - #smconnect
Page 41: Pratical Deep Dive into the Semantic Web - #smconnect
Page 42: Pratical Deep Dive into the Semantic Web - #smconnect
Page 43: Pratical Deep Dive into the Semantic Web - #smconnect
Page 44: Pratical Deep Dive into the Semantic Web - #smconnect
Page 45: Pratical Deep Dive into the Semantic Web - #smconnect
Page 46: Pratical Deep Dive into the Semantic Web - #smconnect
Page 47: Pratical Deep Dive into the Semantic Web - #smconnect

How about those future

Page 48: Pratical Deep Dive into the Semantic Web - #smconnect
Page 49: Pratical Deep Dive into the Semantic Web - #smconnect
Page 50: Pratical Deep Dive into the Semantic Web - #smconnect
Page 51: Pratical Deep Dive into the Semantic Web - #smconnect

So…

Page 52: Pratical Deep Dive into the Semantic Web - #smconnect
Page 53: Pratical Deep Dive into the Semantic Web - #smconnect
Page 54: Pratical Deep Dive into the Semantic Web - #smconnect

54

Page 55: Pratical Deep Dive into the Semantic Web - #smconnect

55

Page 56: Pratical Deep Dive into the Semantic Web - #smconnect
Page 57: Pratical Deep Dive into the Semantic Web - #smconnect

International Freelance SEO

SEO Consultant Metapeople

/ Netbooster Group

Brand Ambassador Majestic

Cycling & Skating

Science: Physics in particular

Page 58: Pratical Deep Dive into the Semantic Web - #smconnect
Page 59: Pratical Deep Dive into the Semantic Web - #smconnect
Page 60: Pratical Deep Dive into the Semantic Web - #smconnect

1. Make data available

2. Use specific markup languages

3. Data is available for everyone

Page 61: Pratical Deep Dive into the Semantic Web - #smconnect

―The Open Graph protocol enables any web

page to become a rich object in a social

graph. For instance, this is used on

Facebook to allow any web page to have

the same functionality as any other object

on Facebook.‖

Page 62: Pratical Deep Dive into the Semantic Web - #smconnect
Page 63: Pratical Deep Dive into the Semantic Web - #smconnect
Page 64: Pratical Deep Dive into the Semantic Web - #smconnect

Use: https://developers.facebook.com/docs/opengraph/

Page 65: Pratical Deep Dive into the Semantic Web - #smconnect
Page 66: Pratical Deep Dive into the Semantic Web - #smconnect
Page 67: Pratical Deep Dive into the Semantic Web - #smconnect

Use: https://cards-dev.twitter.com/validator

Page 68: Pratical Deep Dive into the Semantic Web - #smconnect

1. Schema.org microdata

2. Open Graph protocol

3. Title + metadescription element

4. Best guess from page content

Use: https://developers.google.com/+/web/snippet/

Page 69: Pratical Deep Dive into the Semantic Web - #smconnect

Use: https://wordpress.org/plugins/wordpress-seo/

Page 70: Pratical Deep Dive into the Semantic Web - #smconnect
Page 71: Pratical Deep Dive into the Semantic Web - #smconnect
Page 72: Pratical Deep Dive into the Semantic Web - #smconnect

Use Amazon EC2, setup a crawler and crawl

the top 1.000.000 Alexa URLs

Checked for occurrences of:

–Microdata / Schema

–OpenGraph

–Twitter Cards

Page 73: Pratical Deep Dive into the Semantic Web - #smconnect

- Crawled with 360/URLS/sec

- 68.4GB of data used

- 68% (683267 URLs) returned 200 OK

- 27% 30X Redirects

- 3% of domains had DNS issues

Page 74: Pratical Deep Dive into the Semantic Web - #smconnect

15,84%

14,55%

1,59%

1,32%

7,27%

2,69%

0,22%

OpenGraph Title

OpenGraph URL

Twitter:title

Twitter:url

Schema itemprop

Schema Itemprop Name

AggregateRating

Based on 683k of top million Alexa urls

Page 75: Pratical Deep Dive into the Semantic Web - #smconnect

Commercial tool: http://www.builtwith.com

Page 76: Pratical Deep Dive into the Semantic Web - #smconnect

Commercial tool: http://www.builtwith.com

Page 77: Pratical Deep Dive into the Semantic Web - #smconnect
Page 78: Pratical Deep Dive into the Semantic Web - #smconnect
Page 79: Pratical Deep Dive into the Semantic Web - #smconnect

se·man·tics [si-man-tiks]

noun

the branch of linguistics that deals with the

study of meaning, changes in meaning,

and the principles that govern the

relationship between sentences or words

and their meanings

Page 80: Pratical Deep Dive into the Semantic Web - #smconnect
Page 81: Pratical Deep Dive into the Semantic Web - #smconnect
Page 82: Pratical Deep Dive into the Semantic Web - #smconnect

―Microdata is a set of tags, introduced with

HTML5, that allows you to do this.‖

Page 83: Pratical Deep Dive into the Semantic Web - #smconnect
Page 84: Pratical Deep Dive into the Semantic Web - #smconnect

• Is separated from the HTML

• Which gives more flexibility and scalabilty

options

• Used in more software, like the washing

machine I showed earlier

• But… Google hasn’t integrated everything

yet

Page 85: Pratical Deep Dive into the Semantic Web - #smconnect
Page 86: Pratical Deep Dive into the Semantic Web - #smconnect
Page 87: Pratical Deep Dive into the Semantic Web - #smconnect
Page 88: Pratical Deep Dive into the Semantic Web - #smconnect

<div itemscope itemtype="http://data-vocabulary.org/Review-aggregate">

<span itemprop="itemreviewed">Several German beers</span>

<img itemprop="photo" src="beer.jpg" />

<span itemprop="rating" itemscope itemtype="http://data-vocabulary.org/Rating">

<span itemprop="average">9</span>

<span itemprop="best">10</span>

</span>

<span itemprop="votes">24</span>

<span itemprop="count">5</span>

</div>

Page 89: Pratical Deep Dive into the Semantic Web - #smconnect

<div itemscope itemtype="http://schema.org/Person">

<span itemprop="name">Jan-Willem</span>

<img src="janwillem.jpg" itemprop="image" />

<span itemprop="jobTitle">International SEO</span>

<div itemprop="address" itemscope itemtype="http://schema.org/PostalAddress">

<span itemprop="addressLocality">Amsterdam</span>,

<span itemprop="addressRegion">- Europe</span>

<span itemprop="postalCode">9999XX</span>

</div>

</div>

Page 90: Pratical Deep Dive into the Semantic Web - #smconnect

1. Products

2. Product offer

3. Product aggregated offer

Page 91: Pratical Deep Dive into the Semantic Web - #smconnect

Create multiple links to relevant pages within

1 entry in the SERPs.

Page 92: Pratical Deep Dive into the Semantic Web - #smconnect
Page 93: Pratical Deep Dive into the Semantic Web - #smconnect
Page 94: Pratical Deep Dive into the Semantic Web - #smconnect

• https://developers.google.com/structured-data/rich-

snippets/

• Schema Creator by Raven http://schema-creator.org/

• Schema.org Generator http://www.microdatagenerator.com/

• Rich Snippets Testing Tool Bookmarklet• http://www.blindfiveyearold.com/rich-snippets-testing-tool-bookmarklet

• Everything you need to know to generate

rich snippets: http://seogadget.com/micro-data-schema-org-

guide-to-generating-rich-snippets/

Page 95: Pratical Deep Dive into the Semantic Web - #smconnect

1. You have specific data points available

2. SE’s accept specific markup language

3. SE’s accept certain snippets

4. Information within the SERPs is correct

• Implement code and check with the SE’s:

https://developers.google.com/structured-data/testing-tool/?hl=it

Page 96: Pratical Deep Dive into the Semantic Web - #smconnect
Page 97: Pratical Deep Dive into the Semantic Web - #smconnect

• Make sure all items are structured and

nested in the correct way.

• Google Testing tool only shows errors

based on missing elements, not on wrong

coding!

Page 98: Pratical Deep Dive into the Semantic Web - #smconnect
Page 99: Pratical Deep Dive into the Semantic Web - #smconnect
Page 100: Pratical Deep Dive into the Semantic Web - #smconnect
Page 101: Pratical Deep Dive into the Semantic Web - #smconnect
Page 102: Pratical Deep Dive into the Semantic Web - #smconnect

https://plus.google.com/communities/103048251221048356778

Page 103: Pratical Deep Dive into the Semantic Web - #smconnect

―Google doesn’t use markup for ranking purposes at this time—but rich snippets can make your web pages appear more prominently in search results, so you may see an increase in traffic.‖

Source: https://support.google.com/webmasters/answer/1211158?hl=en

Page 104: Pratical Deep Dive into the Semantic Web - #smconnect
Page 105: Pratical Deep Dive into the Semantic Web - #smconnect

https://support.google.com/webmasters/contact/rich_snippets_spam

Page 106: Pratical Deep Dive into the Semantic Web - #smconnect
Page 107: Pratical Deep Dive into the Semantic Web - #smconnect
Page 108: Pratical Deep Dive into the Semantic Web - #smconnect
Page 109: Pratical Deep Dive into the Semantic Web - #smconnect
Page 110: Pratical Deep Dive into the Semantic Web - #smconnect
Page 111: Pratical Deep Dive into the Semantic Web - #smconnect

406

368

288

248

228

182

177

148

135

Artificial Intelligence and Machine Learning

Algorithms and Theory

Human-Computer Interaction and Visualization

Natural Language Processing

Machine Perception

Information Retrieval and the Web

Security, Cryptography, and Privacy

Data Mining

Software Systems

Top 10 Research fields per # Publications

Page 112: Pratical Deep Dive into the Semantic Web - #smconnect
Page 113: Pratical Deep Dive into the Semantic Web - #smconnect
Page 114: Pratical Deep Dive into the Semantic Web - #smconnect

What happened during the past 8 years?

2007 2010 2015

Page 115: Pratical Deep Dive into the Semantic Web - #smconnect

From a database to search engine result pages

Page 116: Pratical Deep Dive into the Semantic Web - #smconnect

Now… Let’s be honest

Page 117: Pratical Deep Dive into the Semantic Web - #smconnect
Page 118: Pratical Deep Dive into the Semantic Web - #smconnect
Page 119: Pratical Deep Dive into the Semantic Web - #smconnect

Basic information retrieval

Page 120: Pratical Deep Dive into the Semantic Web - #smconnect

Basic information retrieval

Page 121: Pratical Deep Dive into the Semantic Web - #smconnect
Page 122: Pratical Deep Dive into the Semantic Web - #smconnect

Basic information retrieval

Page 123: Pratical Deep Dive into the Semantic Web - #smconnect

Freebase only has +/- 200 attributes for the class Country

?

Page 124: Pratical Deep Dive into the Semantic Web - #smconnect

http://arxiv.org/pdf/1503.00759.pdf

Page 125: Pratical Deep Dive into the Semantic Web - #smconnect
Page 126: Pratical Deep Dive into the Semantic Web - #smconnect

http://research.google.com/pubs/pub41894.html

Page 127: Pratical Deep Dive into the Semantic Web - #smconnect
Page 128: Pratical Deep Dive into the Semantic Web - #smconnect
Page 129: Pratical Deep Dive into the Semantic Web - #smconnect

Four different methods to extract triples from web content

Natural Language

Processing tools

Entity recognition

Entity linkage

Entity verification

against Freebase

Source: https://www.cs.cmu.edu/~nlao/publication/2014.kdd.pdf

Document Object

Model

Either text or

database driven

―deep web‖ sources

Think of quering

HTML forms

570M tables on the

web

Relations are difficult

to extract

Schema matching

methods

Entity verification

against Freebase

Schema.org

Mostly people

related

Products & Events

are not stored

Mapping

Schema.org to

Freebase for

predicates

Page 130: Pratical Deep Dive into the Semantic Web - #smconnect
Page 131: Pratical Deep Dive into the Semantic Web - #smconnect

Researchers deal with ―duplicate content‖ as being just one source

P1

P2P3

P4

Page 132: Pratical Deep Dive into the Semantic Web - #smconnect

Exploring the power of tables on the Web

https://research.google.com/tables

Page 133: Pratical Deep Dive into the Semantic Web - #smconnect
Page 134: Pratical Deep Dive into the Semantic Web - #smconnect
Page 135: Pratical Deep Dive into the Semantic Web - #smconnect

The papers share some insights about the factors relevant to Google Tables results

Sources of data Google uses according to the paper

Optimise the

surrounding content

with relevant

captions and texts.

Use <th> table

headings to add

labels to specific

columns

Add relevant

attributes to your

table headings

focusing on the

queries used

Only add useful

content to the table.

Boilerplate content

is filtered out.

http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper3.pdf

Page 136: Pratical Deep Dive into the Semantic Web - #smconnect
Page 137: Pratical Deep Dive into the Semantic Web - #smconnect

―Extraction errors are far more prevalent than

source errors. Ignoring this distinction can

cause us to incorrectly distrust a website‖

Page 138: Pratical Deep Dive into the Semantic Web - #smconnect

Back to the basics for Google (and probably the other search engines too)

Links still tell something about

relationships between pages but also

between entities.

Simply search in the indices you already

have. In the case of Google, they already

have ―everything‖.

Simply gather user feedback from within

the search results.

Page 139: Pratical Deep Dive into the Semantic Web - #smconnect
Page 140: Pratical Deep Dive into the Semantic Web - #smconnect

Source: https://twitter.com/brentnau

Page 141: Pratical Deep Dive into the Semantic Web - #smconnect
Page 142: Pratical Deep Dive into the Semantic Web - #smconnect
Page 143: Pratical Deep Dive into the Semantic Web - #smconnect

Source: https://twitter.com/brentnau

Page 144: Pratical Deep Dive into the Semantic Web - #smconnect
Page 145: Pratical Deep Dive into the Semantic Web - #smconnect
Page 146: Pratical Deep Dive into the Semantic Web - #smconnect
Page 147: Pratical Deep Dive into the Semantic Web - #smconnect

One in 20 searches is health related according to Google.

Page 148: Pratical Deep Dive into the Semantic Web - #smconnect
Page 149: Pratical Deep Dive into the Semantic Web - #smconnect
Page 150: Pratical Deep Dive into the Semantic Web - #smconnect

Use Web based Fact

extraction, like DOM, tables

and annotated data

(Schema.org)

Text based extractors

adding more triples to the

datasets

Systems like described in the Biperpedia

paper. Data is enriched and quality

control takes place. Use partnerships for

trusted resources.

Use existing datasets like

Freebase / Wikidata to verify

extracted data and calculate

probability

Page 151: Pratical Deep Dive into the Semantic Web - #smconnect
Page 152: Pratical Deep Dive into the Semantic Web - #smconnect
Page 153: Pratical Deep Dive into the Semantic Web - #smconnect
Page 154: Pratical Deep Dive into the Semantic Web - #smconnect
Page 155: Pratical Deep Dive into the Semantic Web - #smconnect
Page 156: Pratical Deep Dive into the Semantic Web - #smconnect

Make sure you understand

Page 157: Pratical Deep Dive into the Semantic Web - #smconnect

A few possibilities to influence the content of brand cards

Page 158: Pratical Deep Dive into the Semantic Web - #smconnect

Main source still is Wikipedia, always backup your edits with sources

Page 159: Pratical Deep Dive into the Semantic Web - #smconnect
Page 160: Pratical Deep Dive into the Semantic Web - #smconnect
Page 161: Pratical Deep Dive into the Semantic Web - #smconnect

Your are able to give Google hints about your logo, corporate contacts and social profiles

Page 162: Pratical Deep Dive into the Semantic Web - #smconnect

Add schema.org Organization markup to your official website

Page 163: Pratical Deep Dive into the Semantic Web - #smconnect

Add schema.org Organization markup to your official website

Page 164: Pratical Deep Dive into the Semantic Web - #smconnect

Add schema.org Organization markup to your official website

Find example JSON-LD at

https://developers.google.com/structured-data/customize/overview

Page 165: Pratical Deep Dive into the Semantic Web - #smconnect

What about the localised Google search indices?

?

?

?

?

?

?

Page 166: Pratical Deep Dive into the Semantic Web - #smconnect
Page 167: Pratical Deep Dive into the Semantic Web - #smconnect
Page 168: Pratical Deep Dive into the Semantic Web - #smconnect

Contains the main

subject of the required

answer

Contains the main

subject of the required

answer

Within the content, the

question is answered in

a single sentence

No, Euro NCAP is more

authoritative in the EU

for car safety levels.

NHTSA for the US

Page 169: Pratical Deep Dive into the Semantic Web - #smconnect
Page 170: Pratical Deep Dive into the Semantic Web - #smconnect
Page 171: Pratical Deep Dive into the Semantic Web - #smconnect
Page 172: Pratical Deep Dive into the Semantic Web - #smconnect
Page 173: Pratical Deep Dive into the Semantic Web - #smconnect

Two indices, two truths?

Page 174: Pratical Deep Dive into the Semantic Web - #smconnect

So how can we make use this for our brand?

Page 175: Pratical Deep Dive into the Semantic Web - #smconnect

Since not many are focusing on the getting into the Direct Answers yet, grab the positions first!

Page 176: Pratical Deep Dive into the Semantic Web - #smconnect

95% of the cases had increased traffic - including movements within top 10 normal blue links.

Less than

expected, probably

because of quality of the

answer: results between -

5% and +6% traffic.

Results varied between -3%

and +11% depending on

previous position in the

SERPs

These were performing the

best, increases between 6

and 14%

Depending on the

topic, complicated topics

tend to get more clicks.

Average results between -

2% and 16% increase

Page 177: Pratical Deep Dive into the Semantic Web - #smconnect