how to avoid shooting yourself in the seo
DESCRIPTION
Fifteen Tips to Improve SEO on Sitecore Sites - June 23, 2013TRANSCRIPT
How to Avoid Shooting Yourself in the SEO
Fifteen Tips to Improve SEO on Sitecore SitesJune 23, 2013 1
Sitecore Enterprise Web CMS
• Popular .NET Web Content Management System for mid-to-large organizations
• Provides friendly tools for non-technical Content Authors and Marketers
• Can be customized and extended to meet almost any conceivable business need
2
Why SEO is Important
Search Engine Optimization (SEO)• Getting listed on Search Engine Results Pages (SERPs)• Getting more traffic from listings on search engines• Getting your content recognized as the authority
• What search engines consider most relevant to users• Search engines rank authority pages higher
Why Search Engine Optimization is Done• Search engines need to determine relevance
• What the page is about• How the page would be useful
• SEO helps the engines understand content context 3
Things that Hurt SEO
• Signs of a neglected or abandoned site• Having too many broken links on a page• Having too many HTML errors• Missing Alt Text for images
• Substantially similar content on the same site• Having a country code top level domain• Helps site rank for that particular country• Limits the site’s ability to rank globally
4
Static URLs are Better for SEO
URLs can be static or dynamic• Dynamic URLs contain a question mark• Static - http://example.com/en/content• Dynamic - http://example.com/content?sclang=en
Use static URLs over dynamic URLs for SEO• Static URLs typically rank better in search engines• Search engines are slower to index dynamic pages• Static URLs also appear more friendly to users
5
Using Sitecore Wildcard Nodes
Wildcard Node• A node in the Content Tree that matches all requests
for the given level if the item is not found on that level• URL parameters can be passed without query strings• Used to create friendly URLs for external content • Content is not in the Content Tree • Query strings do not have to be used to access it
How to create Wildcard Nodes• Place a * node below a node • Node accepts all URLs for items not found at that level• Parse the URL for the data
6
Handling URL Parameters for SEO
Google provides a Parameter Handling Tool• Used to tell Google how to handle URL parameters• Used to reduce impact of duplicate content on SEO• Useful when query strings cannot be avoided
• Available on the Webmaster Tools Dashboard
7
Why Duplicate Content is Bad
Duplicate content• Identical or nearly identical content on…• Multiple pages of a site• Pages on different sites
• URL variations pointing to the same page
Impact of duplicate content • Search engines try to index pages with distinct info• Links to the page contain multiple URLs• Popularity of the page can get divided across URLs• Page ranking of all the pages can be reduced• This can hinder pages' performance in SERPs
8
How Google Handles Duplicates
When Google detects duplicate content• Google groups the duplicate URLs into a cluster• Google selects one URL to represent the cluster• Google consolidates the link popularity to that URL
When Google doesn’t detect duplicate content• Google cannot consolidate the link popularity• This can dilute the ranking by splitting it across URLs
9
How Duplicate Content Occurs
Sitecore Content Tree items• Content items are copied with little or no editing• Cloned items are created and used• Alias items are created and used• Campaign items are created and used for tracking• Multiple layouts are used to display the same content• Web, print, and mobile layouts, etc.
URLs• Multiple domain names are bound to the same site• Links to a page contain formatting variations
10
URL Variations
Variations that can occur in links to a page• Lowercase vs. mixed case• Www vs. no www• Trailing slash vs. no trailing slash• Language embedding vs. no embedding (localization)• URL parameters vs. no parameters (campaign tracking)
URL variations are treated as duplicate content
11
What is a Canonical Page?
Canonical pages• Identify a page as the primary page over others • Useful when a site contains duplicate content• Multiple pages with (nearly) identical content• Multiple layouts used for the same content• Multiple URLs used to load the same page
12
What is a Canonical Tag?
Canonical tags• Used when pages contain duplicate content• Specifies the canonical page• Format: <link rel="canonical" href="http://primary-url>• Add to primary as well as secondary pages
• Prevents search engines from picking a primary page• Supports duplicate content within a single domain• Supports pages across different sub-domain URLs• Does not support pages in different domain URLs 13
DNS Mistakes that Hurt SEO
DNS domain name mistakes• Don’t bind multiple generic domains to the primary site• Don’t bind your primary site using a wildcard for all names• DNS Admins can get requests to add new names for a site• Each name that connects to the site can dilute the SEO for all
Content-related mistakes• Avoid creating multiple pages with duplicate content• Avoid varying the format of URLs on a site• Don’t let the search engines decide which page is primary
14
Other DNS Mistakes for SEO
Development sites for developers• Can contain an out-of-date copy of site content• Should not be accessible over the Internet • Should not have a binding to a public DNS name
Content management sites for content authors• Can contain unapproved and unpublished content• Can have a separate IP address• Can be mapped to the master database for website• Should not have a binding to a public DNS name 15
How to Set Up DNS for SEO
Using multiple DNS domain names for a live site• Bind a single generic domain name to the live site• Bind top-level country domain names to the live site• Create a second empty IIS site• Bind all other generic domain names to second site• Use the HTTP Redirect in IIS to set a 301 redirect• Select the second empty site in IIS• Open the HTTP Redirect tool in the IIS section• Add the primary URL in the Redirect text box• Set the status code to 301
16
Setting Up DNS for Localization
Try to use top level domains for localization• Get country specific domain names for each language version• Use www.example.fr for French, not fr.example.com• Use www.example.de for German, not www.example.com/de
Sites in top-level domains have distinct SERPS• Search Engines use an initial link for the root of the site to
crawl the site in a particular language• No session is needed to remember the language• Each page has a unique URL in the different languages because
the domain name is unique17
Geotargeting for SEO
For country-coded top-level domains • Sites are already associated with a geographic region
For generic top-level domains (.com, .org, etc.)• Use Webmaster Tools to set geographic targets• You can target users in a particular geographic location• Targets help determine how site appears on SERPs
18
What to Ask Your DNS Admin
Identify DNS names that point to your live site• Find the ones with no binding to your site• Add a binding for it to a secondary site that redirects• Alternatively, request the DNS name be removed
Identify DNS names that don’t point to your live site• Find the ones that display duplicate content• Create a 301 redirect to the real site or• Use a robots.txt file to prevent indexing of the site• Old links to the site will continue to work• Search engines will list only the real site
19
Staging Mistakes that Reduce SEO
Duplicate content on a staging site• A separate staging site can exist for Sitecore sites• The staging site points to a copy of the web database• A DNS Address record can exist for the staging site• This DNS Address record may not be restricted• The binding for the staging site uses the DNS record
• Search bots view the two sites as duplicate content• There is no robots.txt file at the root of the staging site• SEO can be reduced for the primary Live site 20
Setting up Staging Sites for SEO
There are good reasons to have a staging site • Authors can test content changes before going live• Admins can troubleshoot publishing issues
Set up your staging site without hurting SEO• Do one or more of the following…• Add a Robots.txt file to the root of the staging site • Restrict access with NTFS and Windows Authentication• Restrict access with IP Address restrictions• Allow DNS name resolution only on corporate network 21
How SEO Mistakes Can Occur
A company decides to revamp their web site• Old site continues to run • New site is created over time with no robots.txt file• There are public DNS records for both sites
A company wants to test a version upgrade• Old site is copied as-is with configuration settings• Upgrade is applied to copy of site and tested over time
A company wants to keep their old site around• Robots.txt file is not added to old site• Default redirects in IIS (302s) are used instead of 301s
<Your scenario here>
22
Using Robots.txt to Help SEO
Robots.txt• Plain text file• Add to the root folder of the site• Reduces indexing of duplicate content by search bots• Can prevent indexing of specific folders or paths• Can prevent indexing of entire site
How to prevent indexing of a siteUser-agent: *Disallow: / 23
Using a Sitemap to Improve SEO
What is a Sitemap?• Lists a hierarchy of the site pages you want indexed• Helps search engines discover canonical pages
Guidelines for Sitemaps• Add a Sitemaps file to the root of your site• Periodically submit a Sitemap to all search engines• Use the Sitemap protocol so all search engines can use it• Specify your preferred URL for pages in your Sitemap• Identify the location of your Sitemap file in robots.txt
24
Redirect Mistakes that Reduce SEO
Types of redirects• 301 indicates “Moved Permanently”• 302 indicates “Moved Temporarily or Found”• 307 indicates “Temporary Redirect”
The three biggest redirect mistakes for SEO• 302 Redirects are used• 307 Redirects are used• 301 Redirects are not used
Our next slide explains why…25
Link Juice
Marketing term for "online ranking power“• The quantity and quality of inbound links to a page• One of the primary factors that determines rankings
How Link Juice is affected by redirects• 302 redirects forward almost no link juice • 307 redirects forward almost no link juice• 301 redirects forward 90-99% of link juice
26
When to Create a Redirect
Redirects are useful when…• A page of content is moved on your site• You want an old link to work on your new site• You want the link juice from an old link to be
transferred to the new location of the content• You want the link juice from multiple domain names
transferred to a single site name• You don’t mind if visitors type abc.com and xyz.com is
displayed in their address bar 27
The URL Rewrite Tool
Microsoft add-on for IIS 7• Permits Web admins to replace and redirect URLs• Rules are configured in IIS and saved to web.config• URLs can be more search engine friendly• Supports many types of redirects, including 301s• Supports regular expressions in addition to wildcards• Hint: Use regular expressions (regex) over wildcards
28
Benefits of the URL Rewrite Tool
The URL Rewrite tool is very powerful• Should be installed on “Live” Sitecore server• Enables admins & webmasters to support redirects• Helps avoid binding non-primary generic domains
to secondary site and use 301 redirects (Slide #16) • Create a “Use Canonical Hostname” rule in URL
Rewrite tool instead (see next slide for details)
29
The ‘Use Canonical Name’ Rule
A rule created in the URL Rewrite tool• Install the URL Rewrite Tool add-on for IIS 7• Bind all domain names and wildcards to the main site• Create a rule named “Use Canonical Name”• Add all secondary generic domain names as conditions• Set the Logical Grouping to “Match Any”• Create a permanent redirect to your preferred URL• See examples on the next three slides
30
‘Use Canonical Name’ ExampleExample 1 – Top of URL Rewrite Window
31
‘Use Canonical Name’ ExampleExample 2 – Bottom of URL Rewrite Window
32
‘Use Canonical Name’ ExampleExample 3 – Resulting Rule in Web.config File
<rule name="Use Canonical Name" stopProcessing="true"> <match url="(.*)" /> <conditions logicalGrouping="MatchAny“ trackAllCaptures="false"> <add input="{HTTP_HOST}" pattern="(.*)sitecorenet\.com" /> <add input="{HTTP_HOST}" pattern="(.*)sitecorepedia\.com" /> </conditions> <action type="Redirect" url="http://www.theinsidecorner.com/{tolower:{R:1}}" /></rule>
33
Improving SEO with Internal Links
SEO guidelines for internal links on Sitecore sites• Use consistent URL formats for all internal links• Use the toolbar to configure links in Rich Text• Avoid adding markup for URLs manually • Use General Links over Rich Text to create links• Use Clones when displaying duplicate content• Review Broken Links Report and fix broken links
34
Improving SEO with Link Manager
Sitecore’s Link Manager settings• Located in the web.config file or a custom config file
Guidelines for Link Manager settings• Set Lowercase URLs to True• Set Add Aspx Extension to False• Set Always Include Server URL to True• Set Language Embedding to Always for multi-lingual sites• Set Language Location to File Path for multi-lingual sites• Set Shorten URLs to True for Content Trees with multiple sites
Additional guidelines for SEO• Configure IIS to pass all requests to ASP.NET• Add a trailing slash to the URL when appropriate
35
Use File Paths to Embed Languages
Link Manager supports two types of embedding• Query Strings• Parameter sets the language in a session variable• Search engines cannot read session variables
• File Paths• A session-less solution• Allows multiple languages in the same domain
Use file paths over query strings36
Sitecore Shared Source for SEO
SEO-Friendly URL module• Enables SEO-friendly URLs for Sitecore items • Implements a custom link provider & item resolver
Jungle Torch module• A SEO reporting and inbound marketing tool• Useful for social network analysis
Both are available from http://marketplace.sitecore.net 37
MOZ Tools for SEO
MozBar• A free SEO-Toolbar for Firefox or Chrome• Displays relevant SEO metrics as you surf the Web• Available from http://moz.com/tools/seo-toolbar
Open Site Explorer• A free tool that uses the Mozscape web index to
analyze links• Supports analyzing up to 10k links to any site or page• Available from http://www.opensiteexplorer.org/ 38
Tools To Locate Duplicate Content
Duplicate Content Tool• Performs a series of SEO checks on submitted URLs• http://www.virante.org/seo-tools/duplicate-content
Xenu• Scans site links and returns a table of all available URLs• http://home.snafu.de/tilman/xenulink.html
Google Webmaster Tools• Reports site’s duplicate titles and meta descriptions• https://www.google.com/webmasters/tools/home
39
Duplicate Content Tool Checks
• WWW/NonWWW Header checks if site returns a 301 redirect from www to non-www or vice versa• Google Cache checks if site has issues with www vs. non
www in Google’s Cache• Similarity checks if site has pages omitted from Google for
being too similar in the top 1000 results• Default Page checks if site appears to have redirected or
standardized on either the / or /index.{ext} • 404 checks if site correctly returns 404 errors for pages
that do not exist• PageRank Dispersion compares page rankings for the www
and non-www versions of the site to verify they are same
40
In Summary1. Use consistent URL formats and minimize similar content2. Minimize the use of cloned items and alias items in Sitecore3. Use static URLs over dynamic URLs and avoid query strings4. Clean up broken links and HTML validation errors5. Add canonical tags to pages with duplicate content6. Specify canonical URLs in a Sitemap and submit it to search engines7. Use Google Webmaster tools to specify your preferred domain8. Analyze your site bindings and compare them with your DNS records9. Bind only your primary DNS domain(s) to your live site10. Bind generic DNS domains to a second site that redirects to live site11. Use 301 redirects instead of 302 or 307 redirects12. Use top-level domains for localization, not subdomains or folders13. Block public DNS resolution for all development and staging sites14. Add a robots.txt file to all development and staging sites15. Configure your Sitecore Link Manager settings for SEO
41
Caveat Emptor
There are many ways to manage SEO for Sitecore sites• Skilled developers can provide other solutions• Review your SEO needs with your Sitecore partner• Review proposed changes with all stakeholders• Implement new configurations on a test system firstThis presentation is intended to be academic• To enhance public knowledge of SEO issues• To promote a better understanding of SEOAuthor cannot guarantee accuracy or fitness for your site• Author makes no representations or warranties• Author disclaims any liability if you act on this information
42
More Topics from TICAll Sitecore-related SlideShares from TIChttp://www.slideshare.net/cwnelson/presentations
How to Assure Success for a New Sitecore Site (full slide deck)http://www.slideshare.net/cwnelson/creating-new-sitecore-sites-full
Implementing Routine Sitecore Maintenancehttp://www.slideshare.net/cwnelson/sitecore-maintenance-tips
Performance Tips for Sitecore Mobile Usershttp://www.slideshare.net/cwnelson/sitecore-mobile-performance-tips
Sitecore 7 – New Features for Usershttp://www.slideshare.net/cwnelson/sitecore-7-new-features
Sitecore Performance Tips for Visitor Engagementhttp://www.slideshare.net/cwnelson/sitecore-performance-tips
Troubleshooting Sitecore Performancehttp://www.slideshare.net/cwnelson/troubleshooting-sitecore-performance
43
The Inside Corner, Inc.
• The Inside Corner, Inc. (TIC) is owned by Craig Nelson• TIC is a Sitecore partner & web consultancy• A U.S. point of contact for teams of Sitecore specialists • We create new Sitecore sites & enhance existing ones
44
Primary Developer - Craig Nelson
• Sitecore Specialist certified for Sitecore 7• 10 years of system administration experience• 10 years as a Microsoft Certified Trainer• 5 years of ASP.NET web development experience• 4 years of Sitecore development experience• 3 years as a Sitecore technical instructor• Sitecore architecture & configuration expert• Performance & SEO optimization specialist• Workflow configuration specialist 45
Get Help with your Sitecore Site
Follow TIC on LinkedIn• http://www.linkedin.com/company/the-inside-corner-inc.
Add TIC to your Circles on Google Plus• http://google.com/+Theinsidecorner
Follow TIC on Twitter • https://twitter.com/TheInsideCorner
Contact Craig Directly• Craig Nelson - Certified 7.0 Developer and Sitecore Partner• cwnelson[at]ticdevs.com• Mobile - 707.319.7446
Visit the TIC website• http://ticdevs.com
46
Bibliography
A Look at Google's 200 Search Ranking Factors About /robots.txtDemystifying the "duplicate content penalty"Link Juice ExplainedMOZ - CanonicalizationMultiple Languages in SitecoreSDN - Languages and Search EnginesSitecore - Avoiding query string in dynamic URLWebmaster Tools - Duplicate ContentWebmaster Tools - GeotargetingWebmaster Tools - URL parameters
Download slide presentation to view URLs
47
Thank you for taking the time to view this presentation on
How to Avoid Shooting Yourself in the SEO
Please share with others and add a comment if you think it is useful
Thanks!
June 23, 2013
48