<?xml version="1.0" encoding="UTF-8"?>
<blog-post>
  <author-id type="integer">42244</author-id>
  <blog-comments-count type="integer">3</blog-comments-count>
  <blog-post-status-id type="integer">3</blog-post-status-id>
  <body-format>econsultancy_xml</body-format>
  <body-formatted>
  &lt;p&gt;According to Sven Naumann on Google's &lt;a href="http://googlewebmastercentral.blogspot.com/2008/06/duplicate-content-due-to-scrapers.html"&gt;&lt;em&gt;Webmaster Central Blog&lt;/em&gt;&lt;/a&gt;, there are two main types of duplicate content:&lt;/p&gt;
  &lt;p&gt;
    &lt;strong&gt;Duplicate content within one website&lt;/strong&gt;
  &lt;/p&gt;
  &lt;p&gt;This is often unintentional. It can arise when a site has pages for similar products whose content differs only slightly, or when landing pages have been created for PPC campaigns.&lt;/p&gt;
  &lt;p&gt;In this case, Google recommends that webmasters include the preferred version of the URL in their Sitemap file, which helps the search engine's crawlers find the best version.&lt;/p&gt;
  &lt;p&gt;
    &lt;strong&gt;Duplicate content across domains&lt;/strong&gt;
  &lt;/p&gt;
  &lt;p&gt;This refers to content identical to that on your website appearing on third-party domains, often because sites use scrapers to copy your text in an attempt to push themselves up the rankings.&lt;/p&gt;
  &lt;p&gt;Naumann claims that Google manages to determine the original source of the content &lt;em&gt;"in most cases"&lt;/em&gt;, and that having your content copied shouldn&#8217;t affect your search rankings.&lt;/p&gt;
  &lt;p&gt;He offers the following tips if sites with scraped content are ranking higher than the original website:&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;Make sure your site&#8217;s content is being crawled by Google.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;
    &lt;li&gt;Check the Sitemap file to see if you made changes for the particular content which has been scraped.&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;
    &lt;li&gt;Make sure your site is in line with Google&#8217;s webmaster guidelines. &lt;/li&gt;
  &lt;/ul&gt;
  &lt;p&gt;
    &lt;em&gt;
      &lt;strong&gt;Also see our &lt;/strong&gt;
    &lt;/em&gt;
    &lt;a href="http://econsultancy.com/reports/search-engine-optimization-seo-best-practice-guide-2007"&gt;
      &lt;strong&gt;
        &lt;em&gt;Search Engine Optimisation (SEO) - Best Practice Guide&lt;/em&gt;
      &lt;/strong&gt;
    &lt;/a&gt;
    &lt;em&gt;
      &lt;strong&gt;&#160;for more info on&#160;the issue.&lt;/strong&gt;
    &lt;/em&gt;
  &lt;/p&gt;
  &lt;p&gt;
    &lt;strong&gt;Related stories:&lt;br /&gt;&lt;/strong&gt;
    &lt;a href="/blog/634-google-in-duplicate-content-shocker"&gt;Google in duplicate content shocker&lt;/a&gt;
    &lt;br /&gt;
    &lt;a href="/blog/2329-5-ways-to-beat-the-seo-competition-in-google"&gt;5 ways to beat the SEO competition in Google&lt;/a&gt;
  &lt;/p&gt;
</body-formatted>
  <body-unformatted>&lt;FormattedContent xmlns="http://www.e-consultancy.com/schema/formattedContent/"&gt;
  &lt;Paragraph&gt;According to Sven Naumann on Google's &lt;Link URL="http://googlewebmastercentral.blogspot.com/2008/06/duplicate-content-due-to-scrapers.html" Window="Self"&gt;&lt;Quote&gt;Webmaster Central Blog&lt;/Quote&gt;&lt;/Link&gt;, there are two main types of duplicate content:&lt;/Paragraph&gt;
  &lt;Paragraph&gt;
    &lt;Emphasis&gt;Duplicate content within one website&lt;/Emphasis&gt;
  &lt;/Paragraph&gt;
  &lt;Paragraph&gt;This is often unintentional. It can arise when a site has pages for similar products whose content differs only slightly, or when landing pages have been created for PPC campaigns.&lt;/Paragraph&gt;
  &lt;Paragraph&gt;In this case, Google recommends that webmasters include the preferred version of the URL in their Sitemap file, which helps the search engine's crawlers find the best version.&lt;/Paragraph&gt;
  &lt;Paragraph&gt;
    &lt;Emphasis&gt;Duplicate content across domains&lt;/Emphasis&gt;
  &lt;/Paragraph&gt;
  &lt;Paragraph&gt;This refers to content identical to that on your website appearing on third-party domains, often because sites use scrapers to copy your text in an attempt to push themselves up the rankings.&lt;/Paragraph&gt;
  &lt;Paragraph&gt;Naumann claims that Google manages to determine the original source of the content &lt;Quote&gt;"in most cases"&lt;/Quote&gt;, and that having your content copied shouldn&#8217;t affect your search rankings.&lt;/Paragraph&gt;
  &lt;Paragraph&gt;He offers the following tips if sites with scraped content are ranking higher than the original website:&lt;/Paragraph&gt;
  &lt;List Type="Disc"&gt;
    &lt;ListItem&gt;Make sure your site&#8217;s content is being crawled by Google.&lt;LineBreak /&gt;&lt;LineBreak /&gt;&lt;/ListItem&gt;
    &lt;ListItem&gt;Check the Sitemap file to see if you made changes for the particular content which has been scraped.&lt;LineBreak /&gt;&lt;LineBreak /&gt;&lt;/ListItem&gt;
    &lt;ListItem&gt;Make sure your site is in line with Google&#8217;s webmaster guidelines. &lt;/ListItem&gt;
  &lt;/List&gt;
  &lt;Paragraph&gt;
    &lt;Quote&gt;
      &lt;Emphasis&gt;Also see our &lt;/Emphasis&gt;
    &lt;/Quote&gt;
    &lt;Link URL="http://econsultancy.com/reports/search-engine-optimization-seo-best-practice-guide-2007" Window="New"&gt;
      &lt;Emphasis&gt;
        &lt;Quote&gt;Search Engine Optimisation (SEO) - Best Practice Guide&lt;/Quote&gt;
      &lt;/Emphasis&gt;
    &lt;/Link&gt;
    &lt;Quote&gt;
      &lt;Emphasis&gt;&#160;for more info on&#160;the issue.&lt;/Emphasis&gt;
    &lt;/Quote&gt;
  &lt;/Paragraph&gt;
  &lt;Paragraph&gt;
    &lt;Emphasis&gt;Related stories:&lt;LineBreak /&gt;&lt;/Emphasis&gt;
    &lt;Link URL="/blog/634-google-in-duplicate-content-shocker" Window="Self"&gt;Google in duplicate content shocker&lt;/Link&gt;
    &lt;LineBreak /&gt;
    &lt;Link URL="/blog/2329-5-ways-to-beat-the-seo-competition-in-google" Window="Self"&gt;5 ways to beat the SEO competition in Google&lt;/Link&gt;
  &lt;/Paragraph&gt;
&lt;/FormattedContent&gt;</body-unformatted>
  <created-at type="datetime">2008-06-10T10:15:00+01:00</created-at>
  <enabled-blog-comments-count type="integer">1</enabled-blog-comments-count>
  <expertise-level-id type="integer">1</expertise-level-id>
  <extract-format>econsultancy_xml</extract-format>
  <extract-formatted>
  &lt;p&gt;
    &lt;strong&gt;Duplicate content is an important issue that can have an adverse effect on a website&#8217;s search engine rankings.&lt;/strong&gt;
  &lt;/p&gt;
  &lt;p&gt;Many site owners will therefore be pleased to hear that Google has provided some tips on how to address it.&lt;/p&gt;
</extract-formatted>
  <extract-unformatted>&lt;FormattedContent xmlns="http://www.e-consultancy.com/schema/formattedContent/"&gt;
  &lt;Paragraph&gt;
    &lt;Emphasis&gt;Duplicate content is an important issue that can have an adverse effect on a website&#8217;s search engine rankings.&lt;/Emphasis&gt;
  &lt;/Paragraph&gt;
  &lt;Paragraph&gt;Many site owners will therefore be pleased to hear that Google has provided some tips on how to address it.&lt;/Paragraph&gt;
&lt;/FormattedContent&gt;</extract-unformatted>
  <featured type="boolean">false</featured>
  <id type="integer">2511</id>
  <learn-more-formatted nil="true"></learn-more-formatted>
  <learn-more-unformatted nil="true"></learn-more-unformatted>
  <legacy-article-id type="integer">365775</legacy-article-id>
  <name>Google provides tips on duplicate content</name>
  <private type="boolean">false</private>
  <published-at type="datetime">2008-06-13T10:34:00+01:00</published-at>
  <slug>google-provides-tips-on-duplicate-content</slug>
  <tweetbacks-updated-at type="datetime">2009-04-28T23:10:45+01:00</tweetbacks-updated-at>
  <unpublished-at type="datetime" nil="true"></unpublished-at>
  <updated-at type="datetime">2009-04-28T23:10:45+01:00</updated-at>
  <views-count type="integer">1322</views-count>
</blog-post>
