| What is duplicate content? |
| Written by Peter Dowse | |
| Monday, 12 May 2008 | |
|
To be a valuable search engine you need a fresh index with lots of new information in it… all the time. To get this information, a search engine crawls the web and then filters what it finds into usable chunks of data that relate to search queries. One of the things search engines do to ensure the quality of their data is remove duplicate content from their index, because it fills up their servers with more copies of a page than are needed and creates a bad user experience for searchers. Could you imagine getting the same article from different websites in all ten results on a search engine page? Searching for information is about cross-referencing different bits and pieces to make an informed overall judgement about something, someone or somewhere. Search engines know this. That’s why they spend so much time making sure the content served up for a search query is relevant, non-spammy and creates a great user experience.
So we have established that search engines don’t like duplicate content, but what exactly is duplicate content? As the name implies, it’s content on your website that is identical to content found elsewhere, whether on your own site or on someone else’s. As mentioned before, this is bad for search engines, so making sure you have unique content on your website is a must if you want any chance of ranking well. There are many reasons for duplicate content on your website. Here are just a few:
Print pages
FIX: You could put all your print-friendly pages into a single directory on your server and then use a robots.txt file to block search engine crawlers from that directory.
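As a rough sketch of the robots.txt fix above, the snippet below checks a rule set with Python's standard `urllib.robotparser` module. The `/print/` directory name, the `ExampleBot` user agent and the example URLs are assumptions made up for illustration. Your own directory will probably be called something else.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents. "/print/" is an assumed directory
# name for the print-friendly pages, not one taken from the article.
ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
"""


def is_crawlable(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the rules above allow `user_agent` to crawl `url`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # A print-friendly copy inside the blocked directory.
    print(is_crawlable("http://www.yourwebsite.com/print/article-1.html"))
    # The normal version of the same page.
    print(is_crawlable("http://www.yourwebsite.com/articles/article-1.html"))
```

Running a check like this before you upload the file is a cheap way to confirm you are blocking the print copies and not the real pages.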
Canonicalization issues
Several different URLs could all point to your homepage (please note this is an extreme example, but still possible all the same). If a search engine crawls and indexes all these versions, there could be multiple copies of your homepage… or worse, your entire website in the index. This would be very bad news indeed. One thing to keep in mind is that if your competitors are smart and notice that you haven’t redirected your URLs correctly, they could point links from other websites or directories at your different URLs. This forces a search engine to crawl all of them, which could lead to a drop in rankings.
FIX: First you will need to find out if there are any additional versions of your website or homepage in the index. You can do this with the site: operator (put site: before each of your URLs in a search engine’s search box to check whether it’s in the index, e.g. site:http://www.yourwebsite.com). If you have multiple versions of your site in a search engine’s index, you will need to ‘301 redirect’ the unwanted URLs to your main URL. (If you want further info on how to do a ‘301 redirect’, leave a comment and I’ll get back to you.)
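To make the idea of a ‘301 redirect’ a little more concrete, here is a minimal Python sketch of the decision a server has to make: given a requested URL, work out the one preferred version and redirect to it if the two differ. The preferred host, the list of index filenames and the helper names `canonical_url` and `redirect_for` are all assumptions for illustration. In practice this logic usually lives in your web server's configuration and not in a script.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumptions for illustration only: the preferred host, its bare
# (non-www) form and the default index filenames are hypothetical.
CANONICAL_HOST = "www.yourwebsite.com"
BARE_HOST = "yourwebsite.com"
INDEX_FILES = ("index.html", "index.htm", "index.php", "default.asp")


def canonical_url(url: str) -> str:
    """Map a URL variant to the single preferred form."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host == BARE_HOST:
        host = CANONICAL_HOST  # non-www -> www
    path = parts.path or "/"
    # Treat /index.html and similar files as the directory itself.
    head, _, tail = path.rpartition("/")
    if tail.lower() in INDEX_FILES:
        path = head + "/"
    return urlunsplit((parts.scheme, host, path, parts.query, ""))


def redirect_for(url: str):
    """Return (301, target) if `url` should be redirected, else None."""
    target = canonical_url(url)
    return (301, target) if target != url else None


if __name__ == "__main__":
    for variant in (
        "http://yourwebsite.com",
        "http://www.yourwebsite.com/index.html",
        "http://www.yourwebsite.com/",
    ):
        print(variant, "->", redirect_for(variant))
```

Every variant ends up at one address, so a search engine that follows the redirects only ever has one version of the page to index.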
Manufacturers’ product descriptions
FIX: Really the only way to get around this is to write your own product descriptions so that your content is unique and original.
Product pages
FIX: You could re-write all your shopping cart pages. However, if you have a few thousand products this could be a very large job indeed. Another option is to analyse your website, find out which products generate the most revenue for you and block the others from crawlers using a robots.txt file. (This isn’t the best solution. However, you may find that the lift in rankings from fewer duplicate content penalties increases your revenue.)
Stolen content
Multiple domains
As you can see, there are a few ways your site can produce duplicate content. If you are aware of these issues and take appropriate measures to make sure your site doesn’t suffer from them, you shouldn’t have too many problems. |