What is Duplicate Content?

Subscribe to the Blog

Your email:

AcademyCTA



Post Onboarding Consulting Packages

Attend Live Webinars

Check out the upcoming #InboundLearning webinars and register to attend live. You'll get to ask questions and participate in the conversation.

See Upcoming Webinars

See the latest from HubSpot

Current Articles | RSS Feed RSS Feed

What is Duplicate Content?

  
  
  

As an Inbound Marketing Consultant here at HubSpot the questionwhat is duplcate content of duplicate content comes up at least once during the consulting period for every customer. The most common question is “what is duplicate content?” so let’s start with that. Google’s answer to the question is as follows:

“Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar.”

So, what does that mean exactly? If you have the same content on different URL’s, regardless of whether it’s on the same domain or across multiple domains, it’s duplicate content.  Search engines try to eliminate duplicate content to give you the best search results possible and limit spammers attempts to rank for the same content multiple times.

Still, Google recognizes that in many cases duplicate content is not intentional and sometimes even out of your control. To make sure you don’t mistakenly have duplicate content across the web you can proactively address any potential duplicate content issues. In instances where your own content is being scraped by other websites and republished you shouldn’t worry about being penalized by search engines as they recognize these occurences.

Two specific questions came up during the first webinar in our Blogging series that I wanted to address.

1.) The first came from Angela Stevens of Reading Horizons who asked “is posting an article from a guest blogger on your blog who had already published that same article on their domain duplicate content?”

The answer is yes. As long as the majority of the content, if not all of it, can be found elsewhere it’s always duplicate content – regardless of the circumstances. A better solution that still gives credit to the original content creator and while providing an opportunity to rank for a new keyword is to reference that blog article and include a link back to it through a new post that is relevant to the original article. Take the original blog post and expand upon it express your opinions.

2.) The second question came from Dwight Kellams of Your Financial Watchdog who asked if “republishing  existing articles on our new HubSpot blog page from our old blog duplicate content?”

Once again, if the content can be found in more than one place it is duplicate content. In this case you would want to setup a permanent 301 redirect from the old blog article’s URL to the new URL. This will maintain the authority of any inbound links generated by that article as well as keep the links intact. You also will want to remove the original blog article from the old blogging platform.

Have any more questions regarding duplicate content? Ask us in the comment section below.

Photo credit: Mukumbura

Comments

Thank you for posting this informaiton on duplicate content. I was unable to attend last week's HubSpot Content Camp on Blogging Optimization so I found this blog post very helpful. I will put this to use on our future staffing software blog posts.
Posted @ Tuesday, January 03, 2012 7:17 AM by Angie Ryan
@Dick 
 
You won't get knocked for duplicate content for just having similar (manufacturers I'm assuming) descriptions. However, by having those similar descriptions you are limiting the number of keyword phrases you can get found by. Each page of your website should target a unique keyword, so I recommend creating a unique page title, url, meta description etc. for every product page of your website. 
 
You may be interested in checking out our eCommerce and inbound marketing webinar as well: <a>http://www.hubspot.com/ecommerce-and-inbound-marketing/<a>
Posted @ Tuesday, January 03, 2012 7:17 AM by Billy MacDonald
I have a web store where each product has a page and many products are very similar. Each product (web page) has detailed descriptions and most of the descriptive sentences for each product are identical. Duplicate content? Affecting my rankings?
Posted @ Tuesday, January 03, 2012 7:17 AM by Dick Zimmerer
Thank you both!
Posted @ Tuesday, January 03, 2012 7:17 AM by Julie Graff
@Stephen: You could definitely do this. Great idea!
Posted @ Tuesday, January 03, 2012 7:17 AM by Mark Kilens
Hi Julie, 
 
Re: PDF files. I am wondering if you would be best to disallow the folder with your pdf's in your robots.txt file. 
Posted @ Tuesday, January 03, 2012 7:17 AM by Stephen Crewson
@Julie: In the case you described you should be fine and run no risk of duplicate content. Paul did mention last week that the search engines are indexing PDFs now, so if you had a PDF and a website page with the same content then that could be a problem. We'll keep you posted when we find or hear more info on the subject. Thanks!
Posted @ Tuesday, January 03, 2012 7:17 AM by Mark Kilens
We have a section on our Web site where people can view pdf versions of a newsletter we send out via mail. Sometimes we take content from those newsletters and break them into smaller blog articles. We rarely copy the newsletter articles word for word, but do often take chunks of text, particularly quotes. Could this cause a problem with duplicate content? Does it matter if some of the content is in a pdf rather than a regular Web page?
Posted @ Tuesday, January 03, 2012 7:17 AM by Julie Graff
Comments have been closed for this article.