How Do I Identify Bad Backlinks? – A Beginners Guide

Date: 16th July 2013

Author: Liam McGurk

Having low-quality, spam backlinks pointing to your website is not only bad from a ranking point of view but it can also be extremely detrimental to the credibility of your brand.  No business should want to have external references from untrustworthy, deceitful sources and this same mentality should transpire into your online marketing efforts.

Before the introduction of Panda and more importantly Penguin, many so called “SEO Experts” were able to easily manipulate and trick Google’s algorithms by rolling out automated link building tactics where links to client websites were placed on 1000’s of directories, article websites and social bookmarking networks at the click of a button. This quantity over quality approach on some occasions achieved some form of ranking success and unfortunately deceived the client into thinking an ethical job had been done.

However, the longevity of this strategy soon caught up with most and on 25th May 2012 Google began to heavily penalise websites that had participated in these underhand strategies with the roll out of their Penguin update. This update was specifically aimed at decreasing search engine visibility of websites that had participated in what Google deemed to be black hat techniques including; cloaking, keyword stuffing and link networks to name but a few.

Back in October 2012, Google introduced it’s widely anticipated Disavow tool allowing webmasters and site owners to inform Google of the low quality links in their backlink profiles and request that these be ignored when determining the position of their website in the search index. However, Google were very quick to stress that all efforts must be carried out to first attempt to remove low quality links before using the Disavow tool.

There are a number of tools out there to help you automate the rather mundane process of identifying bad backlinks, however, I prefer to adopt a much more granular approach when evaluating a website’s backlink profile as automated tools can adopt false negative and false positive results.

This guide illustrates the steps needed to fully identify the harmful links in your backlink profile.

Step One: Has Your Website Been Hit With a Penalty

Before diving straight in and identifying the low quality backlinks it’s first good to establish whether or not your website may have been hit with a Google penalty. There are two types of Google link penalties, the first being a manual penalty and the second being an algorithmic (Penguin) penalty.

  • The Manual Penalty
    • If your website has been hit with a manual penalty you will be notified in Google Webmaster Tools with a detection of unnatural links message. It is unknown as to what causes Google to begin reviewing your website on a manual basis however there is speculation that someone submitting a spam report against your website or targeting overly competitive niches such as loans or insurance will trigger Google to monitor your website more closely.
  • The Algorithmic (Penguin) Penalty
    • Google won’t notify you if you have been hit with an algorithmic penalty, however you may begin to see a decrease in organic search traffic around the time of algorithm refreshes. The best way to determine loss of traffic around algorithm changes is through syncing up your website analytics with the Panguin tool. The Panguin tool benchmarks the dates of major Google algorithm changes against your organic search traffic.

 

Figure 1.0 – Example Panguin tool data over 7 month period

Step Two: Gathering Your Backlink Data

Understanding the severity of your backlink profile is key to the clean up. There are many tools out there that help you discover what links point to your website and to get a full understanding it’s imperative that you use a range of these tools to ensure no stone is left unturned. Tools that I find useful are:

In order to get all of the data out; Open Site Explorer and MajesticSEO you will need to sign up to their services, however, Open Site Explorer offer a free 30 day premium trial and MajesticSEO allow you to export full link data for your website with the upload of a unique .txt file to the root of your website directory.

Once you have access to the relevant tools the next step is to export all link data out of each into individual Microsoft Excel spreadsheets, similar to figure 2.0 below.

Figure 2.0 – Link Data in Excel

Step Three: Collating Your Backlink Data

Once you have all your backlink data from the above 4 resources split out into individual sheets the next step is to collate all of this data into one spreadsheet so we have a complete overview of all the backlinks.

We want to keep consistency throughout our list of backlinks and as Google Webmaster Tools only allows you to export linking URLs we will focus purely on collating a list of linking URLs.

Create a new sheet within your current Excel Spreadsheet and name it “Collated”. We now want to copy and paste over all of our linking URLs from each individual sheet into our newly formed “Collated” sheet.

Of course there are going to be duplicate URLs in our collated spreadsheet as high authority links are likely to be flagged in all backlink discovery tools. In order to remove duplicate entries we need to highlight all entries in our collated list sheet.

 

Figure 3.0 – Selecting Our List of Linking URLs

Once we have our linking URLs column highlighted we need to remove the duplicate entries by navigating to “Data” > “Remove Duplicates”.

 

Figure 3.1 – Remove Duplicate Entries

Make sure the correct column is selected, in this case “Column A” and click “OK”. You should now be presented with a message notifying you of how many duplicate entries have been deleted.

 

Figure 3.2 – Successful Removal of Duplicate Entries

We now have a complete list of URLs and can begin the process of identifying the ones that are both live and harmful to our website.

Step Four: Identifying the Dead Links

Although most backlink research tools aim to provide the latest data some can be caught short with the data they provide, webmasters may have recently taken down websites or removed pages where our links may be present. In order to get the most up to date data we need to do a real time check of each backlink to check whether our link is still live.

SEO Tools for Excel is a great Excel plugin that allows you to check live links on the fly with the use of a clever formula. Follow the below video tutorial to installing the plugin.

Once the plugin is installed we now need to enter the formula to check our list of URLs for any links pointing back to our website.

Insert into column “B1”

=CheckBacklink(A1,"http://www.YourWebsiteURL.com*")

and press return (replace http://www.YourWebsiteURL.com with your actual URL. Leave the trailing *). SEO Tools for Excel will now go and check the URL in column A1 for any active hyperlink to your website (including your website’s inner pages).

Simply drag that formula down to the bottom of your spreadsheet and let the plugin determine the active links pointing to your website. Once completed you should have messages similar to the ones pictured below.

 

Figure 4.0 – Identifying the Active Backlinks

We can now see a list of the current active backlinks pointing to our website, we can also see instances of links that return 404 errors, internal 500 server errors or where the website’s webmaster has already removed our link.

The next step in the process is to delete the rows where the plugin has failed to find a link to our website. The rows left are our list of active backlinks that we can begin to categorise and decide whether they are harmful or not.

Step Five: Identifying the Bad Links

Most online marketers can quickly identify what looks like a harmful link purely from looking at the URL, however it’s still good to delve in and manually analyse a given set of domains based on the following criteria.

  • Low Domain Flow Metrics (Majestic Citation & Trust Flow, MOZ Domain Authority and Page Authority)
  • Irrelevant CCTLD (Links from country level domains where your business has no target audience / operating presence) e.g. A link to a local UK florist from http://www.SomeRandomDomain.ru/
  • Unrelated links from websites that bear no relevance to the content / theme of our website
  • The number of outbound links from the page your backlink is situated – the more outbound links the less credible your link becomes
  • Website home page harbours 0 or N/A Google Page Rank (N/A could be an indicator that the website has been banned)
  • No social activity (Likes, Shares & Retweets)
  • Overly optimised, keyword rich anchor text
  • Little or no indexed pages in Google’s SERP’s

Once we have our list of active URLs linking through to our website we can run a few more SEO Tools for Excel functions to extract more data including; Facebook Likes, Google Plus Count, Twitter Count, Google Page Rank, Google Index Count. This data will give us a more insightful overview into determining the quality of our backlinks.

 

Figure 5.0 – Gathering More Data

We can see from figure 5.0 that the last two links (rows 8 and 9) look particularly promising as they have healthy numbers of social activity, Google Page Rank and 1000+ indexed pages. Page Rank set to -1 tells us that that specific page has a Page Rank of N/A.

The functions for adding the additional queries are as follows:

IP Address =ResolveIP(A2)
Facebook Count =FacebookLikes(A2)
Google Plus Count =GooglePlusCount(A2)
Twitter Count =TwitterCount(A2)
Page Rank =GooglePageRank(A2)
Google Indexed Pages =GoogleIndexCount(A2)

Where “A2” is the location of the first URL cell. Similarly to when we ran the Check Backlinks formula we want to drag our cell formula to the end of our spreadsheet to run it against all the URLs listed.

We can now begin to highlight the low quality backlinks in our backlink profile. Obviously I would still thoroughly recommend visiting all the URLs to gauge a proper understanding of the severity of individual links and I would not make a qualified conclusion as to the effectiveness of a link until I had carried this out.

I am sure most will agree that the identification and process of highlighting bad backlinks is a fairly mundane task, however it is imperative to the success of an online marketing campaign and is something that should be dealt with from the get go, rather than leaving to escalate into a much bigger issue later on down the line. Please feel free to let me know your thoughts in the comments below, I would love to hear them!

The following two tabs change content below.

Comments (15)

  1. Kathryn says:

    Great article, I am already doing most of this. I have one question though to make sure I am still on the right track-how do you obtain the emails/contact info for the pages you want to ask to remove the links. I am using http://whois.domaintools.com/ currently

    • Liam McGurk says:

      Hi Kathryn,

      Very good question. I use a range of methods to obtain contact details including WHOIS domain lookup. I have detailed my methods below, in order of priority:

      – Website Contact Information – Finding a contact us page with valid contact form / contact details on – I will do this by navigating around the website or failing that I will use Google Advanced Search Queries e.g. site:example.com inurl:”contact” to locate a contact us page

      – Next I will use http://www.whoishostingthis.com/ to identify the website’s hosting provider and also http://who.godaddy.com/ to pull out all the relevant WHOIS information for the domain

      – If by this point I have not managed to source any contact details off the website or from the WHOIS I will do a reverse IP lookup (http://www.yougetsignal.com/) to see all the other websites hosted on the same IP address and gauge whether they may be administered by the same person, if so, do they have any contact details on them?

      During this process I am keeping a record of communication using screenshots so I can prove (if needed) the lines of communication undertaken to request the removal of my links.

      So overall, I send out removal requests via web contact forms, emails to both the webmaster and web host provider. I will normally email three seperate requests before I consider compiling a list of links to Disavow.

      I hope this has helped Kathryn, any questions please do not hesitate to ask.

      Best Regards,
      Liam

  2. Carlos says:

    Is there any evidence that the Disavow Tool actually removes backlinks? We went through this process nearly two months ago. Not only are some bad backlinks still there, but we’re seeing even more of them, through no fault of our own.

    • Liam McGurk says:

      There are plenty case studies of the disavow tool working for websites, check out http://www.linkresearchtools.com/case-studies for example. Although the disavow doesn’t ‘remove’ backlinks remember, the disavow tool should be used to effectively tell Google not to take those links into consideration. If you are experiencing more bad backlinks, it’s important to monitor this and try to remove the links and if you cannot remove the links, you could resort to using the disavow tool.

  3. James says:

    Probably one of the most thorough and constructive articles on identifying bad links that I’ve found. By far this article has helped me to understand the issue and actually do something about it.

    Thanks,

    James

  4. cup of coffee says:

    I am in fact thankful to the owner of this site who has shared this wonderful piece of writing at
    here.

  5. Advertising Agencies in Pakistan says:

    thank you all

  6. Hasan says:

    Learn many thing from this post.
    Thanks

  7. Amy says:

    Great article. You have put lot of effort for this post. Hats off. This is a perfect guide to analize bad backlnks for SEO people. Thanks a lot

  8. Dominique says:

    Very good article. I’m facing many of these issues as well..

  9. Anonymous says:

    Hiya! I simply would like to give an enormous thumbs up for the good data you

  10. Dixie says:

    Very descriptive post, I loved that bit. Will there be a
    part 2?

  11. Jeramy says:

    I am really loving the theme/design of your web site.
    Do you ever run into any web browser compatibility problems?
    A few of my blog visitors have complained about my site
    not working correctly in Explorer but looks great
    in Safari. Do you have any solutions to help fix this issue?

    • Liam McGurk says:

      Hi Jeramy,

      There are several solutions to this. If your site is wordpress based this plugin is a popular solution.

      There are also websites that look at your website and check to see if it works with versions of popular browsers, some even suggest solutions! My personal favourite is Browser Shots.

      Hope this helps,
      Liam

  12. Jim Kahn says:

    You actually make it seem so easy with your presentation but I
    find this topic to be actually something which I think I would never understand.
    It seems too complex and extremely broad for me. I’m looking forward for
    your next post, I will try to get the hang of it!

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Stories

Which Social Networks for Your Business?

9th January 2013

Social media has very quickly become a hugely important part of doing business online and its influence on con...

Writing for the Web

29th October 2013

In the world of SEO, knowing how to go about ‘Writing for the web’ can be hard, especially now Google guidelines for content have changed....