
Technical Search Engine Optimisation

by Ferdous Haider on May 25, 2013

Technical SEO: 3 Quick Wins

Technical SEO is hot right now! There is a lot going on in the SEO world, where everyone is going insane trying to find the “Google secrets” that will trick big G into sending bucketloads of traffic to their sites. I hope one thing is clear to all of us: there are no quick wins in search engine optimisation (or should I say Search Experience Optimisation?). But behold, I guess there are some quick wins after all, and they come from making Google happy, not from tricking it into giving you rankings you don’t deserve. Yes, boys and girls, ladies and gentlemen, I am talking about technical SEO wins.

Crawl and Fix:

Crawl your site using tools like Screaming Frog or Xenu’s Link Sleuth to find pages that return redirects or errors (3xx Redirection, 4xx Client Error, 5xx Server Error), then fix them. If a redirect is necessary, make sure it is a 301 (permanent redirect), not a 302 (temporary redirect).
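Once the crawl is exported, the triage step can be scripted. Here is a minimal sketch, assuming the export has been reduced to (URL, status code) pairs; the sample rows and the wording of the suggested actions are made up for illustration and are not output from Screaming Frog or Xenu.

```python
def triage(status):
    """Map an HTTP status code to a suggested follow-up action."""
    if 200 <= status < 300:
        return "ok"
    if status in (301, 308):
        return "permanent redirect: point internal links at the final URL"
    if 300 <= status < 400:
        return "non-permanent redirect: switch to a 301 if the move is permanent"
    if 400 <= status < 500:
        return "client error: restore the page or 301 it to a relevant URL"
    if 500 <= status < 600:
        return "server error: check the server logs and configuration"
    return "unexpected status: check manually"


def report(rows):
    """Keep only the rows that need attention, with the suggested action."""
    return [(url, status, triage(status))
            for url, status in rows
            if triage(status) != "ok"]


# Hypothetical crawl rows, for illustration only.
crawl = [
    ("http://example.com/", 200),
    ("http://example.com/old-page", 302),
    ("http://example.com/missing", 404),
]

for url, status, action in report(crawl):
    print(status, url, "->", action)
```

Healthy pages are left out of the report, so the list stays short enough to work through by hand.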

Seek and Destroy Duplicate Content:

Duplicate content has always kept sites from ranking to their potential. There are many ways a site can generate duplicate content; the most common instances are listed below.

  • Problem: The domain is available in both www and non-www versions.
  • Solution: Select a default version of the site and use a permanent redirect (301) to send the secondary version to the default version.
  • Problem: URLs are available with and without a trailing slash (/).
  • Solution: Select a default URL format and use a permanent redirect (301) to send the other format to the default one.
  • Problem: URLs are indexed in both http and https versions.
  • Solution: Do not use https site-wide; only use https where you need to encrypt user information.
  • Problem: URLs are available in both uppercase and lowercase.
  • Solution: Force lowercase URLs site-wide.
  • Problem: Category pages create duplicate content.
  • Solution: Use noindex meta tags to tell search engines not to index category pages.
  • Problem: The same content is used within a site or across sites for obvious reasons.
  • Solution: Use rel="canonical" tags to identify the default version.
  • Problem: Your content is being copied.
  • Solution: Use tools like Copyscape to find the copies and reach out to have them taken down. You can also submit a takedown request to Google.
  • Problem: Many web development processes involve building the site on a staging server; however, I have seen many instances where the staging server stays live and indexed after the main site has launched, creating duplicate content.
  • Solution: Password-protect your staging server and use robots.txt to tell search engines not to crawl staging pages.
  • Problem: Many sites have multiple versions of their home page (/index, /default, /home etc.).
  • Solution: Make only one version available to crawlers and users (ideally the root domain) and 301 redirect all other versions to it.
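Several of the fixes above (www vs non-www, trailing slash, letter case, home page variants) come down to one rule: work out the preferred form of a URL and 301 anything else to it. Here is a rough sketch of that rule. The choices baked in (www preferred, no trailing slash except on the root, the particular home page file names) are assumptions for illustration, so swap in whatever your site has settled on. The scheme is left untouched, and lowercasing the path is only safe if your server does not serve different pages for different letter case.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumed home page variants to collapse onto the directory root.
INDEX_NAMES = {"index", "index.html", "index.php", "default", "default.aspx", "home"}


def preferred_url(url):
    """Return the single preferred form of a URL under the assumed policy."""
    scheme, host, path, query, _fragment = urlsplit(url)
    host = host.lower()
    if not host.startswith("www."):
        host = "www." + host              # assumed policy: www is the default
    segments = path.lower().split("/")    # force lowercase paths
    if segments[-1] in INDEX_NAMES:
        segments[-1] = ""                 # /index, /home etc. become the root
    path = "/".join(segments).rstrip("/") or "/"   # no trailing slash except root
    return urlunsplit((scheme, host, path, query, ""))


def needs_redirect(url):
    """True when the URL should be 301 redirected to its preferred form."""
    return preferred_url(url) != url


print(preferred_url("http://Example.com/Blog/Index.html"))
print(needs_redirect("http://www.example.com/blog"))
```

Running a crawl export through `needs_redirect` gives a quick list of URLs that still resolve in a non-preferred form.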

Need for Speed:

Your site’s performance and speed are vital for both your search rankings and your visitors. Google has long included site speed as a ranking factor, and a slow site also hurts your conversions.

Use tools like WebPageTest and Google PageSpeed Insights to measure your site speed, and follow their recommendations to make your site faster. Monitor your page speed regularly using Google Analytics data. Ideally, every page on your site should load in less than 4 seconds.
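For the regular monitoring part, a few lines of Python can flag pages that miss the 4 second target. This is only a sketch: it assumes you have already exported load times as (page path, seconds) pairs, and the sample numbers are invented.

```python
from collections import defaultdict

THRESHOLD_SECONDS = 4.0  # the target mentioned above


def slow_pages(samples, threshold=THRESHOLD_SECONDS):
    """Average the load time per page and return the pages at or over the threshold."""
    totals = defaultdict(lambda: [0.0, 0])   # path -> [sum of seconds, sample count]
    for path, seconds in samples:
        totals[path][0] += seconds
        totals[path][1] += 1
    averages = {path: total / count for path, (total, count) in totals.items()}
    return {path: avg for path, avg in averages.items() if avg >= threshold}


# Hypothetical load time samples, for illustration only.
samples = [("/", 2.0), ("/", 3.0), ("/shop", 5.0), ("/shop", 4.0)]
print(slow_pages(samples))
```

Averages hide the occasional very slow load, so you may want to look at the worst samples per page as well.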

 

 


Web data scraping Guide for SEO

What is Web data scraping?

We frequently hear the terms “data scraping”, “data mining” and so on. Yes, it’s true that there is a wealth of information out there and every marketer can reap some benefit from it, but the question is how? I have been using a number of tools for a while now to scrape data for SEO. I use the data for on-site audits, link analysis, and blog prospecting and outreach. Data scraping for SEO does sound complicated, but trust me, it’s not. You just need to make sure you have the right tools for the job and that you are going after the right data.

How to use scraped data for SEO?

On-site Audit

You can use a number of crawlers to crawl the site you are auditing and scrape very useful data about on-page elements to determine whether the site is well optimised. You can use tools like Screaming Frog (paid, with a functional free version available) or Xenu (free). I would recommend Screaming Frog; you could call it the “Ferrari” of crawling. There is a great article on the SEER Interactive blog on how to use Screaming Frog to its maximum potential.

Link Analysis

Now that bad-link penalties from Google are a reality, link analysis (or link auditing) is becoming more vital by the day. Given the pre-2010 SEO tactics used by many business owners and agencies, it is no surprise that they now have a number of links in their backlink profiles that they wish would disappear. Mighty Google (and Bing as well) took pity on us and now allow us to report these bad links via their Disavow tools. However, I believe disavowing is not a good solution until you know which links are good and which are bad.

Scraping data can be the way to determine external link quality. You might say that backlink data is already available from third-party tools such as Open Site Explorer, Ahrefs, Majestic SEO etc., but none of them reports a complete backlink profile. As Google is our primary target engine, I prefer to use its data, and it is certain that if someone receives a link penalty, the offending link(s) will be listed in the backlink profile in Google Webmaster Tools. The problem is that Google gives you the backlink data without any additional metrics, so it’s hard to tell which links are the bad ones! Yes, you can check them one by one in your browser, but if you have a link profile with thousands of links, I wish you all the best with that!

This is where data scraping comes to the rescue. You can gather link value metrics using a number of scraping tools, depending on which metrics you use to evaluate links. I personally use the linking domain’s PageRank, server status (404, 500 etc.), page title (e.g. look for titles in a foreign language if you have an English-language site), the number of external links on the linking page (e.g. over 100 external links), and a blacklist match (matching the linking domain against known spam sites, low-quality directories etc.). You might want to use different metrics; however, the ones I have mentioned above can be scraped using the tools below:

PageRank – SEO Tools for Excel (free), or ScrapeBox (paid) if you need to scrape thousands of domains

Server Status, Page Title and number of external links – Screaming Frog using list mode (see under configuration)

Blacklist – I have a list of a couple of million blacklisted domains that you can download for free.

You can also find an example of a bad-link analysis (done by me) using this method here (it has a couple of extra metrics, such as Trust Flow and Citation Flow). I will publish a blog post with the detailed process for creating this analysis soon.
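Once the metrics are collected, flagging suspicious links is mostly a matter of applying the rules described above. The sketch below is not the exact process behind the linked analysis; it assumes each link is a dict with the hypothetical keys `url`, `status`, `title` and `external_links`, leaves PageRank out, and uses a very crude stand-in for the foreign-language title check (mostly non-ASCII letters).

```python
from urllib.parse import urlsplit

MAX_EXTERNAL_LINKS = 100  # threshold mentioned in the text


def looks_foreign(title):
    """Crude proxy for a non-English title: most letters are non-ASCII."""
    letters = [ch for ch in title if ch.isalpha()]
    if not letters:
        return False
    non_ascii = sum(1 for ch in letters if ord(ch) > 127)
    return non_ascii / len(letters) > 0.5


def flag_link(link, blacklist):
    """Return the list of reasons a linking page looks like a bad link."""
    reasons = []
    domain = urlsplit(link["url"]).netloc.lower()
    if domain.startswith("www."):
        domain = domain[4:]
    if domain in blacklist:
        reasons.append("blacklisted domain")
    if link["status"] >= 400:
        reasons.append(f"error status {link['status']}")
    if link["external_links"] > MAX_EXTERNAL_LINKS:
        reasons.append(f"more than {MAX_EXTERNAL_LINKS} external links")
    if looks_foreign(link["title"]):
        reasons.append("title looks non-English")
    return reasons


# Hypothetical data, for illustration only.
blacklist = {"spam-directory.example"}
link = {
    "url": "http://www.spam-directory.example/page",
    "status": 200,
    "title": "Free links",
    "external_links": 250,
}
print(flag_link(link, blacklist))
```

A link with an empty reason list is not automatically good; it just did not trip any of these particular rules.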

Blog prospects and find outreach data

It is becoming more and more difficult to run a link building campaign within a set budget, mainly due to the number of hours needed to prospect and find outreach information (email, social media etc.). Most agencies charge $150 – $300 per hour for SEO work, so it is quite hard to justify investing 20-30 hours in link prospecting and collecting outreach emails. There are services out there to help you with blog outreach; however, they seem too protective of their data (no raw export etc.) and quite costly for the service they provide. On the other hand, you cannot manually compile a massive list of blogs and potential sites to reach out to.

You can use a crawler such as Screaming Frog to crawl sites that list lots of qualified blogs (verified by actual humans and categorised by content), running a complete site crawl and exporting the external links. You can also crawl a specific category to export only the sites under that category. Once you have the list of high-quality blogs, you can use ScrapeBox to gather data for evaluating blog popularity, such as PageRank, Alexa Rank, number of pages indexed etc. You will then have a great list of blogs from the niche you are working on, with blog value metrics, so you know who to contact first. You can also use a browser-based scraper such as multi-links for Firefox or Scraper for Chrome to scrape lists already compiled by known publishers, or search results. Again, I would love to write a detailed post about the process I have mentioned above.
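If you prefer to script the "export external links" step for a single directory page, Python's standard library is enough. This is a minimal sketch of the idea, not what Screaming Frog does internally; the page URL and HTML snippet are invented, and fetching the page is left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class ExternalLinkParser(HTMLParser):
    """Collect the hosts of links that point away from the page's own host."""

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.page_host = urlsplit(page_url).netloc.lower()
        self.domains = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        target = urlsplit(urljoin(self.page_url, href))  # resolve relative links
        host = target.netloc.lower()
        if target.scheme in ("http", "https") and host and host != self.page_host:
            self.domains.add(host)


def external_domains(page_url, html):
    """Return the sorted external hosts linked from one page of HTML."""
    parser = ExternalLinkParser(page_url)
    parser.feed(html)
    return sorted(parser.domains)


# Hypothetical directory page, for illustration only.
html = (
    '<a href="/about">About</a>'
    '<a href="http://blog-one.example/post">Blog one</a>'
    '<a href="https://Blog-Two.example/">Blog two</a>'
    '<a href="mailto:editor@blog-one.example">Mail</a>'
)
print(external_domains("http://directory.example/category", html))
```

The resulting host list is the kind of input you would then hand to a metrics tool to rank the prospects.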

Once you have a nice list of blogs and sites to reach out to, the next thing you need is contact details for the opportunities you have identified. If you are after email addresses only, you can purchase Web Email Extractor from Newprosoft, which works really well for large lists. If you need social profiles as well, you can buy BuzzStream for Link Building, which includes a scraper capable of gathering social profiles along with email addresses (however, its email scraper searches certain URLs only and returns far fewer contacts than Web Email Extractor). You can also create a custom scraper on 80legs to scrape social profiles from URLs.
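For a small list, pulling email addresses out of pages you have already downloaded can be done with a regular expression. The sketch below is a simplified illustration and not how any of the tools above work; the pattern is deliberately loose and will miss obfuscated addresses such as "name at domain dot com".

```python
import re

# Simplified pattern: good enough for plain addresses, not a full RFC 5322 parser.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text):
    """Return the unique email addresses found in text, lowercased, in first-seen order."""
    found = []
    for match in EMAIL_RE.findall(text):
        email = match.lower()
        if email not in found:
            found.append(email)
    return found


# Hypothetical page source, for illustration only.
page = (
    'Contact <a href="mailto:Editor@blog-one.example">Editor@blog-one.example</a> '
    "or ads@blog-one.example."
)
print(extract_emails(page))
```

Duplicates are collapsed because the same address usually appears both in the `mailto:` link and in the visible text.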

Happy scraping. Let me know in the comments if you love scraping too and what tools you use to get the job done. Also let me know if you want to see more posts on data scraping like this.

Bonus: list of around 5000 Australian Blogs!


Must Follow SEO Blog list for Beginner (and every SEO professionals too!)

April 16, 2012

I have seen a lot of buzz and boom around Internet Marketing, and SEO is one of the primary aspects of Internet Marketing. So more and more people are choosing SEO as a potential career. I will be compiling a list of the SEO blogs I found most beneficial as an SEO beginner. When I [...]

Read the full article →

Learning SEO? Here is a great starting point

April 4, 2012

I still remember how it was when I started learning SEO. There are lots of resources out there for the beginner; however, with SEO being such a dynamic industry, it’s not always easy to pick the best source of information to begin learning SEO. I have compiled a list here of what I believe will aid [...]

Read the full article →

I am a Content Marketer

March 29, 2012

It’s been a while since I last wrote on my blog. I am not sure how other folks in my industry (Jason from Kaiserthesage or Jon from Pointblankseo) manage to publish such quality posts regularly on top of the amount of work and reading we need to do to survive in this industry. There is a lot going on in [...]

Read the full article →

This Keyword Competition Tool = less work+more bucks!

October 15, 2011

I was (almost) blown away when I took the trial of the keyword competition tool by Pasha Stewart and Darrin Demchuk of SerpIQ. This amazing, super-fast, easy-to-use tool existed only in my imagination. The tool, I believe, will help thousands of professionals involved in internet marketing. This tool will make your [...]

Read the full article →

Google Plus and SEO?

September 17, 2011

Guest Post By Alex Petrovic. There has been a lot of hype, and some trembling, regarding Google Plus. Web surfers and geeks are raving that it will revolutionize internet searching, while businesses and SEO consulting firms fear that it will end search engine ranking procedures that have been done for years. Google Plus is [...]

Read the full article →

Bingo! Bing Knows Now That My Site is From Australia

July 9, 2011

Today I want to discuss geographic targeting in Bing and Yahoo. I have quite a few clients who are located in Australia with a .com.au domain and are still not ranking on google.com.au pages from Australia. So I would assume that the search engines are not getting the geographic signals from the websites. I do [...]

Read the full article →

SEO Magic – The Impact of Google Supplemental Index on Rankings

June 4, 2011

So many times I have been asked what the significance of Google PageRank is in modern SEO. Good question. Many people think PageRank is not valued any more as Google doesn’t update the metric regularly. As far as I know, Google is updating PageRank, but you are getting the visual representation once or twice [...]

Read the full article →