• WordPress SEO Optimization Techniques

    WordPress SEO Optimization Techniques
    • Home
    • About Me
    • SiteMap
    • Contact
    • SEO Directory
    • WordPress Themes
      • Home > Blogging > WordPress
        • Robots.txt for WordPress SEO

        • - Preventing, out of the box duplicate content -

        • WordPress Robots.txt File EditingThe beginning of WordPress SEO, should start with the optimization process of your WordPress robots.txt file. The reason to do this is to prevent as much duplicate content as possible because "WordPress" out of the box produces a lot of duplicate content.  Having duplicated content can cause the search engines to penalize your site's search engine rankings, PageRank and crawl rate.

          Robots.txt

          The WordPress robots.txt file is used to give the search engines "robots" instructions to follow, when they crawl your WordPress blog. These instructions will tell search engines not to crawl non-relevant files, folders, images and duplicate content. Excluding non-relevant files, such as "/wp-admin/", "/wp-content", "/wp-includes/" will save bandwidth and speed up the search engine crawling process, when they access your site.

          A robots.txt file is a simple text file which can be created with your Windows notepad text editor, and then it is placed in your root domain of your site.

          WordPress robots.txt file example one, when WordPress is installed into the root directory, such as, /.

          # robots.txt for http://www.yourdomain.com/

          #  PARTIAL access (Googlebot)
          User-agent: Googlebot

          Disallow: /*?
          Disallow: /*.css$
          Disallow: /*.inc$
          Disallow: /*.js$
          Disallow: /*.php$
          Disallow: /category/*/*
          Disallow: /comment-page/*
          Disallow: /*/feed/$
          Disallow: /*/feed/rss/$
          Disallow: /*/trackback/$
          Disallow: /wp-

          User-agent: *
          Disallow: /cgi-bin/
          Disallow: /archives/
          Disallow: /category/
          Disallow: /comment-page
          Disallow: /feed
          Disallow: /feed/
          Disallow: /page/
          Disallow: /trackback/
          Disallow: /wp-admin/
          Disallow: /wp-content/
          Disallow: /wp-includes/
          Disallow: /wp-login.php/
          Disallow: /index.php

          # Edited last, on 04-05-2009

          WordPress robots.txt file example two, when WordPress is installed into a sub-directory, such as, yourdomain.com/blog/.

          # robots.txt for http://www.yourdomain.com/

          #  PARTIAL access (Googlebot)
          User-agent: Googlebot

          Disallow: /blog/*?
          Disallow: /blog/*.css$
          Disallow: /blog/*.inc$
          Disallow: /blog/*.js$
          Disallow: /blog/*.php$
          Disallow: /blog/category/*/*
          Disallow: /blog/comment-page/*
          Disallow: /blog/*/feed/$
          Disallow: /blog/*/feed/rss/$
          Disallow: /blog/*/trackback/$
          Disallow: /blog/wp-

          User-agent: *
          Disallow: /blog/archives/
          Disallow: /blog/category/
          Disallow: /blog/comment-page
          Disallow: /blog/feed
          Disallow: /blog/feed/
          Disallow: /blog/page/
          Disallow: /blog/tag/
          Disallow: /blog/trackback/
          Disallow: /blog/wp-admin/
          Disallow: /blog/wp-content/
          Disallow: /blog/wp-includes/
          Disallow: /blog/index.php

          # Edited last, on 04-05-2009

          "User-agent: *" function, which means that all search engines bots/spiders will crawl your site according to your instructions.

          "Disallow: /wp-" function, will exclude the search engines from crawling all files and folders which start with"wp-", therefore prevent the indexing of duplicate content.

          "Disallow: /*?" function, will exclude your index any url that contains a ?

          "$" function, at the end of a file, excludes extensions - example, /*.css$
          Using the .css$, will match all files the end with a .css

          # is used to show comments for reference because search engines spiders do not read lines beginning with #.

          Additional notes:
          * The above robots.txt file configuration focuses on keeping as much pagerank as possible on your main page and article pages.
          * You must make sure that you name your "robots.txt file" as, "robots.txt" and upload it as ASCII into your website's root directory or sub directory.
          * I have the robots.txt file up into two groups, one for User-agent: Googlebot and the other for User-agent: *which is for all bots.
          * You can also install the SearchStatus if you use the FireFox browser. It has many SEO options, and it will let you look at many sites robots.txt files if they are not hidden. This way you can find a nice PR6 site in your niche and see what they are using for a WordPress robots.txt configuration.
          * Make sure that your robots.txt file validates. Most people do not do this, but it is a good idea to make sure it validates. To make sure that your robots.txt validates, please visit "Robots.txt Checker.

          Robots.txt for WordPress SEO





          Submit your content Every Day to 25 social bookmarking sites, all on unique C class IPs... FREE.

          Keyword Winner
          Related Posts:
          1. WordPress 2.7 Creates Duplicate Content by Default
          2. How to remove /category/ from your WordPress URL
          3. Dealing with WordPress “Missed Schedule”
          4. How to Remove, “Comments are Closed” Message in WordPress
          5. WordPress Users, Don’t Get Banned by Ping Services
          6. How to Remove the WordPress Generator
          7. How to reset the admin password in WordPress
          8. Follow these Easy Steps to Customize Your WordPress 404 Page!
          9. Removing the WordPress 3.1 Admin Bar
          10. Easy instructions to add a favicon to your WordPress Blog
        • Posted on 5th April 2009 by Spunky Jones in WordPress
          • Subscribe to My: Free RSS Feed!
      • Article By: Spunky Jones

        Owner of the Madmouse Network, including the Spunky Jones SEO Blog.

        Spunky, has been involved in web page design since 1995 and started working with search engine optimization (SEO) in 1996. My very first Domain name that I registered was madmouse.com, in 1998. Furthermore, the Madmouse, is still alive and with me to this very day.

      • Thank you for visiting the Spunky Jones SEO Blog and reading the following post:
        Robots.txt for WordPress SEO

      • Navigation

        • Blog Authors
        • SEO Optimization Articles
        • SEO Strategy
        • Webmaster World
        • Free Stuff for Visitors
        • Canonical Duplicate Fixer
        • META Language Tag Generator
        • Privacy Policy
      • Today’s – Top Blog Posts

        • Keyword Analysis and Niche Research with...
        • PC CMOS Cleaner - Free BIOS Password...
        • .htaccess Tutorial - How to Create a...
        • Improve Communication in the Workplace...
        • Should You Consider Using Reprint Article...
        • Robots.txt for WordPress SEO
        • Strong Tag versus Bold Tag for SEO
        • NoFollow Links, Aren't Completely Useless...
        • Tracking your website PHP Errors, via the...
        • Increase web traffic by the means of forum...
        • How to reset the admin password in...
        • Spunky Jones: Search Engine Ranking SEO...
        • Madmouse Web Directory, free directory...
        • The Spunky Jones, SEO Experience Online
      • Category Topics

        • Blogging
          • Tips
          • WordPress
            • Install
            • Plugins
            • Security
            • WP Themes
        • Business World
          • Making Money Online
          • Marketing Strategy
        • Promotion
          • Social Networking
          • Web Directories
        • SEO Optimization
          • Google SEO
          • Meta Tag Basics
          • RSS Feeds
          • SEO Tips
            • SEO Strategy
            • SEO Techniques
          • WordPress SEO
        • Spunky Jones
          • Ramblings
        • Technology
          • Computers
            • Ergonomics
            • Hardware
            • Software
            • Windows
        • Webmaster World
          • Alexa Traffic
          • PageRank
          • Search Engines
      • Google Custom Search
    • Home
    • About Me
    • SiteMap
    • Contact
    • SEO Directory
    • WordPress Themes
    Copyright © 2008 - 2012 www.spunkyjones.com
    Designed by: Spunky Jones | Coded by: PSD to HTML
    leave a reply