  May 18, 2012

 
The Proper Way To Use The robots.txt File

When optimizing a web site, most webmasters don’t consider the robots.txt file, yet it is an important file for your site. It tells spiders and crawlers which parts of the site they may and may not crawl. This is helpful for keeping them out of folders that you do not want indexed, such as the admin or stats folder.

Here is a list of the fields that you can include in a robots.txt file and their meanings:

  1. User-agent: Names the robot that the access policy applies to, or “*” for all robots (see the examples below).
  2. Disallow: Names a file or folder to exclude from the crawl, one path per line.
  3. #: Starts a comment; the rest of the line is ignored.
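
As a rough illustration of how these three elements fit together, the Python sketch below splits a robots.txt body into field and value pairs, discarding anything after a #. The function name parse_directives and the sample text are made up for this example, and real crawlers do more than this (for instance, grouping rules by User-agent).

```python
# Hypothetical sample: a comment line, a User-agent line and a Disallow line.
SAMPLE = """\
# Keep every robot out of the stats folder
User-agent: *
Disallow: /stats/   # trailing comments are ignored too
"""


def parse_directives(text):
    """Return (field, value) pairs from a robots.txt body, skipping comments."""
    pairs = []
    for raw in text.splitlines():
        # Everything from '#' to the end of the line is a comment.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)
        # Field names are case-insensitive, so normalize them to lowercase.
        pairs.append((field.strip().lower(), value.strip()))
    return pairs


print(parse_directives(SAMPLE))
# [('user-agent', '*'), ('disallow', '/stats/')]
```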

Here are some examples of a robots.txt file:

User-agent: *
Disallow:

The above would let all spiders index all content.

Here is another:

User-agent: *
Disallow: /cgi-bin/

The above would block all spiders from indexing the cgi-bin directory.

User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/

In the above example, googlebot can index everything, while all other spiders are blocked from admin.php and from the cgi-bin, admin, and stats directories. Notice that you can block single files like admin.php.
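
One way to check rules like these before uploading the file is Python's standard urllib.robotparser module. The sketch below feeds it the last example and asks what each spider may fetch; the helper name build_parser and the agent name SomeOtherBot are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# The last example from the article, with one record per spider group.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/
"""


def build_parser(text):
    """Load robots.txt rules from a string instead of fetching a URL."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# googlebot has an empty Disallow, so nothing is blocked for it.
print(parser.can_fetch("googlebot", "/admin.php"))       # True

# Any other spider falls under the "*" record.
print(parser.can_fetch("SomeOtherBot", "/admin.php"))    # False
print(parser.can_fetch("SomeOtherBot", "/stats/today"))  # False
print(parser.can_fetch("SomeOtherBot", "/index.html"))   # True
```

Keep in mind that this only shows how a compliant parser reads the file; a spider that ignores robots.txt is not stopped by it.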

About the Author
Jimmy Whisenhunt is the webmaster at VIP Enterprises http://www.vipenterprises.org

vipenter@vipenterprises.org