|
|
Robots txThe Importance of Robots txAlthough the robots tx file is a very important file if you want to have a good ranking on search engines, many fail to include this file in their root directory. This is a mistake in most cases. If you have a robots tx file, it may not be compiled correctly. A recent study showed out of 15,000 websites, 56% had errors in there robot.txt file. If you would like to validate a robots.tx file try the robots.tx syntax checker for free. If you are not comfortable writing a robots.tx file yourself try RoboGen , it is a hndy program that generates robots.tx files automatically. I would suggest your read this article anyway to gain a basic understanding of how this file works, and evaluate if you need one. What is Robots tx? Quite simply, it is a file that directs search engine spiders to pages and files you want indexed and don't want indexed..It is a simple text file that belongs in your root directory on your server. How do I create a Robots tx file? As mentioned above, the robots tx file is a simple text file. Open a simple text editor to create it.A record contains the information for a special search engine. Each record consists of two fields: the user agent line and one or more Disallow lines. Here's an example: User-agent: slugbot This robots tx file would allow the "slugbot", which is a fictitious spider, to retrieve every page from your site except for files from the "cgi-bin" directory. All files in the "cgi-bin" directory will be The Disallow command works like a wildcard. If you enter User-agent: slugbot both "/support.html" and "/support/index.html" as well as all other files in the "support" directory would not be indexed by search engines. Everything behind the directory you disallow will be disallowed. This is very important to remember.If you leave the Disallow line blank, you're telling the search engine that all files may be indexed. In any case, you must enter a Disallow line for every User-agent record. If you want to give all search engine spiders the same rights, use the following robots tx content:
Disallow: /cgi-bin/ However this is not advisable. There are many malicious spiders that harvest personal information for spam. Here is a good list of bot's to disallow.
If you don't format your robots tx file properly, some or all files of your Web site might not get indexed by search engines. To avoid this, do the following:
Tips and tricks: 1. How to allow all search engine spiders to index all files Use the following content for your robots tx file if you want to allow all search engine spiders to index all files of your Web site: User-agent: * 2. How to disallow all spiders to index any file If you don't want search engines to index any file of your Web site, use the following: User-agent: * 3. Where to find more complex examples. If you want to see more complex examples, of robots tx files, view the robots tx files of big Web sites: |
Information27 Tips to Top Search Engine RankingsUsing Links For Improved Search Engine Ranking10 Common Webmaster MistakesHow Do Search Engines Work?Is Your Website Optimized To Sell?Doorway Pages- Good Advertising? More Web Site Traffic With DMOZ More Web Site Traffic With Yahoo Wore Web Site Traffic With Inktomi More Web Site Traffic With Google
ToolsFree Link Popularity CheckFree Meta Tags GeneratorFree Search Engine Ranking Tool |