THE ROBOTS.TXT FILE
You know that search engines have been designed to aid folks discover details speedily on the Net, and the search engines obtain a lot of their information via robots (also recognized as spiders or crawlers), that appear for net pages for them.
Write-up Advertising and marketing Specialists
100% Manual Submission Writing At Per 500 Words for Totally free http://www.MagicSeoBot.com
The spiders or crawlers robots discover the internet looking for and recording all sorts of details. They normally start off with URL submitted by users, or from hyperlinks they uncover on the internet websites, the sitemap files or the top level of a web site.
As soon as the robot accesses the home page then recursively accesses all pages linked from that page. But the robot can also verify out all the pages that can uncover on a certain server.
After the robot finds a web page it operates indexing the title, the search phrases, the text, and so on. But occasionally you may possibly want to stop search engines from indexing some of your net pages like news postings, and specially marked net pages (in example: affiliate´s pages), but no matter whether individual robots comply to these conventions is pure voluntary.
ROBOTS EXCLUSION PROTOCOL
So if you want robots to preserve out from some of your internet pages, you can ask robots to ignore the internet pages that you don´t want indexed, and to do that you can place a robots.txt file on the nearby root server of your net web site.
In example if you have a directory known as e-books and you want to ask robots to keep out of it, your robots.txt file need to study:
User-agent: * Disallow: e-books/
When you don´t have sufficient control more than your server to set up a robots.txt file, you can try adding a META tag to the head section of any HTML document.
In example, a tag like the following tells robots not to index and not to comply with links on a specific page:
meta name=”ROBOTS” content=”NOINDEX, NOFOLLOW”
Assistance for the META tag among robots is not so frequent as the Robots Exclusion Protocol, but most of key internet indexes currently help it.
If you want to keep the search engines out of your news postings, you can create an an “X-no-archive” line in of your postings’ headers:
Totally free Plr Articles
High Quality 140000 Plr Articles For Free Now http://www.Free of charge-Plr-Post.com/
But though typical news customers, enable you to add an X-no-archive line to the headers of your news postings, some of them don´t permit you to do so.
The difficulty is that most search engines assume that all data they locate is public unless marked otherwise.
So be careful simply because even though the robot and archive exclusion standards may possibly assist keep your material out of significant search engines there are some others that respect no such rules.
If you happen to be hugely concerned about the privacy of your e-mail and Usenet postings, you need to use some anonymous remailers and PGP. You can read about it right here: