Jul 10, 2008
No Comments
*Rather than use a robots.txt file to block crawler access to pages, you can add a <META> tag to an HTML page to tell robots what to do.
To prevent all robots from indexing a page on your site, you’d place the following meta tag into the <HEAD> section of THAT PAGE: <META NAME=”ROBOTS” CONTENT=”NOINDEX, NOFOLLOW”> To allow other robots to index the page on your site, preventing only Google from indexing that page, you’d use the following meta tag into the <HEAD> section of THAT PAGE: <META NAME=”GOOGLEBOT” CONTENT=”NOINDEX, NOFOLLOW”> To allow robots to index the page on your site but instruct them not to follow outgoing links, you’d use the following tag: <META NAME=”ROBOTS” CONTENT=”NOFOLLOW”>