How to Block Google from Crawling and Indexing Your Webpages

Sponsored by: www.pagetron.com


To block Googlebot from crawling and indexing your site, create a robots.txt file that blocks the URL of a single page or of several pages of your site.


There can be a number of reasons why you do not want Google to index and cache a particular page of your site. To keep a particular page from showing up in Google, you have to write a file called robots.txt and upload it to the root directory of your web server. Note that robots.txt stops compliant crawlers from fetching the page; a blocked URL can still be listed without a description if other pages link to it. In general, to block a webpage or several pages from being crawled and indexed by Google, follow these steps:

1. Create a blank robots.txt file, which is just a plain text file.
2. Write the commands that block the URL of the page you want to block.
3. Upload the file to the root of your web directory. On many hosts this is the public_html folder.
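The first two steps can also be scripted. The following is a minimal Python sketch, not a required part of the process: the helper names `build_robots_txt` and `write_robots_txt` and the path `/page-to-block` are hypothetical and chosen only for illustration. The upload step is left out because it depends on your hosting provider.

```python
from pathlib import Path


def build_robots_txt(blocked_paths, user_agent="*"):
    """Return robots.txt text that disallows each path for the given user agent."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in blocked_paths]
    return "\n".join(lines) + "\n"


def write_robots_txt(directory, blocked_paths):
    """Write robots.txt into `directory` and return the path of the new file."""
    target = Path(directory) / "robots.txt"
    target.write_text(build_robots_txt(blocked_paths), encoding="utf-8")
    return target


if __name__ == "__main__":
    # "/page-to-block" is a placeholder path, not a real page.
    print(build_robots_txt(["/page-to-block"]), end="")
```

The file produced this way is the same plain text file you would create by hand; it still has to be uploaded to the root of your web directory.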


On your computer, open Notepad, name the new file robots, and save it. Your computer will automatically save it as robots.txt.

Now write the necessary commands. For example, suppose the URL of your website is www.pagetron.com, you have a post on it with the URL www.pagetron.com/how-to-blog, and you want to block this page from being indexed by all search engines. In that case, use the following instructions:

Create a robots.txt


Add the following two lines to the robots.txt file that you have created.

User-agent: *
Disallow: /how-to-blog

Then save the file. If you want, you can also add the sitemap URL of your blog at the end of the file with a Sitemap: line.

The line User-agent: * means that the rule applies to every search bot, so you are telling all of them to avoid the URL in the Disallow command.
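To see the effect of User-agent: * before uploading anything, the rules can be checked locally. The sketch below uses Python's standard urllib.robotparser module. The helper `is_allowed`, the https:// scheme, the bot name SomeOtherBot, and the /about page are assumptions made for the example. The parser in the standard library may differ from Google's own matcher in edge cases, so treat the result as a sanity check rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser

# The two lines from the example, exactly as they would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /how-to-blog
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if `user_agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://www.pagetron.com/how-to-blog"  # scheme assumed for the example
    for bot in ("Googlebot", "Bingbot", "SomeOtherBot"):
        # Every bot falls under the "*" group, so each one is refused.
        print(bot, is_allowed(ROBOTS_TXT, bot, page))
```

Because the group is addressed to *, the answer is the same for every bot name, while pages that are not listed in a Disallow line stay allowed.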


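The Disallow command states the URL that will be blocked. You do not need to put the entire URL after the Disallow command. Write only the part of the URL that comes after your website's domain name, and do not forget the leading slash (/). Add a trailing slash only if the URL of the page ends with one: rules are matched from the start of the path, so Disallow: /how-to-blog/ covers only URLs under /how-to-blog/ and would not match www.pagetron.com/how-to-blog itself. So, for the example page, the Disallow section of the robots.txt will look like the following command:

Disallow: /how-to-blog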
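Whether the trailing slash matters can be checked locally. The sketch below again relies on Python's standard urllib.robotparser, which matches a Disallow value as a prefix of the URL path. The helper `blocked` and the paths /how-to-blog/part-2 and /how-to-blogging are hypothetical and used only to show the matching behavior.

```python
from urllib.robotparser import RobotFileParser


def blocked(disallow_path, url_path, user_agent="Googlebot"):
    """Return True if a single 'Disallow: <disallow_path>' rule blocks `url_path`."""
    parser = RobotFileParser()
    parser.parse(["User-agent: *", f"Disallow: {disallow_path}"])
    return not parser.can_fetch(user_agent, url_path)


if __name__ == "__main__":
    cases = [
        ("/how-to-blog", "/how-to-blog"),          # rule without trailing slash, exact page
        ("/how-to-blog", "/how-to-blog/part-2"),   # same rule, a URL below the page
        ("/how-to-blog", "/how-to-blogging"),      # same rule, a longer name with the same prefix
        ("/how-to-blog/", "/how-to-blog"),         # rule with trailing slash, exact page
        ("/how-to-blog/", "/how-to-blog/part-2"),  # rule with trailing slash, a URL below it
    ]
    for rule, path in cases:
        print(f"Disallow: {rule:<15} {path:<22} blocked={blocked(rule, path)}")
```

Under prefix matching, the rule without the trailing slash also blocks any other URL that begins with the same characters, and the rule with the trailing slash does not cover the page at /how-to-blog itself, so it is worth checking the exact URL of the page before choosing the form.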


Thank you!!!!
