Think About Search
Search Engine Marketing Blog

Google's form filling bot a benefit to some, scares others.

Print the article

This entry was posted on 4/22/2008 5:08 PM and is filed under Google.

Google's form filling bot a benefit to some, scares others.

Kevin Heisler's article in SEW points out a dilemma that Google faces; in an attempt to homogenize the desires and intents of the masses, they will please some while angering, annoying or frightening others. I am not nearly as bothered by this as Kevin.

As this question popped up in some communications in my company, my response was...

"Google has been inundated with questions as to why pages are not showing up in the index, only to explore the issue and find out that the only way to get to the pages in question is to submit a form of some type. The most obvious is corporate home pages where the user has to select the country / region in a drop down (Matt's example). Until this new release, Google couldn't crawl the pages from the home page. Other examples include product selection, category information where you have to tell the site, via a form, what you want. Web masters and publishers have be frustrated by their in ability to get a lot of content indexed because managing it requires data driven applications and the use of forms. This is Google's attempt to rectify the problem.
 
For those really worried about this, blocking the bot from sub pages can be done."


Matt Cutts has a good post on this.

I think another aspect of blocking the bot is the robot.txt. As Matt says, "If you’d prefer that Google not crawl urls like this, you can use robots.txt to block the urls that would be discovered by crawling through a form."

These URLs should probably be part of the robot.txt file anyway. But if not, this should not be too arduous a task to add them.

Any way, like so many other "things" Google, this seems bigger at first than it will in hind sight.

 del.icio.us  Stumbleupon  Technorati  Digg 

 

What did you think of this article?




Trackbacks
Trackback specific URL for this entry
  • No trackbacks exist for this entry.
Comments
    • No comments exist for this entry.
Leave a comment

Submitted comments will be subject to moderation before being displayed.

 Enter the above security code (required)

 Name

 Email (will not be published)

 Website

Your comment is 0 characters limited to 3000 characters.