Antezeta Web Marketing

Reflections on search engine optimization, web analytics and web marketing

Keep out: an often overlooked part of Search Engine Optimization

by sean

Rare is the web professional who doesn’t know that building a great website usually isn’t enough to guarantee its success. Sites have to be visible in search engines for the keywords and phrases that searchers are most likely to associate with the site’s content. An entire industry has grown up around search engine optimization (SEO). Yawn, you say.

What about the reverse side of the coin, keeping content out of search engines? Should be easy, no? Maybe not. In February, we looked at 5 ways to stop Google and the other search engines from downloading and indexing a website’s pages.

Unfortunately, it appears that the folks behind the personal lubricant Astroglide didn’t understand the implications of leaving sensitive customer data on a public web server. They, and their customers, found out that search engines can be all too effective at finding content, as long as it sits in a public area and there is a public link to it! Too bad Astroglide blamed Google rather than admitting its own error.

The Astroglide case isn’t an isolated incident. Copiepresse, a consortium of French- and German-language press in Belgium, has been battling the search engines to keep Belgian news out of search results, all while blissfully ignoring the circa 1996 robots.txt protocol. It does appear that Copiepresse is finally making progress in learning how to manage which content appears in search engines using existing web conventions.
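For readers who haven’t worked with robots.txt, here is a minimal sketch of how a well-behaved crawler consults it, using Python’s standard urllib.robotparser module. The robots.txt content, the example.com URLs, and the is_allowed helper are all hypothetical, made up for illustration; they are not taken from Copiepresse or any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an example site; the paths are illustrative only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /archive/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/news/today.html", "/archive/2006/story.html"):
        url = "https://www.example.com" + path
        print(path, is_allowed(ROBOTS_TXT, "Googlebot", url))
```

Keep in mind that robots.txt is a request, not an access control: it only keeps out crawlers that choose to honor it. Sensitive data of the kind exposed in the Astroglide case still belongs behind authentication or off the public server entirely.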

As of last week, there is an additional tool available to manage content in Yahoo!. By adding a class="robots-nocontent" attribute to an HTML tag, a webmaster can specify that the content within that tag shouldn’t appear in Yahoo!. As we note in our related class="robots-nocontent" article, we welcome the additional granularity this option offers for specifying how a search engine indexes a page. Unfortunately, its value will be limited until all the major search engines adopt it or a similar syntax.
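To make the idea concrete, the sketch below marks one block of a page with class="robots-nocontent" and then simulates, with Python’s standard html.parser module, which text would remain if a crawler skipped the marked elements. This is only an illustration of the intended effect, not Yahoo!’s actual implementation; the sample page, the NoContentFilter class, and the indexable_text helper are all assumptions made up for this example.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}

# Hypothetical page: the inner <div> is marked as content to leave out.
PAGE = """
<div>
  <p>Main article text.</p>
  <div class="robots-nocontent">
    <p>Navigation and ad boilerplate.</p>
  </div>
  <p>Closing paragraph.</p>
</div>
"""


class NoContentFilter(HTMLParser):
    """Collects text that is not inside a robots-nocontent element."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # > 0 while inside a marked element
        self.kept = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        # Start skipping at a marked tag; keep counting nested tags inside it.
        if self.skip_depth or "robots-nocontent" in classes:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self.skip_depth:
            self.kept.append(text)


def indexable_text(html: str) -> str:
    """Return the text left after dropping robots-nocontent sections."""
    parser = NoContentFilter()
    parser.feed(html)
    parser.close()
    return " ".join(parser.kept)


if __name__ == "__main__":
    print(indexable_text(PAGE))
```

Run against the sample page, the filter keeps the two ordinary paragraphs and drops the marked block. The sketch assumes reasonably well-formed markup; a real crawler would need far more forgiving parsing.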

So have you protected your sensitive content from search engine crawlers?

Originally published May 6th, 2007

