O'Reilly Google Hacks, 1st Edition


Check your frames. If you use frames, you might be missing out on some indexing. Google recommends you read Danny Sullivan's article, "Search Engines and Frames," at http://www.searchenginewatch.com/webmasters/frames.html. Be sure that Google can either handle your frame setup or that you've created an alternative way for Google to visit, such as using the NOFRAMES tag.

Consider your dynamic pages. Google says they "limit the number of amount of dynamic pages" they index. Are you using dynamic pages? Do you have to?

Consider how often you update your content. There is some evidence that Google indexes popular pages with frequently updated content more often. How often do you update the content on your front page?

Make sure you have a robots.txt file if you need one. If you want Google to index your site in a particular way, make sure you've got a robots.txt file for the Google spider to refer to. You can learn more about robots.txt in general at http://www.robotstxt.org/wc/norobots.html.

If you don't want Google to cache your pages, you can add a line to every page that you don't want cached. Add this line to the <HEAD> section of your page:

<META NAME="ROBOTS" CONTENT="NOARCHIVE">

This will tell all robots that archive content, including engines like Daypop and Gigablast, not to cache your page. If you want to exclude just the Google spider from caching your page, you'd use this line:

<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
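If you maintain many pages, you may want to confirm that the NOARCHIVE line actually made it into each one. The following is a minimal sketch in Python, not part of the original hack: it uses the standard library's html.parser to report which robot names a page addresses with NOARCHIVE. The class name NoArchiveFinder, the function noarchive_robots, and the sample page are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser


class NoArchiveFinder(HTMLParser):
    """Collect the robot names whose META tag carries NOARCHIVE."""

    def __init__(self):
        super().__init__()
        self.robots = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names; values keep their case.
        if tag != "meta":
            return
        fields = {key: (value or "") for key, value in attrs}
        # CONTENT may hold several comma-separated directives.
        directives = {d.strip().upper() for d in fields.get("content", "").split(",")}
        if "NOARCHIVE" in directives:
            self.robots.add(fields.get("name", "").upper())


def noarchive_robots(html):
    """Return the set of robot names (uppercased) told not to cache this page."""
    finder = NoArchiveFinder()
    finder.feed(html)
    return finder.robots


if __name__ == "__main__":
    # Hypothetical sample page using the Google-only form of the tag.
    sample = '<html><head><META NAME="GOOGLEBOT" CONTENT="NOARCHIVE"></head></html>'
    print(noarchive_robots(sample))
```

A result containing ROBOTS means the page asks every archiving robot not to cache it; a result containing only GOOGLEBOT means the request is limited to Google's spider. An empty set means no NOARCHIVE line was found. This only checks that the tag is present in the markup; whether a given engine honors it is up to that engine.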

