Archive for December, 2006

Understanding the Google cache

Thursday, December 28th, 2006

Google and other search engines cache pages, storing copies of them that they use for indexing. You can think of the cache as a snapshot of a page at a moment in time.

You can examine the cached copy of a particular page by creating a query like this:

http://www.google.com/search?q=cache:http://www.web1marketing.com

You can look at any page by placing it immediately after “cache:”. You will then see the time and date at which Google stored the cached copy as well as how the page appears to Google.

The purposes of inspecting the cache include the following:

  • If a page is in the cache, then the search engine has been able to find it and has indexed it.
  • You can examine what the search engines see of a particular page to verify that content is visible (and not hidden by CSS, JavaScript, etc.).
  • You can see how recently Google has taken a snapshot of that page and thus get an idea of how likely it is that recent updates have been “noticed.”

Some important caveats about the cache:

  • Just because a page is cached doesn’t mean it’s easy to find or going to rank well.
  • It takes time for some pages to be found and cached. A page that isn’t present may be too new for Google to have found and cached. However, if you know your site is frequently visited by Google and hasn’t been cached in a reasonable period of time (several weeks), make sure your links to that page can be found and followed by search engines. More on this.
  • There is a (considerable) delay between content updates, caching, indexing, and PageRank changes. The date of the cached copy does not mean that the search rankings or PageRank values reflect that version of the content. There is always a lag.
  • Just because you see it in the cache doesn’t mean that the search engines are reading it exactly as you would expect. Google — and other search engines — purposefully are vague about how they actually look at and rank pages so that it’s hard to take advantage of their algorithms. The cache may draw images from your current web server.
  • A page may be found by Google in various ways. Someone else may have linked to it in order for Google to find it. Presence of a cache doesn’t mean that you have a great navigation scheme.

Note that almost all of the above information applies to the other major search engines.

Using Google Webmaster Tools

Wednesday, December 20th, 2006

Google’s Webmaster Tools shed light on how your website is seen by Google and do various indexing-related tasks.

Like the Yahoo Site Explorer we described earlier,  the best functionality is limited to those who can upload a file and thus “verify” themselves as site owners. Once you have done so, you’ll be able to do the following:

  • Tell Google about pages and updates using sitemaps.
  • See when the site was last indexed.
  • See what pages have been indexed and where Google found errors.
  • Examine the robots.txt file.
  • Adjust the crawl rate of your site by Google.
  • Specify URL canonicalization (http://web1marketing.com versus http://www.web1marketing.com).
  • Enable advanced image search.
  • View summaries of your site’s PageRank.
  • Examine top search queries that led people to your site.

The Google Webmaster Tools are quite useful for diagnosing indexing errors as well as providing insight into how people find your site.

Using Yahoo Site Explorer

Wednesday, December 20th, 2006

Yahoo Site Explorer is a handy tool for examining how Yahoo has indexed a website and what sites link back to it. It is similar to Google’s Webmaster Tools.

The tool can be used to show the indexed pages in a domain and all subdomains as well as “inlinks” (in-bound links) to the page or domain.

If you are authenticated for a particular website, you have access to much more information. Authentication requires placement of a file with a special code on that website in order to confirm that you do in fact have access to the site source. Authentication then provides several new features:

  • Monitor and manage the feeds you have submitted for the site.
  • See when crawlers last visited the site and updated the index.
  • Resubmit your feed to let Yahoo know when it has been updated.
  • See what subdomains are known to Yahoo!
  • View additional metadata about the URLs from your site.

This data can be used to check if content is being indexed, how often it is being visited, and who is linking to it.

New link-finding tool

Wednesday, December 13th, 2006

We have launched a new link-finding tool that helps you quickly find websites that accept link requests. Simply enter a keyword or topic, then click “Go”.  You’ll be provided with a long list of possible queries for various search engines that will return lists of targeted websites that you can submit your site to for back-links.  Because each search includes your keywords, the order of results provided by each search engine roughly reflects the value of those websites for that keyword.  In other words, the list of websites in the search results is already prioritized for you.

Start finding backlink opportunities.

Google-bombing for a cause

Monday, December 4th, 2006

A hate-mongering site is Google-bombing the keyword “Martin Luther King,” so we are defending it by pointing it to where it should: the official King Center website.  Martin Luther King.