Why (and how) to Stop Google from Caching Your Pages

Did you know…

  • People can see pages on your site that were up months ago without you knowing?
  • When you delete a page from your site, it can still be seen for months?
  • People can look through your site without showing up in your analytics?
stop caching

Learn to stop showing your website's cache.

This is all possible through the magic of Google’s caching feature.  And although website caching on Google has some benefits, it may not be a good fit for your site.  Let me tell you why (and how) I decided to stop having a cached version of my site on Google.

I’ve been watching my traffic a lot more closely lately, especially 404 errors.  I’ve noticed a lot of very funky stuff going on, especially with people (or bots) coming from cached versions of my site.  When I get a peek at some of the search terms that these people used to get to my site, it is interesting what kinds of things you learn.

If you aren’t sure what Website Caching is (in terms of search engines), check out this explanation from Google’s site:

Google automatically takes a “snapshot” of each page it crawls and archives it. This “cached” version allows a webpage to be retrieved for your end users if the original page is ever unavailable (due to temporary failure of the page’s web server). The cached page appears to users exactly as it looked when Google last crawled it, and we display a message at the top of the page to indicate that it’s a cached version. Users can access the cached version by choosing the “Cached” link on the search results page.  (As seen here)

So the only benefit of caching as far as I can tell is that a copy of your site is available in case your server goes down.  My server doesn’t go down much, so that isn’t a big bonus.  But the downsides seem to outweigh the good.

Here a little video about Caching…

I don’t want people to be able to see cached versions of my site. The cached versions could be pages or posts you deleted, are out of date, or you removed from your site for one reason or another.  I figure: If I took the pages down already, I don’t want Google showing them to people for months (years?) to come.

I would also like to know when people are looking at my site. Usually Google Analytics will give me this data, but if people are looking at cached versions of my site, then they never get counted.  They can browse my whole site (and your whole site) without you knowing about it.

So I did some searching on how to stop Google (and the other search engines) from showing a cached version of your page.  All you have to do is add:

<META NAME=”ROBOTS” CONTENT=”NOARCHIVE” />

to your header.  Supposedly it’s as easy as that.

So, I just added it today and of course Google has to come back through and re-index each of these pages to see the change, but I’m excited to see how this works.

Your Turn:

Do you have any experience with web caching you’d like to share?  Do you have questions about adding the meta to your site?  Are you aware of something that might happen without a cached version I didn’t think about?  Add it to the comments below…

PS:  If you liked what you saw about Thesis Theme, check out my Thesis Theme discount.

Want More out of WordPress?

I can show you how to get more traffic, make WordPress do amazing things, and how to make more money with your website by converting your visitors into customers. Click the button to the right to get a free estimate.

{ 1 trackback }

Prevent Google from Caching Your Site Pages | Shark Lady Tech
January 24, 2010 at 1:50 am

{ 26 comments… read them below or add one }

1 Rachel (Hounds in the Kitchen) January 24, 2010 at 5:37 pm

Makes sense to me. I just added the tag to my wordpress site.

Reply

2 Web Development January 27, 2010 at 1:53 am

I hate how we have to add tags to our pages – because of third party sites – this sort of stuff should be kept inside robots.txt or similar to prevent bandwidth waste.

Reply

3 rawwebdesign January 27, 2010 at 9:09 am

I agree completely. But that's the
world we live in.

Reply

4 rawwebdesign January 27, 2010 at 9:22 am

Good point! Forgot about that…

Thanks.

Reply

5 angeliqueandfriends January 27, 2010 at 10:33 am

How do you do this for WordPress?

Reply

6 rawwebdesign January 27, 2010 at 10:43 am

With thesis you just add it to your
header space, without thesis you
have to edit your header.php file
to include it.

Reply

7 angeliqueandfriends January 27, 2010 at 2:26 pm

I don't see a way to add it to my header space without editing the .php file. I have Thesis, but the only option I have under header is to change the title tag and add no-index to pages. My custom file editor is for CSS.

Reply

8 rawwebdesign January 27, 2010 at 2:28 pm

Did you watch the video? It shows
how to in there. It's under Thesis
Options near the bottom on the left
hand side…

Reply

9 angeliqueandfriends January 27, 2010 at 2:31 pm

That's what I get for reading this on my phone! I didn't realize it was a video. Duh. I'll go watch it!

Reply

10 rawwebdesign January 27, 2010 at 2:33 pm

how does it look on your phone
anyways? (the site i mean…)

Reply

11 angeliqueandfriends January 27, 2010 at 4:08 pm

Okay, watched the video and added the code!

I'm trying to look at your site on my phone again (I have a Blackberry through Verizon) but I can't connect with it right now. It says “the server may be busy,” but since your website is working perfectly the problem is THEIR server. They are just too embarrassed to admit it. I'll post again when I can see it.

Reply

12 rawwebdesign January 28, 2010 at 8:59 am

Awesome. Glad it worked for ya.
RSS this site and I promise there
will be lots of good stuff coming.

-brad

Reply

13 angeliqueandfriends January 28, 2010 at 9:10 am

Don't see an RSS button anywhere (except to subscribe to blog comments.)

Reply

14 rawwebdesign January 28, 2010 at 9:31 am

Oh well I'm going to have to add
that for sure. Thanks!

-brad

Reply

15 SEO Sydney February 12, 2010 at 10:23 pm

I have a question for you:
why go certain pages are indexed in google but have no cache? also why they do not apear in search results?
what can be done about it so it will have cache?
by the way <META NAME=”ROBOTS” CONTENT=”NOARCHIVE” />
was not in any pages. see examples.
site:www.searchenginepro.com.au/SEM
look at pages 60-100 mostly nocahce.

Reply

16 rawwebdesign February 13, 2010 at 10:53 am

That's a good question. I have no
idea why they wouldn't cache your
site. If you consider it a
problem, get more links. Getting
more inbound links cures all SEO
ails IMHO.

brad

Reply

17 Jonathan Boettcher March 27, 2010 at 5:45 pm

Sweet – excellent tip. I’ve been hacking back on what the Goog has access to anyways (old landing pages etc that I don’t want competitors seeing) so this fits perfectly with that.

Reply

18 ihategooglecache April 29, 2010 at 11:40 am

I hate google cache. I just recently found out that it does cache on myspace profiles. It shows all of the stuff you wanted to hide and makes it public. is there anyway i can stop that from happening?

Reply

19 admin April 29, 2010 at 3:24 pm

I don’t know of any way to stop caching of Myspace. That’s one of many reasons why nobody should use myspace. Haha.

But seriously, don’t put anything secret on Myspace. Reserve that for diaries and pillow talk.

Reply

20 ihategooglecache May 6, 2010 at 11:58 am

that doesn’t give me much hope. Who knows how long it’s been caching on my myspace and what I’ve posted. LOL.. too late now to worry. Thanks for the advice.

Reply

21 pedro June 21, 2010 at 5:18 am

hi, when i type in my user name in google it shows up sites that i’v visited or that im a member on, this info is private to me and i dont want people seeing what sites im on when they type in my user name, can i get this info removed or will it only stay displayed on google for a set time

Reply

22 admin June 21, 2010 at 6:20 am

Pedro,

What you are talking about is a bit different than the caching issue, but still a big problem for some people. I would check out this link: http://www.google.com/support/accounts/bin/answer.py?answer=39612 and change your privacy settings on Google.

Best,
Brad

Reply

23 Cheryl July 7, 2010 at 9:12 am

I’ve got cached pages showing up from a website we took down a year ago. How do I get rid of those since there’s no header to add the tag to?

Reply

24 admin July 7, 2010 at 9:46 am

That’s a very good question. You may be able to submit a takedown request from Google, but I’m not sure. If you find out, let me know please.

Reply

25 Dan Random August 18, 2010 at 9:50 am

In Thesis 1.7, this feature is built-in under Thesis Site Options/Robots Meta Tags/No Archive Tag, it looks like this http://bbqshirt.org/wp-content/uploads/2010/08/no-archive.jpg

Dan

Reply

26 Sheet Metal Fabrication August 24, 2010 at 6:55 pm

How I can get rid of cached pages showing on my website pages. Why Google show this message at the top of the site some time. Is it due to failure of searching of the page or something else? I love the Google search and no doubt it is wonderful and quick but tell me the solutions. Please!

Reply

Leave a Comment

Previous post:

Next post: