Frequently Asked Questions

User FAQ

Why would I use Follow That Page?

Follow That Page monitors web pages and notifies you via email when a page has changed.

  • Find new jobs, news, prices, and any other changing information on any website.
  • Monitor your own website. You will know when your website has been changed or damaged within one hour, before your boss or customers know.
  • Integrate Follow That Page into your website, so that your customers can follow it. Follow That Page brings your visitors back to you! Read more.
  • Check the Google Pagerank for your (or any other) website, to see if your website is getting more popular.

How does it work?

See the demo video. The demo video does not show you filtering; we’ll explain that below.

How often does Follow That Page check pages?

Per user, it can do 100 daily checks and 1 hourly check. You can enter any number of pages you like, but after checking 100 pages (in the daily run) it will stop and leave the rest for the next day. So, if you entered more than 100 pages, it will check all pages you entered, but it will take more than one day to do it. The same for hourly pages: if you set 5 pages to hourly checking, it will take 5 hours to check all of them, because it does only one check per hour.

The daily run starts at 8:00 AM Central European time. I’m working on a paid version of this service, which can monitor 1000 pages per day and 50 pages per hour per user.

When you enter a new page, it will be checked within a few minutes for the first time.

What happens when a page has changed?

Follow That Page will send a report of the changes to your email address. You can see an example here.

How does the filter work?

The filters will help you to tell our robot which parts of a page are not interesting to you. Weblogs, for instance, often contain phrases like “five comments”, and every time it changes, you will get a report that is not very useful.

With the line filter, it is easy to ignore certain lines. You just give a keyword or keyphrase (like “comments”), and every line that contains that keyword is ignored. You can also revers the filter: if you only want to know when the price of an article changes, use keyword “price” and select “Ignore lines without these keywords”. You can use more than one keyword or keyphrase. Put them on separate lines.

The block filter works similarly. You use a block filter to ignore a large block of text; more than one line. This is useful to filter out text advertisements or random pieces of text. (You don’t have to filter out Google ads: our robot ignores them anyway). You select two block markers, one for the beginning and one for the end of the block. These markers are pieces of text that usually don’t change on the page. Then you choose whether you want the text between the markers to be ignored, or the other way round: that everything is ignored except the text between the markers.

If you use only one block marker and leave the other blank, it uses the beginning (or end) of the page.

Tip: To know what keywords to use, check the preview that is below the form with the page and filter settings.

Why do you ask for my name?

I use it to look up your user account, in case you don’t remember with which email address you registered. I will never give out your personal information to anyone.

How about my privacy?

I never give out you personal information or email address to others. See the terms of use.

If you are concerned that others may be watching your internet traffic, you can use our secured server, https://www.followthatpage.com. You will get an error that the certificate belongs to alkmaar.xs4all.nl, or that it was self-assigned. That’s because I have no budget yet for a real certificate from a certificate authority. You can safely ignore this warning (but please don’t ignore such a warning on websites where you do financial transactions!). I don’t use the certificate for authenticating this website, but only for encrypting the website traffic.

Can I follow non-English pages?

Follow That Page handles most character sets. But the old Simple Chinese character set GB2312 is not supported, and there might be other exceptions. If Follow That Page sends you unreadable emails, please send me the page address. You can check yourself which character set a page uses, by viewing the page source and looking for a line like this:

<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

So, the page containing this line uses UTF-8 as a character set.

How does Follow That Page handle JavaScript?

The Follow That Page robot does not execute JavaScript, Java or Flash. I believe it is good policy to put content in plain HTML or XML and not have it depend on JavaScript or other special techniques.

What happens if a website is down? When the page is not found (HTTP 404 error)?

You will be notified of the error. When there is a change in the situation, you will be notified again. When the site is up again, Follow That Page will compare the content to the last version from before the website broke down.

You're missing some changes! (I'm not getting any emails!)

If you were expecting change reports but didn’t get them, there are some possible causes:

  1. The change was undone before our robot saw it.
  2. You are using a filter to ignore certain parts of the page. Use the preview to check.
  3. The webmaster has forbidden our robot to access the page. Use the preview to check.
  4. Your provider uses a spam filter that thinks we are sending spam and blocks our emails; this is called a false positive. Currently there is no way for you to know, except ask me: I can see if your provider has sent me an error message. Then you can complain at your provider or find a better one that does not block legitimate email. I’m working on a way to show mail errors in your page list.

How can I add pages to Follow That Page more easily?

Install the Firefox browser and the Follow That Page extension for Firefox. With this combination, you can right-click a page, select “Follow This Page” and follow the instructions.

Additionally, there is a batch add function that is visible when you are logged in.

Are there any bugs?

If you think you’ve found a bug or have a request for a feature, please contact me.

Who are you?

My name is Onno Zweers. I do this as a hobby. I work at SARA, an academic high performance computing institute, as a grid/cluster administrator. You’ll learn plenty about me through Google.

Webmaster FAQ

How can I add Follow That Page to my website?

Add the following link to your HTML code:

<a href="http://www.followthatpage.com/referrer">Follow This Page</a>

When people click the link, they will go to our website and the page they came from (your page) will be filled in already.

If you want, you can also supply the page address yourself, like this:

<a href="http://www.followthatpage.com/?url=http://www.google.com/">Follow Google</a>

But keep in mind that if there are any parameters in the page address, like this:

http://www.google.nl/search?q=keyword&start=0

then these parameters should be URLencoded, like this:

http://www.google.nl/search%3Fq%3Dkeyword%26start%3D0

You can use our Firefox extension to do it for you.

Isn't Follow That Page violating my copyrights?

I think this service is for the benefit of webmasters as well as their visitors. After all, it brings your visitors back to your website! And it’s more modest than Google’s cache, which copies almost the whole web and shows it to the whole world. But I understand why webmasters may dislike what Follow That Page does with their content. Therefore, I offer webmasters a way to get rid of Follow That Page! See next question.

How do I prevent Follow That Page from checking my website?

Make a file called robots.txt in the root of your webserver and fill it with something like this:

User-agent: FollowThatPage
Disallow: /

For more information about robots.txt files, see http://www.robotstxt.org/wc/exclusion-admin.html. We follow that standard except for one detail: Follow That Page does not listen to a User-agent: * line. Why not? If our users want us to follow a page, we will do so unless you explicitly deny the permission to Follow That Page. Disallowing search engines in general will not stop us, because our users are much more focused on a specific page than users of a search engine.

So, if you want to exclude both Follow That Page and general search engines from a folder called test, use the following:

User-agent: *
Disallow: /test

User-agent: FollowThatPage
Disallow: /test

Make sure that your robots.txt can be opened within 3 seconds. Otherwise it is ignored.