ColorfulCircles on my MacBookPro (2.8GHZ core 2 duo, NVidia 9400M) has very poor frame rate which seems to be related to the blurs and the fact that the 9400M uses system memory instead of VRAM. Using a 1 pixel box blur is much faster, which demonstrates that the issue is related to the actual blur algorithm rather. Using the 9600M GT (also on this notebook) showed good performance initially but soon become very jittery (perhaps this was due to overheating the GPU?).
For many types of blurs where precision is less important (as in ColorfulCircles and a lot of other animated blur situations) we could use MIP Mapped blurring (downscale the image and upscale again) to get the desired effect. Perhaps what we need is a FastBlur class which uses a box, gaussian, or image-scaled approach depending on the situation. In this way we could make sure that typical usages of animated blurs would be fast even on lame hardware (and even in software mode).