Fast Gaussian blur in real time on CUDA
Gaussian filtering is a widely used standard algorithm that many applications depend on, from sharpening and unsharp masking (USM) to feature detectors such as SIFT and SURF. The Gaussian filter is isotropic and separable, so a 2D blur can be computed as two 1D passes. These properties are very important for fast and efficient image processing. Gaussian filtering is usually a time-consuming task, which is why it is a good idea to accelerate it with CUDA.
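To make the separability property concrete, here is a minimal CPU reference sketch in C++. It is not the CUDA implementation benchmarked on this page. It only illustrates the idea: a horizontal 1D pass followed by a vertical 1D pass gives the same result as a full 2D Gaussian window, at 5 + 5 = 10 multiply-adds per pixel instead of 25 for a 5×5 window. The function names (gaussianKernel1D, blurPass, gaussianBlurSeparable), the single-channel float image layout, and the clamp-to-edge border handling are all assumptions made for this example.

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

// Build a normalized 1D Gaussian kernel with 2 * radius + 1 taps.
// (Hypothetical helper, named for this sketch only.)
std::vector<float> gaussianKernel1D(float sigma, int radius) {
    std::vector<float> k(2 * radius + 1);
    float sum = 0.0f;
    for (int i = -radius; i <= radius; ++i) {
        k[i + radius] = std::exp(-(i * i) / (2.0f * sigma * sigma));
        sum += k[i + radius];
    }
    for (float& w : k) w /= sum;  // weights sum to 1, so brightness is preserved
    return k;
}

// One 1D pass over a single-channel, row-major image.
// (dx, dy) = (1, 0) filters along rows, (0, 1) filters along columns.
// Assumption: out-of-range samples are clamped to the nearest edge pixel.
std::vector<float> blurPass(const std::vector<float>& src, int width, int height,
                            const std::vector<float>& k, int dx, int dy) {
    const int radius = static_cast<int>(k.size()) / 2;
    std::vector<float> dst(src.size());
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            float acc = 0.0f;
            for (int t = -radius; t <= radius; ++t) {
                const int sx = std::min(std::max(x + t * dx, 0), width - 1);
                const int sy = std::min(std::max(y + t * dy, 0), height - 1);
                acc += k[t + radius] * src[sy * width + sx];
            }
            dst[y * width + x] = acc;
        }
    }
    return dst;
}

// Separable Gaussian blur: horizontal pass, then vertical pass.
std::vector<float> gaussianBlurSeparable(const std::vector<float>& img,
                                         int width, int height,
                                         float sigma, int radius) {
    const std::vector<float> k = gaussianKernel1D(sigma, radius);
    const std::vector<float> rows = blurPass(img, width, height, k, 1, 0);
    return blurPass(rows, width, height, k, 0, 1);
}
```

Calling gaussianBlurSeparable(image, width, height, 1.0f, 2) corresponds to the sigma ~ 1, 5×5 window case. For a 24-bit color image, the same two passes would be applied to each channel. On a GPU, each pass would typically become its own kernel with one thread per output pixel, but that mapping is a design assumption here rather than a description of the benchmarked code.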
Standard features for Gaussian filtering on CUDA
Benchmarks for Gaussian blur on GeForce GTX 980 (Windows 7, CUDA 7.5, 32-bit)
Gaussian blur (sigma ~ 1, window 5×5) of a 24-bit color image at 3840×2160 resolution now takes just ~8 ms. These are benchmarks for 2K / 4K / 20 Mpix 24-bit images (computations on the GPU only, excluding device I/O latency)