Added OpenCL kernel for the directional blur. This operation always uses the full input image. In the current implementation this input image is not cached on the device. Future enhancement could be to cache it on the available opencl devices