Maybe not. It depends on the OpenCL implementation and the hardware your program runs on.
The only way to make sure that it provides improvement is by comparing on platforms and implementations of interest - for a range of vector sizes (e.g., comparing 1 (scalar), 2, 4, 8, and 16).
grrussel
source share