I need to use OpenCL to parallelize part of a program. The OpenCL code runs on both the GPU and the CPU of my own system. I also have to run it on an Odroid board. There, it runs fine on the Odroid's GPU, but it gives wrong answers when I run it on the Odroid's CPU.

Does anyone have any idea what could be causing this?