The cl_event return by clEnqueueNDRange in last nVidia implementation on windows seems to be broken.
If I don't call clWaitEvent, its execution status stay on queued.
And if I try to get the get command queue with clGetEventInfo, the cl_command_queue isn't valid (the address seems to be shifted by few bytes) and a clRetainCommandQueue on it crash.
There is no problem with AMD implementation.
Someone have the same problem?