I have finished my OpenCL code on an Intel CPU and verified its correctness with AMD APP and the Intel OpenCL tool set.
Now I want to run the code on a GPU platform to see how well it performs. My questions are:
1. For a complicated kernel (relatively large code size and heavy use of local and private memory), which performs better: an Nvidia or an AMD GPU?
2. In terms of development tools (convenience and completeness) and support resources, which is better: Nvidia or AMD?
3. I heard that CUDA performs better than OpenCL on Nvidia GPUs because Nvidia favors CUDA over OpenCL. Is that true? Does it mean that AMD pays more attention to OpenCL and offers better tools?
4. Can anybody recommend a graphics card with a good performance-to-price ratio?

Thank you so much!