Search:

Type: Posts; User: Dithermaster

Page 1 of 10 1 2 3 4

Search: Search took 0.00 seconds.

  1. Replies
    60
    Views
    20,303

    Sticky: Except that SPIR is independent now of OpenCL...

    Except that SPIR is independent now of OpenCL (e.g., works with Vulkan too) so impossible to lock it's version number to OpenCL. Except is SPIR 1.2 and 2.0 which were exclusive to OpenCL.
  2. Replies
    4
    Views
    270

    Check your third parameter to...

    Check your third parameter to clEnqueueNDRangeKernel ("work_dim") -- you have it set to 1, but it should be 2.
  3. Replies
    4
    Views
    270

    I suspect it's your local_item_size of {80,80} --...

    I suspect it's your local_item_size of {80,80} -- that's 6400 items in a workgroup, and most hardware can't do that big (check the device info for the hardware max, it might be something like 128,...
  4. It indicated that a certain OpenCL 1.2 flag was...

    It indicated that a certain OpenCL 1.2 flag was available even in their OpenCL 1.1 driver; now that they fully support OpenCL 1.2 it is no longer needed (perhaps it is still listed for backwards...
  5. Replies
    7
    Views
    578

    Intel GPU supports OpenCL 2.1 and SPIR-V....

    Intel GPU supports OpenCL 2.1 and SPIR-V. Reference: https://software.intel.com/en-us/articles/spir-v-is-a-better-spir-with-intel-opencl-code-builder
  6. Seems legal to me as long as you're not trying to...

    Seems legal to me as long as you're not trying to pass a host pointer in there. If you initialize the value and use the value only on the device side, seems completely legit.
  7. Yes. We do this in some of our kernels. It is...

    Yes. We do this in some of our kernels. It is legal.
  8. Replies
    8
    Views
    680

    Does GPU-Z say either GPU can do OpenCL? If...

    Does GPU-Z say either GPU can do OpenCL?
    If clGetPlatformIDs is crashing, that's a known issue; a bad OpenCL.dll. Back it up, delete it, and re-install your GPU driver which should put back a good...
  9. Replies
    5
    Views
    864

    Your kernel takes three arguments, int, float, &...

    Your kernel takes three arguments, int, float, & buffer
    You are calling setArg with buffer, buffer, & buffer
    The first two are wrong, you should be passing int & float (and also don't need to...
  10. No, they are not emulated. Your target...

    No, they are not emulated.

    Your target platform is AMD GPU and your kernel uses AMD-specific extension but your development system has an NVIDIA GPU that does not support them.

    Every GPU AMD...
  11. get_global_size is the same for all work iteam...

    get_global_size is the same for all work iteam (aka threads). If you enqueued your kernel for 512 work items it would be 512.
    get_global_id is unique for each work items. If you enqueued your kernel...
  12. Short answer: No Long answer: You are trying...

    Short answer: No

    Long answer: You are trying to run code that uses AMD-specific extensions on an NVIDIA GPU, where they are not supported. You should either switch to an AMD GPU or re-write the...
  13. Replies
    5
    Views
    864

    General advice: Since the display driver needs to...

    General advice: Since the display driver needs to use the GPU it kills your long-running compute process. When running OpenCL on the display GPU you should try to keep your kernels in the...
  14. Could your kernel execution time be dependent on...

    Could your kernel execution time be dependent on the input data?
  15. Starting in OpenCL 1.1 (OpenCL 1.0 was not...

    Starting in OpenCL 1.1 (OpenCL 1.0 was not threadsafe).
  16. Replies
    1
    Views
    356

    Doing it just as you describe works fine for us...

    Doing it just as you describe works fine for us on Windows. On Mac there were some issues so we always build from source. What platform are you on? Very sorry, but I can't share source, just confirm...
  17. Another thing to watch for: In CUDA the host...

    Another thing to watch for: In CUDA the host specified the block size and number of blocks. In OpenCL you specify the global size and optionally the block (workgroup) size.
  18. Replies
    2
    Views
    622

    Exactly. OpenCL isn't good for a low-latency...

    Exactly. OpenCL isn't good for a low-latency short calculation. The buffer and command queue overhead would always take longer than just doing the simple calculation on the host. OpenCL is about...
  19. AMD fast path mentioned in their optimization...

    AMD fast path mentioned in their optimization guide: http://developer.amd.com/tools-and-sdks/opencl-zone/amd-accelerated-parallel-processing-app-sdk/opencl-optimization-guide/
  20. It is a mystery to us too since we can't see the...

    It is a mystery to us too since we can't see the code. One possibility is that the compiler doesn't know the condition is always false (perhaps it is passed in as a kernel argument) and your...
  21. I haven't used the physics packages so I'm not...

    I haven't used the physics packages so I'm not help there, sorry.
  22. First time someone named me in a thread title ...

    First time someone named me in a thread title <g>

    I've had both NVIDIA and AMD cards in my HP Z820 for some time (different versions of each over time) as a way to confirm our kernels compile and...
  23. Replies
    2
    Views
    541

    @gregorstopar, all of the errors are defined in...

    @gregorstopar, all of the errors are defined in cl.h

    When you get CL_BUILD_PROGRAM_FAILURE you should get the build log using clGetProgramBuildInfo with CL_PROGRAM_BUILD_LOG so you know what the...
  24. You are doing extensive buffer manipulation in...

    You are doing extensive buffer manipulation in the host code, and this is very likely the source of your performance problems. The host code should only initialize the buffers and get the final...
  25. The OpenCL specification has a section that...

    The OpenCL specification has a section that describes exactly what read_imagef does for every sampler type, including interpolation. It would not be hard to write a replacement (in fact, the CPU...
Results 1 to 25 of 231
Page 1 of 10 1 2 3 4
Proudly hosted by Digital Ocean