Search:

Type: Posts; User: Dithermaster

Page 1 of 9 1 2 3 4

Search: Search took 0.00 seconds.

  1. Another thing to watch for: In CUDA the host...

    Another thing to watch for: In CUDA the host specified the block size and number of blocks. In OpenCL you specify the global size and optionally the block (workgroup) size.
  2. Replies
    2
    Views
    384

    Exactly. OpenCL isn't good for a low-latency...

    Exactly. OpenCL isn't good for a low-latency short calculation. The buffer and command queue overhead would always take longer than just doing the simple calculation on the host. OpenCL is about...
  3. AMD fast path mentioned in their optimization...

    AMD fast path mentioned in their optimization guide: http://developer.amd.com/tools-and-sdks/opencl-zone/amd-accelerated-parallel-processing-app-sdk/opencl-optimization-guide/
  4. It is a mystery to us too since we can't see the...

    It is a mystery to us too since we can't see the code. One possibility is that the compiler doesn't know the condition is always false (perhaps it is passed in as a kernel argument) and your...
  5. I haven't used the physics packages so I'm not...

    I haven't used the physics packages so I'm not help there, sorry.
  6. First time someone named me in a thread title ...

    First time someone named me in a thread title <g>

    I've had both NVIDIA and AMD cards in my HP Z820 for some time (different versions of each over time) as a way to confirm our kernels compile and...
  7. Replies
    2
    Views
    372

    @gregorstopar, all of the errors are defined in...

    @gregorstopar, all of the errors are defined in cl.h

    When you get CL_BUILD_PROGRAM_FAILURE you should get the build log using clGetProgramBuildInfo with CL_PROGRAM_BUILD_LOG so you know what the...
  8. You are doing extensive buffer manipulation in...

    You are doing extensive buffer manipulation in the host code, and this is very likely the source of your performance problems. The host code should only initialize the buffers and get the final...
  9. The OpenCL specification has a section that...

    The OpenCL specification has a section that describes exactly what read_imagef does for every sampler type, including interpolation. It would not be hard to write a replacement (in fact, the CPU...
  10. Two things: 1) local work group size area (width...

    Two things: 1) local work group size area (width * height) cannot be larger than what CL_DEVICE_MAX_WORK_GROUP_SIZE returns (which I've seen as small as 128 on older hardware, which 32x32 is larger...
  11. Replies
    2
    Views
    977

    Some of the Amazon GPU instances have high-end...

    Some of the Amazon GPU instances have high-end NVIDIA GPUs which support OpenCL. I am not affiliated, just sharing.
  12. Replies
    3
    Views
    918

    It depends. If the array rarely changes and you...

    It depends. If the array rarely changes and you need to access it a lot, make a copy of it in i,j (instead of j,i) order. Alternatively, store it in an OpenCL image instead, which has more fair...
  13. Replies
    5
    Views
    1,019

    Of course. For example, and system header that...

    Of course. For example, and system header that includes file system access, system clock access, stdio access, etc. None of these can be accessed from the device.
  14. The benefit of shared local memory is if many...

    The benefit of shared local memory is if many work items in a work group need to access the same memory at different times (for example, a matrix multiply). If each of your work items accesses...
  15. Replies
    5
    Views
    1,019

    Something you are #include'ing is trying to...

    Something you are #include'ing is trying to #include stdarg.h, which is likely not compatible with OpenCL C99. Check your includes (and their includes) to find the culprit.
  16. Replies
    1
    Views
    613

    Intel has some tools which can measure CPU and...

    Intel has some tools which can measure CPU and GPU power usage. I'm not aware of anything off the top of my head for AMD or NVIDIA.
  17. Replies
    5
    Views
    731

    Is there even a use case for pipes on CPU or GPU...

    Is there even a use case for pipes on CPU or GPU devices (that is more efficient or less code than just using global memory or images between kernels), or do they exist just for FPGA devices?
  18. Replies
    4
    Views
    1,134

    To clarify, never install OpenCL.dll in the...

    To clarify, never install OpenCL.dll in the system directory. If you install it in your application folder or bundle, that's less of a system-wrecking technical issue. Legally you might not have the...
  19. Replies
    4
    Views
    1,134

    Do (2). It works great. Never do (4) or (5),...

    Do (2). It works great.

    Never do (4) or (5), only drivers should install the ICD and you could hurt other applications if you do it wrong.
  20. Replies
    5
    Views
    715

    I've heard that pipes benefit OpenCL on FPGA...

    I've heard that pipes benefit OpenCL on FPGA since they fit better into the pipelined hardware nature of those devices. They also seem to have some benefit in dynamic parallelism for...
  21. Replies
    2
    Views
    661

    My understanding is that OpenCL 1.2 is available...

    My understanding is that OpenCL 1.2 is available for Tegra but you have to contact NVIDIA to get it.
  22. I've seen 1024,1,1 only for the Apple CPU device,...

    I've seen 1024,1,1 only for the Apple CPU device, so I agree with your guess that it was that device. Switch to the GPU device for better dimensions.
  23. Replies
    1
    Views
    733

    Because the runtime may choose to run some...

    Because the runtime may choose to run some workgroups to completion before starting others (when the number of workgroups far exceeds the hardware capabilties) there are therefore no global...
  24. On Windows, OpenCL.dll _is_ the ICD, but you...

    On Windows, OpenCL.dll _is_ the ICD, but you still don't want to ship it. It varies by version, for one thing (what if you ship a version 1.2 one, but the vendor driver updated the system ICD...
  25. You do NOT want to ship this DLL with your...

    You do NOT want to ship this DLL with your project. The one installed on the system is the one you want to use. What problem are you trying to solve?
Results 1 to 25 of 214
Page 1 of 9 1 2 3 4
Proudly hosted by Digital Ocean