Search:

Type: Posts; User: Dithermaster

Page 1 of 9 1 2 3 4


  1. Replies
    1
    Views
    189

    Doing it just as you describe works fine for us...

    Doing it just as you describe works fine for us on Windows. On Mac there were some issues so we always build from source. What platform are you on? Very sorry, but I can't share source, just confirm...
  2. Another thing to watch for: In CUDA the host...

    Another thing to watch for: In CUDA the host specifies the block size and number of blocks. In OpenCL you specify the global size and optionally the block (workgroup) size.
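The difference in launch parameters can be sketched with plain arithmetic. This is only an illustration in Python, not real host code: the helper name `cuda_to_opencl_sizes` is hypothetical, and a one-dimensional launch is assumed. It shows that the OpenCL global size corresponds to the CUDA block count multiplied by the block size, while the local (workgroup) size corresponds to the block size.

```python
def cuda_to_opencl_sizes(num_blocks, block_size):
    """Translate a 1-D CUDA launch shape into OpenCL terms.

    CUDA: the host gives the number of blocks and the block size.
    OpenCL: the host gives the total (global) size and, optionally,
    the workgroup (local) size.
    Hypothetical helper for illustration only.
    """
    if num_blocks <= 0 or block_size <= 0:
        raise ValueError("num_blocks and block_size must be positive")
    global_size = num_blocks * block_size  # total number of work items
    local_size = block_size                # work items per workgroup
    return global_size, local_size
```

For example, a CUDA launch of 64 blocks of 256 threads would map to `cuda_to_opencl_sizes(64, 256)`, which returns `(16384, 256)`.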
  3. Replies
    2
    Views
    425

    Exactly. OpenCL isn't good for a low-latency...

    Exactly. OpenCL isn't good for a low-latency short calculation. The buffer and command queue overhead would always take longer than just doing the simple calculation on the host. OpenCL is about...
  4. AMD fast path mentioned in their optimization...

    AMD fast path mentioned in their optimization guide: http://developer.amd.com/tools-and-sdks/opencl-zone/amd-accelerated-parallel-processing-app-sdk/opencl-optimization-guide/
  5. It is a mystery to us too since we can't see the...

    It is a mystery to us too since we can't see the code. One possibility is that the compiler doesn't know the condition is always false (perhaps it is passed in as a kernel argument) and your...
  6. I haven't used the physics packages so I'm not...

    I haven't used the physics packages so I'm not much help there, sorry.
  7. First time someone named me in a thread title ...

    First time someone named me in a thread title <g>

    I've had both NVIDIA and AMD cards in my HP Z820 for some time (different versions of each over time) as a way to confirm our kernels compile and...
  8. Replies
    2
    Views
    395

    @gregorstopar, all of the errors are defined in...

    @gregorstopar, all of the errors are defined in cl.h

    When you get CL_BUILD_PROGRAM_FAILURE you should get the build log using clGetProgramBuildInfo with CL_PROGRAM_BUILD_LOG so you know what the...
  9. You are doing extensive buffer manipulation in...

    You are doing extensive buffer manipulation in the host code, and this is very likely the source of your performance problems. The host code should only initialize the buffers and get the final...
  10. The OpenCL specification has a section that...

    The OpenCL specification has a section that describes exactly what read_imagef does for every sampler type, including interpolation. It would not be hard to write a replacement (in fact, the CPU...
  11. Two things: 1) local work group size area (width...

    Two things: 1) local work group size area (width * height) cannot be larger than what CL_DEVICE_MAX_WORK_GROUP_SIZE returns (which I've seen as small as 128 on older hardware, which 32x32 is larger...
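The first point can be checked with simple arithmetic before enqueueing a kernel. A minimal sketch in Python, not actual OpenCL host code: the helper name `fits_work_group_limit` is hypothetical, and `max_work_group_size` stands in for the value a real program would query with `CL_DEVICE_MAX_WORK_GROUP_SIZE`.

```python
def fits_work_group_limit(local_width, local_height, max_work_group_size):
    """Return True if a 2-D local size fits within the device limit.

    The local work group area (width * height) must not exceed the
    device's maximum work group size. Hypothetical helper.
    """
    return local_width * local_height <= max_work_group_size
```

With a limit of 128, `fits_work_group_limit(32, 32, 128)` is `False` because 32 * 32 = 1024, while `fits_work_group_limit(16, 8, 128)` is `True`.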
  12. Replies
    2
    Views
    1,018

    Some of the Amazon GPU instances have high-end...

    Some of the Amazon GPU instances have high-end NVIDIA GPUs which support OpenCL. I am not affiliated, just sharing.
  13. Replies
    3
    Views
    963

    It depends. If the array rarely changes and you...

    It depends. If the array rarely changes and you need to access it a lot, make a copy of it in i,j (instead of j,i) order. Alternatively, store it in an OpenCL image instead, which has more fair...
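The reordered copy suggested above might look like the following sketch. It is written in Python for illustration and assumes the array is stored flat in row-major order; the helper name `transposed_copy` and the layout are assumptions, since the original thread is not shown here.

```python
def transposed_copy(flat, rows, cols):
    """Return a copy of a row-major rows x cols array in cols x rows order.

    Element (r, c) of the input ends up at (c, r) in the output, so a
    walk that used to stride through memory becomes contiguous.
    Hypothetical helper for illustration only.
    """
    if len(flat) != rows * cols:
        raise ValueError("flat must contain rows * cols elements")
    out = [None] * (rows * cols)
    for r in range(rows):
        for c in range(cols):
            out[c * rows + r] = flat[r * cols + c]
    return out
```

The copy only pays off when, as the post says, the array rarely changes and is read many times, because the transpose itself costs a full pass over the data.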
  14. Replies
    5
    Views
    1,050

    Of course. For example, any system header that...

    Of course. For example, any system header that includes file system access, system clock access, stdio access, etc. None of these can be accessed from the device.
  15. The benefit of shared local memory is if many...

    The benefit of shared local memory is if many work items in a work group need to access the same memory at different times (for example, a matrix multiply). If each of your work items accesses...
  16. Replies
    5
    Views
    1,050

    Something you are #include'ing is trying to...

    Something you are #include'ing is trying to #include stdarg.h, which is likely not compatible with OpenCL C (which is based on C99). Check your includes (and their includes) to find the culprit.
  17. Replies
    1
    Views
    629

    Intel has some tools which can measure CPU and...

    Intel has some tools which can measure CPU and GPU power usage. I'm not aware of anything off the top of my head for AMD or NVIDIA.
  18. Replies
    5
    Views
    755

    Is there even a use case for pipes on CPU or GPU...

    Is there even a use case for pipes on CPU or GPU devices (that is more efficient or less code than just using global memory or images between kernels), or do they exist just for FPGA devices?
  19. Replies
    4
    Views
    1,192

    To clarify, never install OpenCL.dll in the...

    To clarify, never install OpenCL.dll in the system directory. If you install it in your application folder or bundle, that's less of a system-wrecking technical issue. Legally you might not have the...
  20. Replies
    4
    Views
    1,192

    Do (2). It works great. Never do (4) or (5),...

    Do (2). It works great.

    Never do (4) or (5); only drivers should install the ICD, and you could hurt other applications if you do it wrong.
  21. Replies
    5
    Views
    740

    I've heard that pipes benefit OpenCL on FPGA...

    I've heard that pipes benefit OpenCL on FPGA since they fit better into the pipelined hardware nature of those devices. They also seem to have some benefit in dynamic parallelism for...
  22. Replies
    2
    Views
    678

    My understanding is that OpenCL 1.2 is available...

    My understanding is that OpenCL 1.2 is available for Tegra but you have to contact NVIDIA to get it.
  23. I've seen 1024,1,1 only for the Apple CPU device,...

    I've seen 1024,1,1 only for the Apple CPU device, so I agree with your guess that it was that device. Switch to the GPU device for better dimensions.
  24. Replies
    1
    Views
    750

    Because the runtime may choose to run some...

    Because the runtime may choose to run some workgroups to completion before starting others (when the number of workgroups far exceeds the hardware capabilities) there are therefore no global...
  25. On Windows, OpenCL.dll _is_ the ICD, but you...

    On Windows, OpenCL.dll _is_ the ICD, but you still don't want to ship it. It varies by version, for one thing (what if you ship a version 1.2 one, but the vendor driver updated the system ICD...
Results 1 to 25 of 215