Type: Posts; User: fireplace

Search: Search took 0.00 seconds.

  1. empty struct / enum layout compatibility between host and OpenCL code


    I have just learnt the hard way that when defining a struct with no data members:
    typedef struct {} someEmptyStruct;
    the memory layout for this struct is different on the host (C++) side...
  2. yes, the offset is in bytes, I know. What puzzles...

    yes, the offset is in bytes, I know. What puzzles me is why the alignment requirement:
    i) is there after all (I can well come along with a performance penalty in the case of misalignment; but an...
  3. subbuffer understanding (alignment / usefulness)


    I just tried created to cl::Buffer subbuffer-objects, where for the second creation I received an error:

    cl_buffer_region bufReg;
    bufReg.origin = 0;
    bufReg.size = x;
  4. Re: memory allocation / dealloaction / copying timing questi

    Ok so for allocation I assume that all devices attached to a context must have sufficient memory available, even if never invoked for the given kernel.
    Sure I am responsible for allocationg memory...
  5. Re: Regular question about dual gpu on single board

    I second that, ask the manufacturer but expect it to see two independent GPU devices each at 3GB -> as if you'd plug two ordinary 7970s into your system.

    Presuming you haven't purchased the card...
  6. Re: memory allocation / dealloaction / copying timing questi

    Thanks for the reply!

    For the first question, at #1 no device has been specified yet. Assume one context attaches to several GPU-devices of the same kind, but the kernel is finally run only on a...
  7. memory allocation / dealloaction / copying timing questions


    I am trying to figure out memory interaction issues (allocation, duration and copy operations etc.) between host and devices, using the C++ API (whens, hows, guarantees, nogos etc.). So assume...
  8. typedefs (cl_int etc) for mulitple platforms


    Consider multiple platforms on the same system, e.g. Intel OpenCL SDK for targeting an Intel CPU, and Nvidia SDK for targeting an Nvidia GPU. Both platforms / devices shall be targeted in the...
  9. device memory limits (unexpectedly low for CPU)


    This is somewhat an implementation issue, but I hope general enought to get here insights.

    Consider two input vectors of N double (64-bit) elements, and one output vector, and allocating...
  10. Re: dual-GPU graphics cards: one or two devices?

    So assume you want to keep both GPUs of the card busy. I suppose the most straightforward way is simply setting up two command queues (one for each device), and then split the data, i.e....
  11. dual-GPU graphics cards: one or two devices?


    Consider a dual-GPU graphics card like the Nvidia 590 or the Radeon 6990.
    Does anyone know if such a card cwill show up as a single or two devices? If just one device, has the device...
  12. Replies

    [newbie] OpenCL code within C++ program


    I have a present code written in C++. It uses only platform-independent C++98 features and boost libraries; of the latter only the threads library is used as non-header only.

    The code has...
Results 1 to 12 of 12
Proudly hosted by Digital Ocean