Thread: Copying Command Queue With Everything in it

  1. #1
    Junior Member
    Join Date
    Sep 2012

    I have a program that does heavy work on the host side and enqueues a lot of kernels (for example, 50 kernels for a single reduction), but the host-side enqueueing adds too much latency. The device side is faster, even though it is a low-end GPU. To overcome this, is it safe to copy a command queue into another command queue (I don't know how) and then re-enqueue everything from that copy into the original queue with a single API call? If so, how can I do it? Some programs, such as fluid advection, need even hundreds of kernels at each step. Profiling shows that the CPU-side enqueue operations take nearly 1/3 of the total cycle, and the big blue blob in the timeline is already at maximum PCIe bandwidth (2.78 GB/s on an x8 2.0 link).

    The kernel-only device part takes no more than 1.5 ms for a sum reduction over 4M float elements.
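
    To put the quoted figures side by side, here is a rough back-of-the-envelope estimate of how long one copy of the reduction input would take over PCIe. It is only a sketch: it assumes that "4M" means 4 * 2**20 elements, that each element is a 32-bit float, that 1 GB is 1e9 bytes, and that the whole array crosses the bus once per cycle. None of these is confirmed by the measurements, and `transfer_time_ms` is a hypothetical helper name.

```python
# Back-of-the-envelope estimate. Only 2.78 GB/s and 1.5 ms come from the
# post; everything else here is an assumption made for illustration.

BYTES_PER_FLOAT = 4  # assumption: 32-bit floats


def transfer_time_ms(num_elements: int, bandwidth_gb_per_s: float) -> float:
    """Milliseconds needed to move num_elements floats (1 GB = 1e9 bytes)."""
    total_bytes = num_elements * BYTES_PER_FLOAT
    return total_bytes / (bandwidth_gb_per_s * 1e9) * 1e3


if __name__ == "__main__":
    elements = 4 * 1024 * 1024  # assumption: "4M" means 4 * 2**20
    pcie_gb_per_s = 2.78        # bandwidth figure quoted in the post
    kernel_ms = 1.5             # kernel-only upper bound quoted in the post

    copy_ms = transfer_time_ms(elements, pcie_gb_per_s)
    print(f"one-way copy: {copy_ms:.2f} ms, kernels: <= {kernel_ms} ms")
```

    Under these assumptions a single one-way copy comes out at about 6 ms, roughly four times the kernel-only time, so both the transfer and the enqueue overhead would outweigh the device work.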

    Thank you for your time.
    Last edited by Tugrul; 05-26-2017 at 02:20 PM.
