Results 1 to 3 of 3

Thread: ULP accuracy profiles

  1. #1
    Junior Member
    Join Date
    Feb 2011

    ULP accuracy profiles

    I'd like to see a more accurate ULP accuracy profile and someway to detect or request it.

    If you compare the CUDA minimum ulp accuracy information with the opencl spec you will find that the CUDA specification requires a higher minimum accuracy.
    i.e. If you write to the CUDA api you are guaranteed a higher accuracy.

    Also there are no equivalents of the CUDA -prec-sqrt=true and -prec-div=true

  2. #2
    Senior Member
    Join Date
    Mar 2011

    Re: ULP accuracy profiles

    How about a ALU and memory transfer instruction cycle count profile? One could do a benchmark to make it, but if a vendor provides that information in a manual then they might also provide it in OpenCL. This kind of information could be useful when deciding where to run kernels, on the CPU, GPU, or etc.

  3. #3
    Senior Member
    Join Date
    May 2010
    Toronto, Canada

    Re: ULP accuracy profiles

    I've forwarded this thread to the spec editor.

    Personal comment: DRAM memory latency is not a deterministic value. Also, ALU latencies are (a) something that hardware vendors would probably rather not disclose and (b) again, possibly non-deterministic.

    The only way to know which device will run a kernel faster is actually running it.
    Disclaimer: Employee of Qualcomm Canada. Any opinions expressed here are personal and do not necessarily reflect the views of my employer. LinkedIn profile.

Similar Threads

  1. Replies: 1
    Last Post: 11-30-2009, 10:43 PM
  2. New profiles
    By mike260 in forum OpenGL ES
    Replies: 0
    Last Post: 09-20-2003, 01:25 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
Proudly hosted by Digital Ocean