Results 1 to 3 of 3

Thread: memory coalescing

  1. #1
    Junior Member
    Join Date
    Feb 2013

    memory coalescing

    actually i am working on memory coalescing technique. and i have searched so much but hardly able to get code regarding this issue. Do you have code regarding this?

  2. #2
    Senior Member
    Join Date
    Oct 2012

    Re: memory coalescing

    I think the best explanation on that will ne the lectures here:
    For Memory Coalescing, have a look at
    CUDA University Courses

    University of Illinois : ECE 498AL
    Taught by Professor Wen-mei W. Hwu and David Kirk, NVIDIA CUDA Scientist.
    --> Memory Bank Conflicts (115 MB)

  3. #3
    Senior Member
    Join Date
    Dec 2011

    Re: memory coalescing

    There are many nuances and details, but for simple kernels the key element is this: For adjacent work items, you want them accessing adjacent memory. Sometimes this means doing things in a counter-intuitive fashion. An example is using a 1D kernel to process 2D images (there are reasons why you'd want to do this) -- you should run it on columns and not rows (i.e., interpret get_global_id(0) as X) because then for each iteration of the Y loop inside the kernel the work items will be accessing horizontally adjacent pixels.

Similar Threads

  1. Replies: 6
    Last Post: 02-28-2013, 03:59 PM
  2. global memory coalescing question
    By openclnewb in forum OpenCL
    Replies: 3
    Last Post: 03-31-2010, 10:47 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
Proudly hosted by Digital Ocean