
Thread: Cache miss in kernel

  1. #1
    Junior Member
    Join Date
    Oct 2013

    Cache miss in kernel


    Should I consider the caches of a single core?

    The input data is two 3D matrices, each containing 16x256x16 elements.

    When the core accesses the data, it does so slowly.

    So I guess I am causing a lot of cache misses.

    Where can I find information about the L1 and L2 cache sizes of a graphics card?

    I'm using an NVIDIA GeForce 9400 GT.

    The spec does not contain this information.


  2. #2
    Hi Zvika,

    GeForce 9400 GT is compute capability 1.0 (see here:

    Look at the CUDA Programming Guide, Appendix G.3, for an explanation of the compute capability 1.x architecture and how to access memory (it is a split-warp architecture, so memory requests are issued per half-warp).
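
    Compute capability 1.x devices do not cache ordinary global memory reads in an L1/L2 cache, so the slowdown is more likely an access-pattern problem than a cache-miss problem. Here is a rough host-side C++ sketch (not kernel code) of why the thread-to-element mapping matters for a 16x256x16 array. It assumes 4-byte float elements, a row-major layout with x varying fastest, and a 64-byte memory segment; the names linear_index and segments_touched are made up for illustration. It only counts the distinct segments a half-warp of 16 threads touches, which is a simplification: on compute capability 1.0, any pattern that is not sequential and aligned is split into 16 separate transactions.

```cpp
#include <cassert>
#include <cstddef>
#include <set>

// Dimensions taken from the question: each matrix is 16 x 256 x 16 elements.
constexpr int NX = 16, NY = 256, NZ = 16;
constexpr int HALF_WARP = 16;        // threads serviced together on compute capability 1.x
constexpr std::size_t SEGMENT = 64;  // bytes; assumed segment size for 4-byte words

// Row-major linear index with x as the fastest-varying dimension (assumed layout).
std::size_t linear_index(int x, int y, int z) {
    assert(x >= 0 && x < NX && y >= 0 && y < NY && z >= 0 && z < NZ);
    return (static_cast<std::size_t>(z) * NY + y) * NX + x;
}

// Count the distinct 64-byte segments touched when thread t of a half-warp
// reads float element number (base + t * stride).
int segments_touched(std::size_t base, std::size_t stride) {
    std::set<std::size_t> segments;
    for (int t = 0; t < HALF_WARP; ++t) {
        std::size_t byte_offset = (base + t * stride) * sizeof(float);
        segments.insert(byte_offset / SEGMENT);
    }
    return static_cast<int>(segments.size());
}
```

    With this model, mapping consecutive threads to consecutive x (stride 1) keeps the whole half-warp inside one segment, while mapping them to consecutive y (stride NX = 16 elements) spreads the half-warp over 16 segments. So if the kernel is slow, the first thing to check is which array dimension threadIdx.x indexes.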

