Fermi L2 Cache Delay?

Question

Fermi L2 Cache Delay?

Does anyone know related information about L2 cache in Fermi? I heard that it is as slow as global memory, and using L2 is just to increase memory bandwidth. But I can not find an official source confirming this. Has anyone measured L2 hit latency? What about size, row size and other parameters?

Essentially, how does L2 read misses affect performance? In my sense, L2 only makes sense in memory related applications. Please feel free to give your opinion.

thank

+3

opencl gpu gpgpu cuda

Zk1001 Jul 19 '11 at 8:11

source share

2 answers

, . , , CUDA : " L1 L2 , ." , NVIDIA ? - .

. L2 768 , - 128 . F4 CUDA , F4.1 F4.2. http://developer.download.nvidia.com/compute/DevZone/docs/html/C/doc/CUDA_C_Programming_Guide.pdf

0

jmsu 19 . '11 16:42

Grizzly · Accepted Answer · 2012-01-16T14:36:29+0000

nvidia . , , 100% , , , ( ):

1020 (L1 , )
1020 (L1 )
365 L2 (L1 )
88 L1 (L1 )

:

1060
248 L2
18 L1

Fermi L2 Cache Delay?

More articles: