Fermi L2 cache latency?

Does anyone know any details about the L2 cache in Fermi? I have heard that it is as slow as global memory and that L2 exists only to increase memory bandwidth, but I cannot find an official source confirming this. Has anyone measured the L2 hit latency? What about its size, line size, and other parameters?

In practice, how do L2 read misses affect performance? My understanding is that L2 only matters for memory-intensive applications. Please feel free to share your opinion.

Thanks.

+3
2 answers

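A thread on the NVIDIA forums has some measurements. They are not official and probably not 100% accurate, but they give at least some indication (measurements in clock cycles):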

1020 non-cached (L1 enabled, but not used)

1020 non-cached (L1 disabled)

365 L2 cached (L1 disabled)

88 L1 cached (L1 enabled and used)

Another set of measurements:

1060 non-cached

248 L2

18 L1
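
To get a rough feel for how L2 misses change the average cost of a load, here is a minimal sketch. It assumes a simple serial lookup (L1, then L2, then device memory), takes the 18 / 248 / 1060 figures above as clock cycles, and uses hit rates that are purely hypothetical. It also ignores the latency hiding a GPU gets from switching between warps, so treat it as an illustration rather than a performance model.

```python
def average_latency(l1_hit_rate, l2_hit_rate, l1=18, l2=248, mem=1060):
    """Expected latency of one load under a simple hit/miss model.

    l1_hit_rate: fraction of loads that hit in L1.
    l2_hit_rate: fraction of L1 misses that hit in L2.
    l1, l2, mem: latencies taken from the figures quoted above
                 (assumed to be clock cycles).
    """
    l1_miss = 1.0 - l1_hit_rate
    return (l1_hit_rate * l1
            + l1_miss * l2_hit_rate * l2
            + l1_miss * (1.0 - l2_hit_rate) * mem)


if __name__ == "__main__":
    # Hypothetical L2 hit rates with L1 bypassed entirely (l1_hit_rate = 0).
    for l2_rate in (0.0, 0.5, 0.9):
        print(f"L2 hit rate {l2_rate:.1f}: "
              f"{average_latency(0.0, l2_rate):.1f} cycles on average")
```

Under this model every L2 miss costs the full device-memory latency, so the average falls quickly as the L2 hit rate rises.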

+3

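It is not as slow as global memory. I don't have a source that says so explicitly, but the CUDA programming guide says: "A cache line request is serviced at the throughput of L1 or L2 cache in case of a cache hit, or at the throughput of device memory, otherwise." The two must differ for that statement to make sense, and why would NVIDIA add a cache with the same speed as global memory? It would be slower on average because of cache misses.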

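The L2 cache size is 768 KB, and the line size is 128 bytes. Section F4 of the CUDA programming guide has more information, especially sections F4.1 and F4.2. http://developer.download.nvidia.com/compute/DevZone/docs/html/C/doc/CUDA_C_Programming_Guide.pdf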
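
As a rough illustration of what those two numbers imply, the sketch below counts how many cache lines the L2 holds and how many distinct lines a strided access pattern touches. It reads the figures above as 768 KB and 128 bytes. The helper names and the example sizes are made up for illustration, and it ignores associativity and anything else sharing the cache.

```python
L2_SIZE_BYTES = 768 * 1024   # 768 KB, from the figure above
LINE_SIZE_BYTES = 128        # cache line size, from the figure above


def l2_lines():
    """Number of 128-byte lines that fit in the L2."""
    return L2_SIZE_BYTES // LINE_SIZE_BYTES


def lines_touched(num_elements, element_bytes=4, stride_elements=1):
    """Distinct cache lines touched by a strided walk from an aligned base.

    Hypothetical helper: element i lives at byte offset
    i * stride_elements * element_bytes.
    """
    step = stride_elements * element_bytes
    return len({(i * step) // LINE_SIZE_BYTES for i in range(num_elements)})


if __name__ == "__main__":
    print("L2 lines:", l2_lines())
    # 32 consecutive 4-byte values share one line; a stride of 32 elements
    # puts each value on its own line.
    print("contiguous:", lines_touched(32))
    print("stride 32: ", lines_touched(32, stride_elements=32))
```

The point is that a 128-byte line covers 32 four-byte values, so a scattered access pattern uses up cache lines far faster than a contiguous one.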

0
