I have a small data structure for each processor in the Linux kernel module, where each processor often writes and reads its own data. I know that I need to make sure that these data elements are not in the same cache line, because if they were then, the kernels would forever pollute each other. However, is there anything at the page level that I need to worry about in terms of SMP performance? i.e. will there be any performance impact from filling these structures with one processor up to 4096 bytes and their alignment?
This is on Linux 2.6 on x86_64.
(About whether it is worth optimizing, and the proposals that I make are not needed, what I'm looking for, is there a theoretical basis for concern about page alignment).
optimization with memory-management linux-kernel
kdt
source share