. x86 ( SSE).
. SSE, .
, , SSE, , . , . , , SSE, .
And then there is the opportunity to hint to the memory controller how you want to access the memory, for example. if you want to store data so that it bypasses the cache or not. For starving bandwidth algorithms that can give you extra extra speed on this.
source
share