Add atomic unrolled drain loops that precede the fix-up segments and run significantly faster than scalar code. This requires that the main loop be super-unrolled after vectorization.
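
The loop structure described above can be modeled roughly as follows. This is only a sketch of the iteration-space split, not compiler output: the names `VEC`, `UNROLL`, and `add_arrays` are hypothetical, list slices stand in for vector operations, and "atomic unrolled" is assumed here to mean one vector operation per drain-loop iteration.

```python
VEC = 4      # assumed vector width in elements (hypothetical value)
UNROLL = 4   # assumed super-unroll factor of the main loop (hypothetical value)


def add_arrays(a, b):
    """Element-wise add, split into main, drain, and fix-up segments.

    Returns the result list and the number of iterations each segment ran.
    """
    n = len(a)
    out = [0] * n
    counts = {"main": 0, "drain": 0, "fixup": 0}
    i = 0

    # Main loop: super-unrolled, UNROLL vector operations per iteration.
    step = VEC * UNROLL
    while i + step <= n:
        for j in range(i, i + step, VEC):
            out[j:j + VEC] = [x + y for x, y in zip(a[j:j + VEC], b[j:j + VEC])]
        i += step
        counts["main"] += 1

    # Drain loop: one vector operation per iteration. It consumes whole
    # vectors that the super-unrolled main loop could not cover.
    while i + VEC <= n:
        out[i:i + VEC] = [x + y for x, y in zip(a[i:i + VEC], b[i:i + VEC])]
        i += VEC
        counts["drain"] += 1

    # Fix-up segment: scalar code for the final elements (fewer than VEC).
    while i < n:
        out[i] = a[i] + b[i]
        i += 1
        counts["fixup"] += 1

    return out, counts
```

In this model, the drain loop bounds the scalar fix-up work at fewer than `VEC` elements. Without it, the fix-up segment could have to handle up to `VEC * UNROLL - 1` elements one at a time, which is why the drain loop only pays off once the main loop is super-unrolled.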