Now that there is a condition in the loop count there is possibility the compiler doesn't optimize this. Be conservative an manually unroll for better codegen.
Now that there is a condition in the loop count there is possibility the compiler doesn't optimize this. Be conservative an manually unroll for better codegen.