test/source at df29211eeb59f54079123e2bc82578a561431290 - test

Files

Hans Goudey d2f0cb6745 BLI: Unroll vector loops for better performance on GCC

On GCC, the loops created by `BLI_VEC_OP_IMPL` were not always
unrolled, leading to branching. For `attribute_math::mix4<float3>`,
this lead to a significant performance regression compared to its
older `interp_v3_v3v3v3v3` counterpart.

Instead of a using macros to create the for loops, use variadic
templates to manually unroll them. The compiler might do it anyway
(I didn't observe any effect on Clang in my tests), but there should
be no reason not to unroll these small loops, and making it explicit
and removing use of macros seems better.

On a Ryzen 3700x, this commits doubles the performance of Catmull
Rom curve position evaluation (from 18-19ms to around 9-10ms).

Differential Revision: https://developer.blender.org/D16136

2022-10-04 11:16:25 -05:00

blender

BLI: Unroll vector loops for better performance on GCC

2022-10-04 11:16:25 -05:00

creator

Support environment variables to override USER & SYSTEM resource paths

2022-10-04 13:54:09 +11:00

tools @ 2ab59df2c9

Bump submodule versions

2022-09-28 13:45:22 +02:00

CMakeLists.txt

Clang-tidy: Don't warn about unrecognized compiler flags

2022-05-06 15:26:54 +02:00