test2/kernel at 47da8dcbcad4ccc5349bc303394e1d01d1c822c5 - test2 - Vibe4D

griefith/test2

Files

History

Stefan Werner 47da8dcbca Cycles: Improved thread order for better CUDA performance.

This patch puts threads that render the same pixel closer together,
as opposed to threads that render the same sample. Thus threads
within a warp are more coherent in memory access and control flow,
leading to performance improvements.

Example benchmarks on a Quadro RTX4000 (WDDM) on Windows 10:
Koro:                 4:23 ->  3:46
BMW:                  1:18 ->  1:25
Barbershop Interior: 17:52 -> 14:55
Classroom:            4:37 ->  3:45

Performance differences on OpenCL/AMD were hit and miss, some scenes
became faster, others lost significantly. Therefore, this is kept as
CUDA only change for now.

2019-03-14 11:45:58 +01:00

..

Cycles: Fix uninitialized number of hits

2019-02-20 23:20:07 +01:00

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

Cycles: animation denoising support in the kernel.

2019-02-06 15:18:42 +01:00

Fix T62481: Cycles crash rendering with UV pass after recent changes.

2019-03-12 14:11:36 +01:00

Cycles OpenCL: Remove single program

2019-03-08 16:31:35 +01:00

Cycles: Added Float2 attribute type.

2019-03-05 14:55:21 +01:00

Fix T61470: incorrect saturation clamping in recent bugfix.

2019-02-14 19:28:44 +01:00

Fix T60300: Cycles SSS render hanging with AMD OpenCL.

2019-01-08 15:37:16 +01:00

Fix T61103: Cycles bevel wrong on objects with negative scale.

2019-03-11 14:26:06 +01:00

CMakeLists.txt

Cycles OpenCL: Remove single program

2019-03-08 16:31:35 +01:00

kernel_accumulate.h

Fix Cycles AO pass not working for shadow catcher objects.

2018-08-20 16:09:17 +02:00

kernel_bake.h

Cleanup: fix compiler warnings.

2019-02-14 19:39:39 +01:00

kernel_camera.h

Cycles: support arbitrary number of motion blur steps for cameras.

2018-03-10 06:27:19 +01:00

kernel_color.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_compat_cpu.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_compat_cuda.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_compat_opencl.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_differential.h

Cleanup: strip trailing space for cycles

2018-07-06 10:17:58 +02:00

kernel_emission.h

Cleanup: strip trailing space for cycles

2018-07-06 10:17:58 +02:00

kernel_film.h

Cleanup: strip trailing space for cycles

2018-07-06 10:17:58 +02:00

kernel_globals.h

Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel

2018-11-29 02:45:24 +01:00

kernel_id_passes.h

Cleanup: trailing space

2018-11-25 08:01:14 +11:00

kernel_jitter.h

Cleanup: strip trailing space for cycles

2018-07-06 10:17:58 +02:00

kernel_light.h

Cycles: Fixed OpenCL build. sqr(float4) is available on CUDA and CPU, but not on OpenCL.

2018-07-30 15:42:00 +02:00

kernel_math.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_montecarlo.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_passes.h

Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel

2018-11-29 02:45:24 +01:00

kernel_path_branched.h

Fix T54962: Cycles crash using subsurface scattering texture blur.

2019-01-03 17:10:37 +01:00

kernel_path_common.h

Code refactor: remove rng_state buffer and compute hash on the fly.

2017-10-04 21:11:14 +02:00

kernel_path_state.h

Cycles: Cleanup, style

2018-08-24 14:36:18 +02:00

kernel_path_subsurface.h

Fix T54962: Cycles crash using subsurface scattering texture blur.

2019-01-03 17:10:37 +01:00

kernel_path_surface.h

Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel

2018-11-29 02:45:24 +01:00

kernel_path_volume.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_path.h

Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel

2018-11-29 02:45:24 +01:00

kernel_profiling.h

Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel

2018-11-29 02:45:24 +01:00

kernel_projection.h

Cleanup: strip trailing space for cycles

2018-07-06 10:17:58 +02:00

kernel_queues.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_random.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_shader.h

Fix T61103: Cycles bevel wrong on objects with negative scale.

2019-03-11 14:26:06 +01:00

kernel_shadow.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00

kernel_subsurface.h

Fix T54962: Cycles crash using subsurface scattering texture blur.

2019-01-03 17:10:37 +01:00

kernel_textures.h

Cycles: Added Float2 attribute type.

2019-03-05 14:55:21 +01:00

kernel_types.h

Cycles: prefilter feature passes separate from denoising.

2019-02-06 15:18:29 +01:00

kernel_volume.h

Cleanup: fix compiler warnings.

2019-02-14 19:39:39 +01:00

kernel_work_stealing.h

Cycles: Improved thread order for better CUDA performance.

2019-03-14 11:45:58 +01:00

kernel.h

Cycles: Cleanup, spacing after preprocessor

2018-11-09 11:34:54 +01:00