griefith/test

Author	SHA1	Message	Date
Sebastian Herholz	d9bc8f189c	Cycles: add build option to enable a debugging feature for MIS This patch adds a CMake option "WITH_CYCLES_DEBUG" which builds cycles with a feature that allows debugging/selecting the direct-light sampling strategy. The same option may later be used to add other debugging features that could affect performance in release builds. The three options are: * Forward path tracing (e.g., via BSDF or phase function) * Next-event estimation * Multiple importance sampling combination of the previous two methods Such a feature is useful for debugging light different sampling, evaluation, and pdf methods (e.g., for light sources and BSDFs). Differential Revision: https://developer.blender.org/D13152	2021-11-17 18:03:56 +01:00
Brecht Van Lommel	9937d5379c	Cycles: add packed_float3 type for storage Introduce a packed_float3 type for smaller storage that is exactly 3 floats, instead of 4. For computation float3 is still used since it can use SIMD instructions. Ref T92212 Differential Revision: https://developer.blender.org/D13243	2021-11-17 17:29:41 +01:00
Thomas Dinges	83a4d51997	Cleanup: Remove unused show_samples() device code in Cycles.	2021-11-17 11:16:48 +01:00
Campbell Barton	1143bf281a	Cleanup: spelling in comments, comment block formatting	2021-11-13 13:07:13 +11:00
Campbell Barton	acc800d24d	Cleanup: clang-format	2021-11-13 12:47:18 +11:00
Patrick Mours	809ae823b7	Merge branch 'blender-v3.0-release'	2021-11-12 19:00:23 +01:00
Patrick Mours	9d0d4b8601	Fix T93029: OptiX denoising artifacts at high sample counts in specific scenes Partially reverts commit rB440a3475b8f5410e5c41bfbed5ce82771b41356f because "optixDenoiserComputeIntensity" does not currently support input images that are not packed (the "pixelStrideInBytes" field is not zero). As a result the intensity calculation would take into account data from other passes in the image, some of which was scaled by the number of samples still and therefore produce widely incorrect results that then caused artifacts in the denoised image. Maniphest Tasks: T93029	2021-11-12 18:59:50 +01:00
Thomas Dinges	25e7365d0d	Cleanup CUDA / HIP comments Remove outdated CUDA comments for bindless textures and cleanup some HIP comments that still mentioned CUDA. Differential Revision: https://developer.blender.org/D13189	2021-11-11 16:37:29 +01:00
Sergey Sharybin	ce395c84a3	Merge branch 'blender-v3.0-release'	2021-11-11 15:29:35 +01:00
Sergey Sharybin	d26d3cfe19	Fix T92868: Cycles catcher with transparency crashes The issue was caused by splitting happening twice. Fixed by checking for split flag which is assigned to the both states during split. The tricky part was to write catcher data at the moment of split: the transparency and shadow catcher sample count is to be accumulated at that point. Now it is happening in the `intersect_closest` kernel. The downside is that render buffer is to be passed to the kernel, but the benefit is that extra split bounce check is not needed now. Had to move the passes write to shadow catcher header, since include of `film/passes.h` causes all the fun of requirement to have BSDF data structures available. Differential Revision: https://developer.blender.org/D13177	2021-11-11 15:21:35 +01:00
Thomas Dinges	9ca8bf0b29	Merge branch 'blender-v3.0-release'	2021-11-10 22:28:03 +01:00
Thomas Dinges	040630bb9a	Fix wrong device check in HIP kernel compile. Also cleanup some related code, that was falsely copied from CUDA. Differential Revision: https://developer.blender.org/D13180	2021-11-10 22:24:53 +01:00
Brecht Van Lommel	3fa86f4b28	Merge branch 'blender-v3.0-release'	2021-11-10 20:19:09 +01:00
Thomas Dinges	e507a789b3	Cycles: disable graphics interop for HIP devices This is due to a driver bug, so disable it for now until it gets resolved in a future driver release. Ref T92972 Differential Revision: https://developer.blender.org/D13167	2021-11-10 20:16:44 +01:00
Aaron Carlisle	7383f95443	Merge branch 'blender-v3.0-release'	2021-11-09 11:43:18 -05:00
Brecht Van Lommel	fd0ba6449b	Cycles: mark both RDNA and RDNA2 as support for HIP	2021-11-09 17:38:25 +01:00
Patrick Mours	faeb2cc900	Merge branch 'blender-v3.0-release'	2021-11-09 14:49:47 +01:00
Patrick Mours	440a3475b8	Cycles: Improve OptiX denoising with dark images and fix crash when denoiser is destroyed Adds a pass before denoising that calculates the intensity of the image, which can be passed into the OptiX denoiser for more optimal results for very dark or very bright images. In addition this also fixes a crash that sometimes occurred on exit. The OptiX denoiser object has to be destroyed before the OptiX device context object (since it references that). But in C++ the destructor function of a class is called before its fields are destructed, so "~OptiXDevice" was always called before "OptiXDevice::~Denoiser" and therefore "optixDeviceContextDestroy" was called before "optixDenoiserDestroy", hence the crash. Differential Revision: https://developer.blender.org/D13160	2021-11-09 14:49:00 +01:00
Patrick Mours	9daf6a69a6	Fix T92472: OptiX denoising artifacts with recent GPU driver 495.29.05 or newer on Linux Adds a workaround for a driver bug in r495 that causes artifacts with OptiX denoising. `optixDenoiserSetup` is not working properly there when called with a stream other than the default stream, so use the default stream for now and force synchronization across the entire context afterwards to ensure the other stream Cycles uses to enqueue the actual denoising command cannot execute before the denoising setup has finished. Maniphest Tasks: T92472 Differential Revision: https://developer.blender.org/D13158	2021-11-09 14:47:26 +01:00
Hans Goudey	9e611c5616	Merge branch 'blender-v3.0-release'	2021-11-05 16:33:08 -05:00
Brecht Van Lommel	97ff37bf54	Cycles: perform CPU film reading in the kernel, to use AVX2 half conversion Adds a bunch of CPU kernel function to process on row of pixels, and use those instead of calling unoptimized implementations. Fixes T92598	2021-11-05 22:04:36 +01:00
Germano Cavalcante	df3e30398f	Merge branch 'blender-v3.0-release'	2021-11-04 16:56:32 -03:00
Brecht Van Lommel	e51735d276	Cleanup: fix typo	2021-11-04 20:36:35 +01:00
Brecht Van Lommel	4960ad420b	Cycles: add code to check for supported HIP device architectures RDNA2 only for now to be conservative, but testing more hardware is underway. Ref T92393 Differential Revision: https://developer.blender.org/D12958	2021-11-04 20:34:21 +01:00
Brian Savery	36f5198282	Fix Cycles HIP Kernels loading on Arch names with extra options The kernel file names are search for based on the arch name, for example gfx1010. However HIP's gcnArchName can contain options such as xnack- in the name. For example gfx1010:sramecc-:xnack-. This revision tokenizes the info from gcnArchName and just uses the first token for choosing the Kernel file to use. Kernels are portable across those features in the arch name. Also remove the bit for recompiling ptx as clearly that is not relevant. Differential Revision: https://developer.blender.org/D13117	2021-11-04 20:24:39 +01:00
Clément Foucault	3f0991266f	Merge branch 'blender-v3.0-release'	2021-11-01 12:15:09 +01:00
Thomas Dinges	5327413b37	Cleanup: Remove Cycles device checks for half float. All supported devices support half float now, so we can remove the check. Differential Revision: https://developer.blender.org/D13021	2021-11-01 10:18:30 +01:00
Brecht Van Lommel	806521f703	Fix T92671: confusing Cycles debug logs about CPU architecture Instead of printing debug flags listing various CPU and GPU settings that may or may not be used, print when we are using them. This include CPU kernel types, OptiX debugging and CUDA and HIP adaptive compilation. BVH type was already printed.	2021-11-01 08:36:50 +01:00
Brecht Van Lommel	949dbb08d2	Cleanup: remove useless WITH_CYCLES_DEVICE_MULTI	2021-10-26 15:37:59 +02:00
Brecht Van Lommel	fd25e883e2	Cycles: remove prefix from source code file names Remove prefix of filenames that is the same as the folder name. This used to help when #includes were using individual files, but now they are always relative to the cycles root directory and so the prefixes are redundant. For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:37:04 +02:00
Brecht Van Lommel	d7d40745fa	Cycles: changes to source code folders structure * Split render/ into scene/ and session/. The scene/ folder now contains the scene and its nodes. The session/ folder contains the render session and associated data structures like drivers and render buffers. * Move top level kernel headers into new folders kernel/camera/, kernel/film/, kernel/light/, kernel/sample/, kernel/util/ * Move integrator related kernel headers into kernel/integrator/ * Move OSL shaders from kernel/shaders/ to kernel/osl/shaders/ For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:36:39 +02:00
Sayak Biswas	d092933abb	Cycles: various fixes for HIP and compilation of HIP binaries * Additional structs added to the hipew loader for device props * Adds hipRTC functions to the loader for future usage * Enables CPU+GPU usage for HIP * Cleanup to the adaptive kernel compilation process * Fix for kernel compilation failures with HIP with latest master Ref T92393, D12958	2021-10-22 12:15:29 +02:00
Brecht Van Lommel	be558d2d97	Fix T92363: OptiX fails with ambient occlusion node, after recent changes This triggered a compiler bug where it does not handle the sub.s16 PTX instruction. Instead refactor the code so we don't need to do uint16_t subtraction at all. Also update OptiX device to remove the AO pass direct callable. Thanks Patrick Mours for figuring this out.	2021-10-21 21:25:34 +02:00
Brecht Van Lommel	df00463764	Cycles: add shadow path compaction for GPU rendering Similar to main path compaction that happens before adding work tiles, this compacts shadow paths before launching kernels that may add shadow paths. Only do it when more than 50% of space is wasted. It's not a clear win in all scenes, some are up to 1.5% slower. Likely caused by different order of scheduling kernels having an unpredictable performance impact. Still feels like compaction is just the right thing to avoid cases where a few shadow paths can hold up a lot of main paths. Differential Revision: https://developer.blender.org/D12944	2021-10-21 15:38:03 +02:00
Brecht Van Lommel	39810b3f51	Cleanup: make HIP and CUDA code more consistent Ref D12834	2021-10-21 13:08:10 +02:00
William Leeson	f0df0e9e07	Fix: Add cast to atof for CYCLES_CONCURRENT_STATES_FACTOR env variable parsing. The conversion from double to float was causing a build failure. Differential Revision: https://developer.blender.org/D12946	2021-10-20 21:01:39 +02:00
Brecht Van Lommel	7d111f4ac2	Cleanup: remove unused code	2021-10-20 18:15:21 +02:00
Brecht Van Lommel	001f548227	Cycles: reduce kernel reserved local memory when not using shader raytracing Ref T87836	2021-10-20 17:50:31 +02:00
Sayak Biswas	ba4e227def	HIP device code cleanup and fix for high VRAM usage This patch cleans up code for HIP device and makes it more consistent with the CUDA code. It also fixes the issue with high VRAM usage on AMD cards using HIP allowing better performance and usage on cards like 6600XT. Added a check in intern/cycles/kernel/bvh/bvh_util.h to prevent compiler error with hipcc Reviewed By: brecht, leesonw Maniphest Tasks: T92124 Differential Revision: https://developer.blender.org/D12834	2021-10-20 14:04:28 +02:00
Brecht Van Lommel	fd77a28031	Cycles: bake transparent shadows for hair These transparent shadows can be expansive to evaluate. Especially on the GPU they can lead to poor occupancy when only some pixels require many kernel launches to trace and evaluate many layers of transparency. Baked transparency allows tracing a single ray in many cases by accumulating the throughput directly in the intersection program without recording hits or evaluating shaders. Transparency is baked at curve vertices and interpolated, for most shaders this will look practically the same as actual shader evaluation. Fixes T91428, performance regression with spring demo file due to transparent hair, and makes it render significantly faster than Blender 2.93. Differential Revision: https://developer.blender.org/D12880	2021-10-19 15:11:09 +02:00
Brecht Van Lommel	1df3b51988	Cycles: replace integrator state argument macros * Rename struct KernelGlobals to struct KernelGlobalsCPU * Add KernelGlobals, IntegratorState and ConstIntegratorState typedefs that every device can define in its own way. * Remove INTEGRATOR_STATE_ARGS and INTEGRATOR_STATE_PASS macros and replace with these new typedefs. * Add explicit state argument to INTEGRATOR_STATE and similar macros In preparation for decoupling main and shadow paths. Differential Revision: https://developer.blender.org/D12888	2021-10-18 19:02:10 +02:00
Brecht Van Lommel	2ba7c3aa65	Cleanup: refactor to make number of channels for shader evaluation variable	2021-10-15 15:42:44 +02:00
Brecht Van Lommel	04857cc8ef	Cycles: fully decouple triangle and curve primitive storage from BVH2 Previously the storage here was optimized to avoid indirections in BVH2 traversal. This helps improve performance a bit, but makes performance and memory usage of Embree and OptiX BVHs a bit worse also. It also adds code complexity in other parts of the code. Now decouple triangle and curve primitive storage from BVH2. * Reduced peak memory usage on all devices * Bit better performance for OptiX and Embree * Bit worse performance for CUDA * Simplified code: Intersection.prim/object now matches ShaderData.prim/object No more offset manipulation for mesh displacement before a BVH is built Remove primitive packing code and flags for Embree and OptiX Curve segments are now stored in a KernelCurve struct * Also happens to fix a bug in baking with incorrect prim/object Fixes T91968, T91770, T91902 Differential Revision: https://developer.blender.org/D12766	2021-10-06 17:52:04 +02:00
Sergey Sharybin	6e268a749f	Fix adaptive sampling artifacts on tile boundaries Implement an overscan support for tiles, so that adaptive sampling can rely on the pixels neighbourhood. Differential Revision: https://developer.blender.org/D12599	2021-10-05 16:19:14 +02:00
Campbell Barton	74f45ed9c5	Cleanup: spelling in comments	2021-10-03 12:13:29 +11:00
Brecht Van Lommel	a754e35198	Cycles: refactor API for GPU display * Split GPUDisplay into two classes. PathTraceDisplay to implement the Cycles side, and DisplayDriver to implement the host application side. The DisplayDriver is now a fully abstract base class, embedded in the PathTraceDisplay. * Move copy_pixels_to_texture implementation out of the host side into the Cycles side, since it can be implemented in terms of the texture buffer mapping. * Move definition of DeviceGraphicsInteropDestination into display driver header, so that we do not need to expose private device headers in the public API. * Add more detailed comments about how the DisplayDriver should be implemented. The "driver" terminology might not be obvious, but is also used in other renderers. Differential Revision: https://developer.blender.org/D12626	2021-09-30 20:48:08 +02:00
Campbell Barton	6dceaafe5a	Cleanup: trailing space, newlines at EOF	2021-09-29 07:30:34 +10:00
Brecht Van Lommel	86ec9d79ec	Fix build without Cycles HIP device	2021-09-28 20:00:55 +02:00
Brian Savery	044a77352f	Cycles: add HIP device support for AMD GPUs NOTE: this feature is not ready for user testing, and not yet enabled in daily builds. It is being merged now for easier collaboration on development. HIP is a heterogenous compute interface allowing C++ code to be executed on GPUs similar to CUDA. It is intended to bring back AMD GPU rendering support on Windows and Linux. https://github.com/ROCm-Developer-Tools/HIP. As of the time of writing, it should compile and run on Linux with existing HIP compilers and driver runtimes. Publicly available compilers and drivers for Windows will come later. See task T91571 for more details on the current status and work remaining to be done. Credits: Sayak Biswas (AMD) Arya Rafii (AMD) Brian Savery (AMD) Differential Revision: https://developer.blender.org/D12578	2021-09-28 19:18:55 +02:00
Patrick Mours	2189dfd6e2	Cycles: Rework OptiX visibility flags handling Before the visibility test against the visibility flags was performed in an any-hit program in OptiX (called `__anyhit__kernel_optix_visibility_test`), which was using the `__prim_visibility` array. This is not entirely correct however, since `__prim_visibility` is filled with the merged visibility flags of all objects that reference that primitive, so if one object uses different visibility flags than another object, but they both are instances of the same geometry, they would appear the same way. The reason that the any-hit program was used rather than the OptiX instance visibility mask is that the latter is currently limited to 8 bits only, which is not sufficient to contain all Cycles visibility flags (12 bits). To mostly fix the problem with multiple instances and different visibility flags, I changed things to use the OptiX instance visibility mask for a subset of the Cycles visibility flags (`PATH_RAY_CAMERA` to `PATH_RAY_VOLUME_SCATTER`, which fit into 8 bits) and only fall back to the visibility test any-hit program if that isn't enough (e.g. the ray visibility mask exceeds 8 bits or when using the built-in curves from OptiX, since the any-hit program is then also used to skip the curve endcaps). This may also improve performance in some cases, since by default OptiX can now perform the normal scene intersection trace calls entirely on RT cores without having to jump back to the SM on every hit to execute the any-hit program. Fixes T89801 Differential Revision: https://developer.blender.org/D12604	2021-09-27 17:12:43 +02:00

1 2 3 4 5 ...

945 Commits