test2

Author	SHA1	Message	Date
Ray Molenkamp	16eb4430f5	Cleanup: CMake: Modernize bf_render dependencies Pretty straightforward - Remove any bf_render paths from INC - Add a dependency though LIB when missing context: https://devtalk.blender.org/t/cmake-cleanup/30260 Pull Request: https://projects.blender.org/blender/blender/pulls/132355	2024-12-26 18:50:53 +01:00
Ray Molenkamp	b7407aabb5	Cleanup: CMake: Modernize bf_gpu dependencies Pretty straightforward - Remove any bf_gpu paths from INC - Add a dependency though LIB when missing context: https://devtalk.blender.org/t/cmake-cleanup/30260 Pull Request: https://projects.blender.org/blender/blender/pulls/132286	2024-12-23 21:38:19 +01:00
Ray Molenkamp	a7c39896c6	Cleanup: CMake: Modernize bf_blenkernel dependencies Pretty straightforward - Remove any bf_blenkernel paths from INC - Add a dependency though LIB when missing context: https://devtalk.blender.org/t/cmake-cleanup/30260 Pull Request: https://projects.blender.org/blender/blender/pulls/132282	2024-12-23 20:08:37 +01:00
Thomas Dinges	1be75e86aa	Cleanup: replace floatX_to_floatY() with make_floatY() Now that function overloads are usable on all GPUs, replace the former explicit functions. Pull Request: https://projects.blender.org/blender/blender/pulls/132067	2024-12-19 09:41:55 +01:00
salipourto	4e5a9c5dfb	Cycles: Handling SDK/ROCm 6+ lack of backward compatibility with pre ROCm 6 This commit introduces proper handling of ROCm 5 and ROCm 6 runtimes on Linux, based on the version of the ROCm compiler used at build time. Previously, HIPEW (the HIP equivalent of Cuda Wrangler) defaulted to loading the ROCm 5 runtime. If ROCm 5 was unavailable, it would attempt to load ROCm 6. However, ROCm 6 introduces changes in certain structures and functions that are not backward compatible, leading to potential issues when kernels compiled with the ROCm 6 compiler are executed on the ROCm 5 runtime. ### Summary of Changes: Separation of Structures and Functions: Structures and functions are now separated into hipew5 and hipew6 to accommodate the differences between ROCm versions. Build-Time Version Detection: The ROCm version is determined during build time, and the corresponding hipew5 or hipew6 is included accordingly. Runtime Default to ROCm 6: By default, HIPEW now loads the ROCm 6 runtime and includes hipew6 (Linux only). JIT Compilation Behavior: Since ROCm 6 is the default version, JIT compilation is supported only when the ROCm 6 compiler is detected at runtime. HIP-RT Update: HIP-RT has been updated to load the ROCm 6 runtime by default. These changes ensure compatibility and stability when switching between ROCm versions, avoiding issues caused by runtime and compiler mismatches. Co-authored-by: Alaska <alaskayou01@gmail.com> Co-authored-by: Sergey Sharybin <sergey@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/130153	2024-12-17 16:19:36 +01:00
Alaska	8e6a981487	Fix #131927 : Cycles: Reduce uncertain light tree traversal in scenes with one distant light When a scene contains distant lights and local lights, the first step of the light tree traversal is to compute the importance of distant lights vs local lights and pick one based on a random number. In the specific case of when there is only one distant light, the line of code that had been changed in this commit effectively reduced to: `min_importance = fast_cosf(x) < cosf(x) ? 0.0 : compute_min_importance` And depending on the hardware, compiler, and the specific value being tested, different configurations could take different code paths. This commit fixes this issue by turning the comparison into `fast_cosf(x) < fast_cosf(x)`. --- Why does `cos_theta_plus_theta_u < cosf(bcone.theta_e - bcone.theta_o)` reduce to `fast_cos(x) < cos(x)` in this specific case? - `cos_theta_plus_theta_u` is computed as `cos_theta * cos_theta_u - sin_theta * sin_theta_u` - `cos_theta` is always 1.0 in the case of a single distant light. - `cos_theta_u` is computed earlier as `fast_cosf(theta_e)` in `distant_light_tree_parameters()` - `sin_theta` is zero, and so that side of the equation doesn't matter. This reduces `cos_theta_plus_theta_u` to `fast_cosf(theta_e)`. `cosf(bcone.theta_e - bcone.theta_o)` reduces to `cosf(bcone.theta_e)` because for the case of a single distant light `theta_o` is always 0. Pull Request: https://projects.blender.org/blender/blender/pulls/131932	2024-12-17 10:51:43 +01:00
Thomas Dinges	22e16ca096	Cycles: add make_float4(float3 a, float b) type This resolves a todo from the code. Part of the Quality Project. Pull Request: https://projects.blender.org/blender/blender/pulls/131915	2024-12-17 09:11:08 +01:00
Alaska	c42894a695	Fix: Various issues with Cycles HIP JIT compilation On Linux, Cycles HIP has a JIT compilation feature. This feature is used when Cycles can not find a precompiled kernel for your GPU. Which is most common when using hardware that wasn't out at the time that a version of Blender was released. There were various issues with this JIT compilation system, this commit aims to solve them. The changes include: - Enable `WITH_NANOVDB` when Blender is built with NanoVDB. - This fixes a issue where VDB objects would not render. - Enable some extra debug options for developers when desired (This is so we match the CUDA implementation of the same feature). - Reduce the optimizaiton level from -O3 to the default. - This is to avoid any extra issues that may occur as a result of an increase optimization level that isn't tested with precompiled kernels. - Reduce the optimization level even further to -O1 for Vega. - This was done on precompiled kernels to work around some issues, so I decided to apply it to JIT kernels as well. - Note: Although Vega is not officially supported, this may help people that unofficially use Vega. - Added some previously missing compiler arguments and fixed errors that were introduced when enabling these compiler arguments. - Fixed a issue where JIT compilation would fail if Blener was installed in a path that had a space in it. Pull Request: https://projects.blender.org/blender/blender/pulls/131853	2024-12-17 01:02:39 +01:00
Aras Pranckevicius	35d7477371	Cycles: fix accuracy issues in fast_sin/fast_cos/fast_sincos Most of these originate from OIIO of about 10 years ago. Integrate the upstream fix from OIIO: https://github.com/AcademySoftwareFoundation/OpenImageIO/commit/88feb65fc992 Cover them with unit tests. Before the fix, fast_sinf(1.57085085f) was returning 0.0 instead of 1.0 as expected. Revert previous hair workaround (`a16879a5f0`) Co-authored-by: Sergey Sharybin <sergey@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/131957	2024-12-16 10:05:47 +01:00
Brecht Van Lommel	edbd95881b	Cleanup: Fix warning from deprecated header with OpenVDB 12 Ref #131833 Pull Request: https://projects.blender.org/blender/blender/pulls/131867	2024-12-15 01:26:00 +01:00
Weizhen Huang	c99b7e66b2	Cycles: support Mie Scattering with particle size smaller than 5um Previous implemenation of 5 < d < 50 was taken from the main paper, fitting for smaller sizes are found in the supplemental. They are less forward-scattering. Pull Request: https://projects.blender.org/blender/blender/pulls/130234	2024-12-13 15:50:54 +01:00
Brecht Van Lommel	433264585f	Cycles: Support building with OpenVDB 12 For the upcoming VFX platform upgrade. Pull Request: https://projects.blender.org/blender/blender/pulls/131833	2024-12-13 15:27:26 +01:00
Weizhen Huang	16132f8c79	Fix #117667 : Remove volume density weight cutoff `CLOSURE_WEIGHT_CUTOFF` avoids allocating a closure when its weight is too small. It makes sense for surface closures, but for volume closures the contribution also depends on the object size/ray length, such a cutoff seems random and is causing problem in atmospheric scatterings. Therefore remove the cutoff for volume, just make sure the weight is positive. Pull Request: https://projects.blender.org/blender/blender/pulls/131696	2024-12-13 10:28:49 +01:00
Weizhen Huang	27fc091be8	Fix #131723 : Cycles volume not sampling channels with zero extinction The original paper uses the single scattering albedo `sigma_s/sigma_t` to pick a channel for sampling the scattering distance. However, this only considers the situation where there is scattering inside the volume. If some channel has an extinction coefficient of zero, the light passes through without attenuation for that channel. We assign such channel with a weight of 1 instead of 0 to make sure it can be sampled. Pull Request: https://projects.blender.org/blender/blender/pulls/131741	2024-12-13 10:27:53 +01:00
Weizhen Huang	a16879a5f0	Fix #131240 : Cycles: Negative integration range in Huang Hair The cause is numerical issues with `fast_sinf()`. While fixing `fast_sinf()` would ultimately fix the problem, it involves more complications in other code paths, and it is safer to clamp the integration range anyway. Pull Request: https://projects.blender.org/blender/blender/pulls/131689	2024-12-10 21:56:04 +01:00
Aras Pranckevicius	322415d5b2	Fix: Build WITH_CYCLES_OSL=OFF failure Code added in `d1796b8df0` was not put under WITH_OSL checks	2024-12-10 09:58:22 +02:00
Lukas Stockner	d1796b8df0	Cycles: Make OSL shader compilation threadsafe and multi-threaded The original OSL Shading System API was stateful: You'd create a shader group, configure it, and then end it. However, this means that only one group can be created at a time. Further, since Cycles reuses the Shading System for multiple instances (e.g. viewport render and material preview), a process-wide mutex is needed. However, for years now OSL has had a better interface, where you explicitly provide the group you refer to. With this, we can not only get rid of the mutex, but actually multi-thread the shader setup even within one instance. Realistically, most time is still spent in the JIT stage, but it's still better than nothing. Pull Request: https://projects.blender.org/blender/blender/pulls/130133	2024-12-09 14:36:35 +01:00
Lukas Stockner	91a3039bb2	Cleanup: Cycles: Make BlenderCamera a class with proper initialization Before, we'd just zero out the memory of the struct and then set the defaults afterwards, but that: - Prevents us from storing non-POD types - Silently assumes that array<float> is safe to zero out (it currently is, but that is still ugly and risky) - Bloats the code since every non-zero entry now needs two lines So, just make use of C++11 here. All the default values that were previously unset are taken from the Blender-side defaults. Pull Request: https://projects.blender.org/blender/blender/pulls/130870	2024-12-06 20:40:24 +01:00
Weizhen Huang	bb3b8d78c2	Refactor: Cycles: split volume integration into smaller functions Pull Request: https://projects.blender.org/blender/blender/pulls/131414	2024-12-06 16:23:04 +01:00
Weizhen Huang	59ad6d2b9c	Refactor: Cycles: Extract code block to check homogeneous volume into a function	2024-12-06 16:23:00 +01:00
Weizhen Huang	13fb28581b	Refactor: Cycles: Share function between volume scattering and shadowing	2024-12-06 16:23:00 +01:00
Weizhen Huang	910c2e2ba6	Refactor: Cycles: Add helper struct for stepping through the volume	2024-12-06 16:23:00 +01:00
Weizhen Huang	b16cfd2a87	Cleanup: Cycles: remove unused parameter `absorption_only` `!vstate.absorption_only` is always false	2024-12-06 16:23:00 +01:00
Michael Jones	8fe2e37dd0	Fix #130641 : MetalRT: Motion Blur (render errors) This PR fixes #130641. The bug was caused by a missing self-object constraint when performing SSS on motion blur scenes. scene_intersect_local tests were erroneously hitting other objects, and out of range primitive IDs were causing spurious downstream behavior. Pull Request: https://projects.blender.org/blender/blender/pulls/131156	2024-12-03 20:24:36 +01:00
Weizhen Huang	e2d7681fe6	Cleanup: Cycles: remove unused `ccl_loop_no_unroll` Was added in `6121c28501` to ensure compiling on OpenCL, now the definition is empty on all platforms Pull Request: https://projects.blender.org/blender/blender/pulls/131100	2024-11-28 16:37:01 +01:00
Weizhen Huang	aa09169e0a	Cleanup: Cycles: remove unused parameter `skip_phase` in volume This logic is copied from surface shader, so that the sampled closure does not need to be evaluated twice when summing all the closures, but it is not used in volume.	2024-11-28 15:56:28 +01:00
Lukas Stockner	0de1cea5c5	Cycles: Use fused OptiX OSL programs Based on #123377 by @brecht, but Gitea doesn't like the rebase these so here's a new PR. The purpose here is to switch to fused OptiX programs for OSL execution on CUDA. On the one hand, this makes the code easier since, but there's also another advantage - how memory allocation is managed. OSL shaders need memory to store intermediate values, but how much is needed depends on the complexity of the shader. With the split program approach, Cycles had to provide that memory, so we had to allocate a certain amount (2 KiB, to be precise) statically and show an error if the shader would need more. If the shader used less (which is the case for the vast majority), the memory was just wasted. By switching to fused kernels, OSL knows the required amount during JIT codegen, so it can allocate only what's required, which avoids this waste. One still needs to set a maximum, and in theory, OSL would also support spilling over into a Cycles-provided alternative memory region. However, we currently don't implement that - instead, we default to the same 2048 limit as before and let advanced users override it via the CYCLES_OSL_GROUPDATA_ALLOC environment variable if really needed. Co-authored-by: Brecht Van Lommel <brecht@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/130149	2024-11-26 23:58:32 +01:00
YimingWu	6d7eecb1c4	Fix #130799 : Prevent cycles material template to show in Grease Pencil The `poll()` function for `CYCLES_PT_context_material` was using the legacy `GPENCIL` identifier as opposed to `GREASEPENCIL`. This caused duplicated material templates to show in the material tab. Pull Request: https://projects.blender.org/blender/blender/pulls/130962	2024-11-26 12:18:37 +01:00
Lukas Stockner	0fc58f0aba	Cycles: Disable -fassociative-math on Linux to avoid GGX issues This is the safer fix for now, we can revisit this later.	2024-11-26 02:28:09 +01:00
Thomas Dinges	5ddf8a6495	Merge branch 'blender-v4.3-release'	2024-11-18 19:14:10 +01:00
Lukas Stockner	1a40efbded	Fix #130389 : Cycles: Numerical issues in GGX_D with associative math flag Turns out that with `-fassociative-math`, GCC turns `(1.0f - cos_NH2) + alpha2 * cos_NH2` into `cos_NH2 * (alpha2 - 1.0f) + 1.0f`. Not sure why since the operation count is the same, but if alpha2 is very small, `alpha2 - 1.0f` will be exactly -1.0f, which then causes issues. Luckily, having one_minus_cos_NH2 as its own variable appears to be enough to make GCC keep the original formulation. Just to be safe, I've also used one_minus_cos_NH2 in the other branch to hopefully reduce the chance of it being folded in again. Also turns a division into a reciprocal, which is in theory slightly faster. Pull Request: https://projects.blender.org/blender/blender/pulls/130469	2024-11-18 19:12:22 +01:00
Patrick Mours	6f0ed29378	Cycles: Add OptiX 8.1 support The function table symbol declared in the headers was renamed starting in OptiX 8.1, from `g_optixFunctionTable` to `g_optixFunctionTable_<ABI version>`. This adds support for that by using the new macro for the name when available (after OptiX 8.1) and falling back to the old name when it is not (before OptiX 8.1). Pull Request: https://projects.blender.org/blender/blender/pulls/130451	2024-11-18 17:20:49 +01:00
Falk David	1d571a810f	Fix: Cycles: Compiler warning The `ProjectionTransform` object has no trivial copy-assignment constructor. This results in the following warning on `gcc (Ubuntu 13.2.0-23ubuntu4) 13.2.0`: ``` /.../blender-git/blender/intern/cycles/kernel/../util/projection.h: In function ‘ccl::ProjectionTransform ccl::projection_inverse(ProjectionTransform)’: /.../blender-git/blender/intern/cycles/kernel/../util/projection.h:219:9: warning: ‘void* memcpy(void, const void, size_t)’ writing to an object of type ‘ccl::ProjectionTransform’ {aka ‘struct ccl::ProjectionTransform’} with no trivial copy-assignment; use copy-assignment or copy-initialization instead [-Wclass-memaccess] 219 \| memcpy(&tfmR, R, sizeof(R)); \| ~~~~~~^~~~~~~~~~~~~~~~~~~~~ /.../blender-git/blender/intern/cycles/kernel/../util/projection.h:67:16: note: ‘ccl::ProjectionTransform’ {aka ‘struct ccl::ProjectionTransform’} declared here 67 \| typedef struct ProjectionTransform { \| ^~~~~~~~~~~~~~~~~~~ ``` To fix the warning, cast the pointer to `(void *)`. Pull Request: https://projects.blender.org/blender/blender/pulls/130321	2024-11-15 15:24:49 +01:00
Nikita Sirgienko	2aa9203f2f	Cycles: Reintroduce noinline keyword for oneAPI device In `891d71a4d4` this keyword was dropped due to performance regression after `fdc2962beb`, but currently code does not experience this performance degradation, and in fact there is minor performance improvement on Lunar Lake GPUs, along with an expected improvement in compile time. However, this change brings a minor performance regression to shade_surface kernel on Intel Arc and Meteor Lake GPUs, which will be solved later by disabling this keyword for these platforms only. Pull Request: https://projects.blender.org/blender/blender/pulls/130299	2024-11-15 12:09:37 +01:00
Bastien Montagne	b325142d17	Merge branch 'blender-v4.3-release'	2024-11-12 16:55:40 +01:00
Bastien Montagne	0b3a7cbe69	Cleanup: Move `BKE_image.h` and related headers to C++. NOTE: This also required some changes to Cycles code itself, who is now directly including `BKE_image.hh` instead of declaring a few prototypes of these functions in its `blender/utils.h` header (due to C++ functions names mangling, this was not working anymore). Pull Request: https://projects.blender.org/blender/blender/pulls/130174	2024-11-12 16:53:54 +01:00
weizhen	9488375049	Fix: Cycles: Missing inclusion in CMakeLists	2024-11-12 15:02:59 +01:00
weizhen	43187cf174	Fix: Cycles: Compile error on GPU Missing function qualifiers. Oversight of `93a34b1077`	2024-11-12 15:02:59 +01:00
Weizhen Huang	93a34b1077	Refactor: Cycles: add helper struct `Interval` To improve readability Pull Request: https://projects.blender.org/blender/blender/pulls/130156	2024-11-12 12:06:09 +01:00
Weizhen Huang	675e8173fa	Refactor: Cycles: separate volume stack and single entry evaluation So that volume shader evaluation does not rely on the volume stack. In the future this is useful for baking the density when building the volume Octree. Pull Request: https://projects.blender.org/blender/blender/pulls/130157	2024-11-12 12:05:37 +01:00
Weizhen Huang	90ed91dfdb	Cleanup: Cycles: Add `Kernel` prefix to light tree bounding shapes BoundingBox -> KernelBoundingBox BoundingCone -> KernelBoundingCone Pull Request: https://projects.blender.org/blender/blender/pulls/130141	2024-11-11 17:13:55 +01:00
Weizhen Huang	ec3128ee37	Cleanup: Cycles: Remove unused functions Wasn't used even when they were added	2024-11-11 16:42:50 +01:00
Weizhen Huang	e9593a6619	Cleanup: Cycles: update light tree paper link The original one was expired	2024-11-11 15:46:52 +01:00
Sergey Sharybin	40ba18c4cd	Merge branch 'blender-v4.3-release'	2024-11-08 17:22:02 +01:00
Sergey Sharybin	e5de274faf	Fix: Cycles HIP-RT compilation happens in parallel with CUDA This was an oversight in #129945: the cycles_kernel_hiprt was not handled in the original code, hence it was missing in the change. Pull Request: https://projects.blender.org/blender/blender/pulls/130037	2024-11-08 17:21:20 +01:00
Sergey Sharybin	9abf9b15f4	Merge branch 'blender-v4.3-release'	2024-11-08 11:06:11 +01:00
Sergey Sharybin	f58522fc10	Cycles: Tweak scheduling of GPU kernel compilation This change makes it so only kernels of the same vendor are compiled in parallel. For example for the release builds it will be: 1. All CUDA kernels 2. All OptiX kernels 3. All HIP kernels 4. All OneAPI kernels This potentially leads to a lower CPU utilization, but it makes it much easier to manage memory usage and tweak per-vendor concurrency. The goal of this change is to solve occasional out-of-memory during the GPU kernels compilation step on the CI/CD farm. This change also includes tweaks to the prallel jobs for HIP-RT and oneAPI. The tweak is based on measuring apparent memory usage peak on Linux when doing single-thread compilation, and giving some safe margin from the available memory on the buildbot. Pull Request: https://projects.blender.org/blender/blender/pulls/129945	2024-11-08 11:05:38 +01:00
Patrick Mours	d0dd587b60	Fix #108372 : GPU implementation of OSL matrix intrinsic functions All the OSL matrix functions had been implemented using the `Transform` utility of Cycles, but that's built around a 4x3 matrix, when the OSL matrix functions are working with 4x4 matrices. This resulted in them not producing results consistent with the CPU implementation. This fixes that by making use of the `ProjectionTransform` utility of Cycles instead, because it's built around a 4x4 matrix. Since matrix inversion is required, I had to make a few more utility functions available on the GPU (except Metal, due to use of references/pointers without specification) that were previously CPU-only. Co-authored-by: Brecht Van Lommel <brecht@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/110102	2024-11-04 17:59:29 +01:00
Sergey Sharybin	aec4ba39b9	Merge branch 'blender-v4.3-release'	2024-11-04 17:54:52 +01:00
Michael Jones	d1368883ed	Cycles: MetalRT: Fix logic bug when deciding if HW RT should be used Don't try to use MetalRT by default unless the device explicitly reports that RT is supported. We shouldn't just rely on an assumption that it's supported for M3 and beyond, ad infinitum. Pull Request: https://projects.blender.org/blender/blender/pulls/129688	2024-11-04 17:54:12 +01:00

1 2 3 4 5 ...

8806 Commits