griefith/test

Author	SHA1	Message	Date
Philipp Oeser	d6da557358	Fix #135955 : OSL Window texture coordinate wrong for panoramic cameras There is code that properly handles panoramic cameras in `camera_world_to_ndc`, the transform matrices (e.g. `OSLRenderServices::get_inverse_matrix`) in the `transform("NDC", P)` call dont do the "full work" here (maybe they should though?). But we can get to `camera_world_to_ndc` by just getting the "NDC" attribute, so use that for now. Pull Request: https://projects.blender.org/blender/blender/pulls/136097	2025-03-18 09:11:45 +01:00
Brecht Van Lommel	9984adc7de	Merge branch 'blender-v4.4-release'	2025-03-17 16:19:46 +01:00
Brecht Van Lommel	f896f7ffc3	Fix #136047 : Cycles OSL gettextureinfo crash with missing image Missing null pointer check. Pull Request: https://projects.blender.org/blender/blender/pulls/136075	2025-03-17 16:18:31 +01:00
Bastien Montagne	dd98cede18	Merge branch 'blender-v4.4-release'	2025-03-14 18:20:26 +01:00
Sahar A. Kashi	9ad3b74867	Fix: SSS and Motion Blur or Curves not working on HIP-RT This change fixes the remaining failing tests with SSS when using HIP-RT. This includes crash when SSS is used on curves, and objects with motion blur and SSS rendering black. The root cause for both cases was the fact that traversal was always assuming regular BVH (built for triangles), while curves and motion triangles are using custom primitives, which requires specialized BVH traversal. This change includes: - Early output from `scene_intersect_local()` for non-triangle and non-motion-triangle primitives. This fixes `sss_hair.blend` test, and also avoids unnecessary BVH traversal when the local intersection is requested from curve object. The same early-output could be added to other BVH traversal implementation. - Use `hiprtGeomCustomTraversalAnyHitCustomStack` for motion triangles primitives. This fixes motion blur on objects with SSS render black. Fixes #135856 Co-authored-by: Sahar A. Kashi <sahar.alipourkashi@amd.com> Co-authored-by: Sergey Sharybin <sergey@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/135943	2025-03-14 18:17:54 +01:00
Xavier Hallade	0ebce03d41	Cycles: Reorder ShaderData elements to improve cache utilization Profiling on Arc B580 shown that sd->num_closure queries were often stalling. Packing it closer to other often accessed elements within ShaderData (type, flag..) does speedup rendering by ~5% in most scenes. Pull Request: https://projects.blender.org/blender/blender/pulls/135980	2025-03-14 16:11:17 +01:00
Xavier Hallade	3de1a60c6a	Cycles: oneAPI: Force large GRF for shade_surface_* kernels While auto lets the compiler make the right choice for shade_surface kernel when compiling for Battlemage and Lunar Lake, that's not the case for Alchemist and Meteor Lake, so now we force this mode.	2025-03-13 14:22:48 +01:00
Campbell Barton	6ef7dae8ef	Cleanup: spelling in comments (make check_spelling_*)	2025-03-13 13:41:17 +11:00
Sergey Sharybin	977a334f6f	Merge branch 'blender-v4.4-release'	2025-03-12 19:24:01 +01:00
Sergey Sharybin	a3eb0faa3f	Fix: Incorrect ray time used for HIP-RT local intersections It was always hard-coded to be 0. It does not seem to result in any extra tests passing, but they are probably not sophisticated enough. Noticed while looking into details for the #135856. Pull Request: https://projects.blender.org/blender/blender/pulls/135878	2025-03-12 19:23:38 +01:00
Brecht Van Lommel	07b60c189b	Cycles: Perform attribute subdivision on the host side * Add SubdAttributeInterpolation class for linear attribute interpolation. * Dicing computes ptex UV and face ID for interpolation. * Simplify mesh storage of subd primitive counts * Remove kernel code for subd attribute interpolation * Remove patch table packing and upload The old optimization adds a fair amount of complexity to the kernel, affecting performance even when not using the feature. It's also not that useful as it does not work for UVs that needs special interpolation. With this simpler code it should be easier to make it feature complete. Pull Request: https://projects.blender.org/blender/blender/pulls/135681	2025-03-11 20:58:07 +01:00
Brecht Van Lommel	6ec541ca4e	Refactor: Cycles: Remove face normal attribute It's already computed on demand in the kernel, no need to have it host side. Pull Request: https://projects.blender.org/blender/blender/pulls/135681	2025-03-11 20:57:51 +01:00
Sergey Sharybin	32d49541c0	Fix #135572 : Cycles shadow linking through transparency is broken on GPU Make the ray self primitives store and restore reliable for cases when the intersect_shadow kernel is called multiple times: - Light object and primitive are stored in dedicated fields in the state. This adds 2 integers per state. - The self object and primitive are used from the previous intersection when the intersect_shadow is called multiple times. There is more detailed explanation added in the code. The issue was introduced by the light refactor to be objects in #134846. Pull Request: https://projects.blender.org/blender/blender/pulls/135573	2025-03-07 17:16:04 +01:00
Brecht Van Lommel	48398b223b	Cleanup: Fix various divisions by zero reported by ASAN Pull Request: https://projects.blender.org/blender/blender/pulls/135326	2025-03-06 22:34:23 +01:00
Brecht Van Lommel	b75b2e883d	Cleanup: Always use fullsize ShaderData on the CPU This avoids compiler and ASAN warnings. This optimization exist for GPU rendering.	2025-03-06 22:34:22 +01:00
Xavier Hallade	91f332e7c6	Cycles: oneAPI: Force normal GRF for integrator_intersect_* kernels With auto mode, integrator_intersect_subsurface still ended up being compiled in large GRF mode on Intel Arc B580, while normal GRF provides the best performance for this kernel.	2025-03-06 22:24:45 +01:00
Xavier Hallade	d1120c51db	Cycles: oneAPI: Enable automated GRF mode selection The default was large GRF mode for all kernels and normal GRF for intersection kernels. path_array kernels also benefit from normal GRF, being almost 2x faster in this mode, as measured on my Arc B580. This translates to a much smaller 1-3% speedup in overall rendering. Instead of manually adding them to the list of kernels to compile in normal GRF mode, I've switched to auto that provides the same result.	2025-03-06 17:47:19 +01:00
Xavier Hallade	90a10dcd50	Cycles: Adjust inlining attributes for oneAPI device Now ccl_device sets inlining and ccl_device_inline forces inlining. This matches more closely with what is currently done for cuda and metal backends. I've measured from 1% to 6% overall performance improvement in rendering benchmark scenes on Arc B580, as well as a small decrease in compile time.	2025-03-03 18:20:02 +01:00
Alaska	fb7b53143e	Merge branch 'blender-v4.4-release'	2025-02-27 12:03:30 +13:00
Alaska	d840d249b3	Cycles: Re-enable HIPRT point cloud rendering Previously point cloud rendering was disabled on the HIPRT backend due to unexpected performance regressions introduce by it. With the recent update to HIP SDK 6.3 and HIPRT 2.5, these performance regressions have been resolved and so this commit re-enables point cloud rendering on HIPRT. Pull Request: https://projects.blender.org/blender/blender/pulls/134902	2025-02-27 00:01:35 +01:00
Xavier Hallade	a5d8bd2e29	Cycles: Drop inline hint on light_tree_pdf Dropping the inlining hint for `light_tree_pdf` and reverting to the default inlining thresholds for DPC++ compiler gives a ~4% speedup on classroom and other scenes on Arc B580. Pull Request: https://projects.blender.org/blender/blender/pulls/135042	2025-02-26 20:14:05 +01:00
Weizhen Huang	7b3f7ccae0	Merge branch 'blender-v4.4-release'	2025-02-26 15:51:55 +01:00
Lukas Stockner	9254532b8b	Fix #129306 : Cycles: Principled coat doesn't pass furnace test This implements three improvements to the energy preservation and albedo scaling logic, which help the Principled BSDF pass the white-furnace test when using the coat layers at high roughness. Specifically, at roughness 0.3, the albedo scaling brings it from 60% at the edge to 95%, and with the energy preservation it's 99.8%. Pull Request: https://projects.blender.org/blender/blender/pulls/134620	2025-02-26 15:47:21 +01:00
Weizhen Huang	9ee9a2b789	Fix #135145 : object visible to volume scatter when ray visibility is off The comment was added in `e0857ad152`, and volume scatter visibility is supported since `cdd1d5a93c`. Pull Request: https://projects.blender.org/blender/blender/pulls/135168	2025-02-26 15:45:58 +01:00
Brecht Van Lommel	655ccf1d7a	Fix #135086 : Light ray visibility not taken into account without light tree Re-enable tests that were temporarily disabled for this. Caused by #134846, `e813e46327`	2025-02-25 17:24:04 +01:00
Brecht Van Lommel	e813e46327	Cycles: Refactor lights to be objects This is an intermediate steps towards making lights actual geometry. Light is now a subclass of Geometry, which simplifies some code. The geometry is not added to the BVH yet, which would be the next step and improve light intersection performance with many lights. This makes object attributes work on lights. Co-authored-by: Lukas Stockner <lukas@lukasstockner.de> Pull Request: https://projects.blender.org/blender/blender/pulls/134846	2025-02-24 23:44:14 +01:00
Lukas Stockner	8cb5e05c48	Cleanup: Cycles: Deduplicate kernel attribute code using templating The attribute handling code in the kernel is currently highly duplicated since it needs to handle five different data types and we couldn't use templates back then. We can now, so might as well make use of it and get rid of ~1000 lines. There are also some small fixes for the GPU OSL code: - Wrong derivative for .w component when converting float2/float3->float4 - Different conversion for float2->float (CPU averages, GPU used to take .x) - Removed useless code for converting to float2, not used by OSL Pull Request: https://projects.blender.org/blender/blender/pulls/134694	2025-02-20 19:28:45 +01:00
Sergey Sharybin	c114ac97be	Merge branch 'blender-v4.4-release'	2025-02-20 17:36:37 +01:00
Sahar A. Kashi	6363181af9	Cycles: HIP-RT 2.5 integration and gfx12 support This change brings the following improvements on the user level - Support of GPUs with gfx12 architecture - New HIP-RT library which in addition to the gfx12 support brings various bug-fixes. The known limitation of gfx12 is that OpenImageDenoiser does not yet support this GPU architecture. This means that while Cycles will use the full advantage of the gfx12 (including hardware accelerated ray-tracing), denoising will only be possible on CPU, or secondary gfx11 or below GPU. This is something that requires a change in OIDN and it is to late to do it for Blender 4.4, but it is something to look forward for Blender 4.5. The gfx12 changes for the pre-compiled kernels is rather trivial, so it comes together (in the same PR) as the bigger HIP-RT change. On the development side this change brings the following improvements: - One step compile and link (much simpler CMake rules) - Embedding BVH binaries in hiprt dll (which makes it easier to package and load, without relying on special path configuration) Co-authored-by: Sahar Kashi <sahar.kashi@amd.com> Co-authored-by: Sergey Sharybin <sergey@blender.org> Co-authored-by: Brecht Van Lommel <brecht@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/133129	2025-02-20 17:34:14 +01:00
Lukas Stockner	a6a183ab4d	Merge branch 'blender-v4.4-release'	2025-02-15 19:56:23 +01:00
Lukas Stockner	0177537c9a	Fix #129279 : Cycles: Matrix-typed attributes not working in OptiX OSL The issue here is that the individual components of TypeDesc are 8-bit each, but the code was applying a 4-bit mask. It just so happens that the only case where this matters is TypeDesc::MATRIX44, since its value is 16.	2025-02-15 19:31:07 +01:00
Alaska	870f92c6f9	Merge branch 'blender-v4.4-release'	2025-02-15 13:25:13 +13:00
Alaska	bcec81c4be	Fix: Overflow when using light falloff node on distant lights In Cycles lights can be given a light falloff node to control their light falloff. This worked by multiplying the light's strength by different combinations of the ray length, which would be FLT_MAX for distant lights. This resulting in almost every configuration of the light falloff node overflowing when used on distant lights, which is undesirable. This commit fixes this issue by ignoring most of the functions of the light falloff node when used on a distant light. And in the process fixes a small discrepancy between SVM and OSL when using the light falloff node on distant lights. Pull Request: https://projects.blender.org/blender/blender/pulls/134539	2025-02-15 01:23:12 +01:00
Alaska	212d22e039	Merge branch 'blender-v4.4-release'	2025-02-14 17:10:41 +13:00
Alaska	fee2f10208	Fix: Incorrect ray depth on emission in Cycles OSL This commit fixes a issue where ray depth for emissive objects (E.g. Lights) was incorrect when using the ray depth output of the light path node in Cycles OSL. Pull Request: https://projects.blender.org/blender/blender/pulls/134496	2025-02-14 05:09:08 +01:00
Brecht Van Lommel	68a510fe9b	Merge branch 'blender-v4.4-release'	2025-02-12 21:50:28 +01:00
Nikita Sirgienko	2bab4ae370	Cycles: oneAPI: Optimize texture access by using GPU HW sampler The current usage of software-based texture operations in the oneAPI implementation puts additional register pressure on the GPU compiler during register allocation. And it also creates code that requires maintenance. This commit is intended to address this situation by utilizing a recently productized SYCL bindless texture API to enable HW-based texture operations using Intel GPUs' hardware sampler. This currently translates to 1-11% rendering speedups (scene-specific) on my Arc A770 and Arc B580. At the moment, there are small performance regressions with NanoVDB texture operations on Arc B580 and small performance regressions in shade surface MNEE and Raytrace kernels on Arc A770, but they look recoverable and will be handled in the future. Pull Request: https://projects.blender.org/blender/blender/pulls/133457	2025-02-12 21:47:34 +01:00
Nikita Sirgienko	a0b7ad436b	Cleanup: Cycles: oneAPI: Switch to non-experimental work item API There is now a non-experimental API for this_work_item functionality, so let's use it for better code quality and also to avoid the deprecation warning during compilation. No functional or performance changes are expected. Pull Request: https://projects.blender.org/blender/blender/pulls/133472	2025-02-12 21:46:22 +01:00
Xavier Hallade	89f70855e0	Cycles: oneAPI: Enable kernels compression This is done using zstd with compression level 19 so it's aligned with the other backends that had it enabled in `4bde68cdd6`. Pull Request: https://projects.blender.org/blender/blender/pulls/134049	2025-02-12 21:45:16 +01:00
Brecht Van Lommel	ff6ec94226	Merge branch 'blender-v4.4-release'	2025-02-10 19:56:17 +01:00
Brecht Van Lommel	f63c192543	Fix: Cycles OptiX with OSL 1.14 failure in some render tests	2025-02-10 19:34:24 +01:00
Brecht Van Lommel	2f44765bc8	Merge branch 'blender-v4.4-release'	2025-02-10 16:52:04 +01:00
Brecht Van Lommel	d9ea0e0a29	Fix: Cycles oneAPI kernel not rebuilding when the compiler is updated This is not a complete solution because there may be indirect changes to the compiler other than the binary that require a rebuild, but this should catch the simple cases at least.	2025-02-10 16:41:10 +01:00
Weizhen Huang	b76fbb285e	Cycles: Change the integration measure in Huang Hair from `gamma` to `h` To align better with the pixel and reduce the samples needed. The paper was using gamma because the jacobian \|d_gamma/d_h\| approaches infinity at the boundaries, but it seems that clamping at 0.999 is enough for numerical stability. In practice I did not notice a change in the noise level, but it simplifies the range computation and renders faster due to reduced sample amount. Co-authored-by: Olivier Maury <omaury@meta.com> Ref: !129616 Pull Request: https://projects.blender.org/blender/blender/pulls/134130	2025-02-10 14:58:26 +01:00
Sergey Sharybin	4e9f85b76f	Merge branch 'blender-v4.4-release'	2025-02-10 14:55:57 +01:00
Patrick Mours	5810c94f95	Cycles: Add Blackwell to Cycles CUDA binaries architectures Enables building of a Cubin for GPUs based on Blackwell architecture if CUDA toolkit version 12.8 or higher is installed. Only added sm_120 to the default set, since it is the one relevant for consumer GPUs (RTX 5090 etc.) that are generally used with Blender. Pull Request: https://projects.blender.org/blender/blender/pulls/134170	2025-02-10 14:55:28 +01:00
Campbell Barton	e659e87a67	Cleanup: cmake formatting, sort file lists	2025-02-10 19:40:06 +11:00
Alaska	a32b8b546d	Merge branch 'blender-v4.4-release'	2025-02-08 01:25:00 +13:00
Alaska	3791001b1c	Fix: Object info random on lights do not work in Cycles OSL This commit adds the `light:random` attribute to OSL, allowing the object info node to now match between SVM and OSL when using the random output on a light. Pull Request: https://projects.blender.org/blender/blender/pulls/134095	2025-02-07 13:23:51 +01:00
weizhen	19fa8e5e77	Merge branch 'blender-v4.4-release'	2025-02-06 18:44:04 +01:00

1 2 3 4 5 ...

3782 Commits