griefith/test

Author	SHA1	Message	Date
Xavier Hallade	e00cc8c100	Cycles: oneAPI: Use default linker on Windows The initial issues that led to the choice of forcing the use of linker.exe seem gone and there is currently no strong reason to use linker.exe explicitly, so let's simplify and use the default setting.	2025-03-28 12:34:16 +01:00
Xavier Hallade	c4cf399755	Cycles: oneAPI: Re-enable -ffast-math The initial limitation preventing from using -ffast-math, worked around in `09df1f4caf`, got fixed upstream in LLVM and the fix is part of current DPC++ compiler: `63ecd2a725` We're now able to go back to using -ffast-math, which helps simplifying the set of compiler flags. No performance nor conformance change is expected from this change (most of the gain is achieved already with the use of -cl-fast-relaxed-math since `284b89a0a3`) and this has been verified on Arc B580 under Windows.	2025-03-27 17:18:30 +01:00
Brecht Van Lommel	15390b9257	License: Change NanoVDB header to Apache 2, following upstream OpenVDB This license was changed upstream, and it's simpler if we can use the same as most of the Cycles code.	2025-03-27 14:48:06 +01:00
Alaska	2e829ca4cf	Fix #136303 : Normalize the normals on the Ambient Occlusion node This commit simply normalizes the normals of the Ambient occlusion node before computing the output to avoid odd behaviour with unnormalized normals. Pull Request: https://projects.blender.org/blender/blender/pulls/136315	2025-03-27 02:58:19 +01:00
Campbell Barton	42ad772a1f	Cleanup: spelling & repeated terms (make check_spelling_*) Also use comment blocks for English text.	2025-03-27 01:13:34 +00:00
Sergey Sharybin	2ab231d802	Refactor: Pass proper KernelGlobals HIP-RT functions do have access to kg, and it was used inconsistently: some functions were passed actual kg, other were passed nullptr. This change makes it consistent and passes kg everywhere. Pull Request: https://projects.blender.org/blender/blender/pulls/136503	2025-03-26 11:07:06 +01:00
Sergey Sharybin	709371b278	Refactor: Avoid creation of local copy of RaySelfPrimitives	2025-03-26 11:07:04 +01:00
Sergey Sharybin	888c7e1df9	Cleanup: Avoid redundant data fetch	2025-03-26 11:07:04 +01:00
Sergey Sharybin	3d882acee2	Cleanup: Else after return	2025-03-26 11:07:04 +01:00
Sergey Sharybin	b2dd523d0d	Cleanup: Avoid default hit initialization The entire object is assigned later on, no need to initialize it.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	323e27d825	Cleanup: Remove redundant assignment The payload stores pointers, no need to restore pointer of the function argument to the same value.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	e92a8042c3	Refactor: Payload for shadow intersection and filter in HIP-RT The code before this change was relying on the ShadowPayload have the same "header" as RayPayload for some of the primitive types (curve, motion triangle, point): intersection functions were shared between "regular" and shadow rays (shadow in this case is shadow_all), but extra filter function was used for shadow rays. This is fragile if someone changes one of these structures. What is worse is that compiler might actually decide to shuffle things in some structs, or remove unused fields. This change also solves confusion about ShadowPayload::prim_type seemingly only being assigned to PRIMITIVE_NONE. With time it is not impossible that compiler will also see this, and constant-fold some checks, or even remove the field. If that happens then the render result will be wrong. Maybe it is already happening as there are some GPU and driver and optimization flag specific bugs in the area. It is unclear whether it was causing any actual problem: W7800 seems to render all hair correctly on Linux.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	cdb3f34944	Cleanup: Use full name for the primitive_type Makes it extra clear locally type of what the variable contains: primitive, ray, or something else.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	72542f3bb4	Cleanup: Follow Blender style and use more const Also make some style decisions more consistent: for example, the way how stop/continue search return value is commented. Prefer lower vertical space for those.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	bf9c95f164	Cleanup: Move payload type cast to caller in HIP-RT Mainly readability purposes: - Having variables called local_payload is ambiguous: does it refer to LocalPayload type or to a variable be local in a function? - Some of the functions are used for different ray types, so having the type case in intersectFunc and filterFunc makes it easier to scan. For the latter: now it is more obvious that Curve_Intersect_Shadow expects RayPayload, but Curve_Filter_Shadow expects ShadowPayload. It might not be a problem currently as ShadowPayload has the same "header" RayPayload, but it might change in the future. Also, compiler might optimize fields out from one but not from the other.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	3daaf21bab	Cleanup: Remove unused function argument in HIP-RT	2025-03-26 11:07:04 +01:00
salipour	ae710101f5	Fix #136138 , #136449 : Cycles HIP RDNA2 white and blue render artifacts There is a known precision bug in the current HIP compiler version (RDNA2 family/Windows) that has already been fixed and will be available in a future HIP SDK release. Enabling more precise math prevents the artifacts. This may cause a 5-10% performance drop in some scenes. Fix #136138: Microfacet BSDF Fix #136449: Hair BSDF Pull Request: https://projects.blender.org/blender/blender/pulls/136341	2025-03-25 18:21:16 +01:00
nubnubbud	5e2afb3f6f	Cycles: Replace bump correction algorithm to better respect normal maps The new correction avoids washed out areas near the shadow terminator, preserving more detail from normal and bump maps. It implements the method from the paper "A Microfacet-Based Shadowing Function to Solve the Bump Terminator Problem" by Alejandro Conty Estevez, Pascal Lecocq, and Clifford Stein. Pull Request: https://projects.blender.org/blender/blender/pulls/135380	2025-03-25 18:01:01 +01:00
Brecht Van Lommel	f987ef7b6e	Shaders: Add Filter Width input to Bump node This makes it possible to restore previous Blender 4.3 behavior of bump mapping, where the large filter width was sometimes (ab)used to get a bevel like effect on stepwise textures. For bump from the displacement socket, filter width remains fixed at 0.1. Ref #133991, #135841 Pull Request: https://projects.blender.org/blender/blender/pulls/136465	2025-03-25 16:29:13 +01:00
Sergey Sharybin	b524d0fe39	Cycles: Disable spatial splits for hair BVH On the user level spatial splits on hair BVH leads to very long build times, without giving too much advantage in the render times. There is also some issues and possibly bugs in the builder which lead to all sort of numerical issues (like divisions by zero). There are also performance issues that comes from the fact that the alignment space is applied every time primitive's aligned bounds are requested. It also seems that the splitting might not be considering aligned space consistently when calculating SAH and performing splits. It does sound like issues we'd get fixed ideally, but the importance of the BVH2 is fading out with the HW-RT becoming more and more popular. This change contains fix needed for the split algorithm to avoid numerical issue reported by UBSAN when rendering the `BVH2 particle simple.blend` from the #126508. Ref #126508 Ref #136245 Pull Request: https://projects.blender.org/blender/blender/pulls/136430	2025-03-24 15:46:39 +01:00
Sergey Sharybin	5ce4e91a80	Fix #136319 : Incorrect transparent bounce count with spatial splits The transparent bounce test was too optimistic in regards to the intersection being considered. The check needs to happen after it has been validated that it is not duplicate. It was already the case for Metal and HIP-RT, but not for Embree and BVH2. Tests updated by: Alaska <Alaskayou01@gmail.com> Pull Request: https://projects.blender.org/blender/blender/pulls/136325	2025-03-22 04:51:42 +01:00
Sergey Sharybin	50180283e9	Fix #117527 : Spatial split leads to artifacts on transparent shadows The reason for this to happen is because when spatial split is used the same intersection could be recorded twice (via different BVH nodes). This change introduces check for the intersection being already recoded, similar to the check in the local BVH. The check is done during BVH intersection which allows to properly ignore intersections even for the maximum bounce number check. A faster approach would be to do such filtering after sorting, but then we can not keep bounce check in the BVH code consistent with and without spatial splits. Intuitively it seems that it should be possible to merge the new loop with the one that checks for which intersection to keep. But it is not so trivial in practice: it doesn't run for all intersections, and also it is formulated in a way that updates isect_index for the next record. Pull Request: https://projects.blender.org/blender/blender/pulls/136251	2025-03-21 13:56:50 +01:00
Campbell Barton	d616c87d03	Cleanup: spelling in comments (make check_spelling_*)	2025-03-21 11:51:50 +11:00
Sergey Sharybin	bf65b64708	Refactor: De-duplicate local intersection reservoir sampling logic The code which was checking whether local intersection is to be recorded, and under which index was duplicated for triangles, motion triangles, and HIP-RT triangle filter function. This change moves the common logic to an utility function which is reused from all the places mentioned above. Pull Request: https://projects.blender.org/blender/blender/pulls/136244	2025-03-20 17:19:31 +01:00
Sergey Sharybin	7165146fb2	Cleanup: More spelling fixes in comments	2025-03-20 10:37:09 +01:00
Sergey Sharybin	ae4f6026dc	Cleanup: Spelling in comments	2025-03-20 10:36:12 +01:00
Philipp Oeser	d6da557358	Fix #135955 : OSL Window texture coordinate wrong for panoramic cameras There is code that properly handles panoramic cameras in `camera_world_to_ndc`, the transform matrices (e.g. `OSLRenderServices::get_inverse_matrix`) in the `transform("NDC", P)` call dont do the "full work" here (maybe they should though?). But we can get to `camera_world_to_ndc` by just getting the "NDC" attribute, so use that for now. Pull Request: https://projects.blender.org/blender/blender/pulls/136097	2025-03-18 09:11:45 +01:00
Brecht Van Lommel	9984adc7de	Merge branch 'blender-v4.4-release'	2025-03-17 16:19:46 +01:00
Brecht Van Lommel	f896f7ffc3	Fix #136047 : Cycles OSL gettextureinfo crash with missing image Missing null pointer check. Pull Request: https://projects.blender.org/blender/blender/pulls/136075	2025-03-17 16:18:31 +01:00
Bastien Montagne	dd98cede18	Merge branch 'blender-v4.4-release'	2025-03-14 18:20:26 +01:00
Sahar A. Kashi	9ad3b74867	Fix: SSS and Motion Blur or Curves not working on HIP-RT This change fixes the remaining failing tests with SSS when using HIP-RT. This includes crash when SSS is used on curves, and objects with motion blur and SSS rendering black. The root cause for both cases was the fact that traversal was always assuming regular BVH (built for triangles), while curves and motion triangles are using custom primitives, which requires specialized BVH traversal. This change includes: - Early output from `scene_intersect_local()` for non-triangle and non-motion-triangle primitives. This fixes `sss_hair.blend` test, and also avoids unnecessary BVH traversal when the local intersection is requested from curve object. The same early-output could be added to other BVH traversal implementation. - Use `hiprtGeomCustomTraversalAnyHitCustomStack` for motion triangles primitives. This fixes motion blur on objects with SSS render black. Fixes #135856 Co-authored-by: Sahar A. Kashi <sahar.alipourkashi@amd.com> Co-authored-by: Sergey Sharybin <sergey@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/135943	2025-03-14 18:17:54 +01:00
Xavier Hallade	0ebce03d41	Cycles: Reorder ShaderData elements to improve cache utilization Profiling on Arc B580 shown that sd->num_closure queries were often stalling. Packing it closer to other often accessed elements within ShaderData (type, flag..) does speedup rendering by ~5% in most scenes. Pull Request: https://projects.blender.org/blender/blender/pulls/135980	2025-03-14 16:11:17 +01:00
Xavier Hallade	3de1a60c6a	Cycles: oneAPI: Force large GRF for shade_surface_* kernels While auto lets the compiler make the right choice for shade_surface kernel when compiling for Battlemage and Lunar Lake, that's not the case for Alchemist and Meteor Lake, so now we force this mode.	2025-03-13 14:22:48 +01:00
Campbell Barton	6ef7dae8ef	Cleanup: spelling in comments (make check_spelling_*)	2025-03-13 13:41:17 +11:00
Sergey Sharybin	977a334f6f	Merge branch 'blender-v4.4-release'	2025-03-12 19:24:01 +01:00
Sergey Sharybin	a3eb0faa3f	Fix: Incorrect ray time used for HIP-RT local intersections It was always hard-coded to be 0. It does not seem to result in any extra tests passing, but they are probably not sophisticated enough. Noticed while looking into details for the #135856. Pull Request: https://projects.blender.org/blender/blender/pulls/135878	2025-03-12 19:23:38 +01:00
Brecht Van Lommel	07b60c189b	Cycles: Perform attribute subdivision on the host side * Add SubdAttributeInterpolation class for linear attribute interpolation. * Dicing computes ptex UV and face ID for interpolation. * Simplify mesh storage of subd primitive counts * Remove kernel code for subd attribute interpolation * Remove patch table packing and upload The old optimization adds a fair amount of complexity to the kernel, affecting performance even when not using the feature. It's also not that useful as it does not work for UVs that needs special interpolation. With this simpler code it should be easier to make it feature complete. Pull Request: https://projects.blender.org/blender/blender/pulls/135681	2025-03-11 20:58:07 +01:00
Brecht Van Lommel	6ec541ca4e	Refactor: Cycles: Remove face normal attribute It's already computed on demand in the kernel, no need to have it host side. Pull Request: https://projects.blender.org/blender/blender/pulls/135681	2025-03-11 20:57:51 +01:00
Sergey Sharybin	32d49541c0	Fix #135572 : Cycles shadow linking through transparency is broken on GPU Make the ray self primitives store and restore reliable for cases when the intersect_shadow kernel is called multiple times: - Light object and primitive are stored in dedicated fields in the state. This adds 2 integers per state. - The self object and primitive are used from the previous intersection when the intersect_shadow is called multiple times. There is more detailed explanation added in the code. The issue was introduced by the light refactor to be objects in #134846. Pull Request: https://projects.blender.org/blender/blender/pulls/135573	2025-03-07 17:16:04 +01:00
Brecht Van Lommel	48398b223b	Cleanup: Fix various divisions by zero reported by ASAN Pull Request: https://projects.blender.org/blender/blender/pulls/135326	2025-03-06 22:34:23 +01:00
Brecht Van Lommel	b75b2e883d	Cleanup: Always use fullsize ShaderData on the CPU This avoids compiler and ASAN warnings. This optimization exist for GPU rendering.	2025-03-06 22:34:22 +01:00
Xavier Hallade	91f332e7c6	Cycles: oneAPI: Force normal GRF for integrator_intersect_* kernels With auto mode, integrator_intersect_subsurface still ended up being compiled in large GRF mode on Intel Arc B580, while normal GRF provides the best performance for this kernel.	2025-03-06 22:24:45 +01:00
Xavier Hallade	d1120c51db	Cycles: oneAPI: Enable automated GRF mode selection The default was large GRF mode for all kernels and normal GRF for intersection kernels. path_array kernels also benefit from normal GRF, being almost 2x faster in this mode, as measured on my Arc B580. This translates to a much smaller 1-3% speedup in overall rendering. Instead of manually adding them to the list of kernels to compile in normal GRF mode, I've switched to auto that provides the same result.	2025-03-06 17:47:19 +01:00
Xavier Hallade	90a10dcd50	Cycles: Adjust inlining attributes for oneAPI device Now ccl_device sets inlining and ccl_device_inline forces inlining. This matches more closely with what is currently done for cuda and metal backends. I've measured from 1% to 6% overall performance improvement in rendering benchmark scenes on Arc B580, as well as a small decrease in compile time.	2025-03-03 18:20:02 +01:00
Alaska	fb7b53143e	Merge branch 'blender-v4.4-release'	2025-02-27 12:03:30 +13:00
Alaska	d840d249b3	Cycles: Re-enable HIPRT point cloud rendering Previously point cloud rendering was disabled on the HIPRT backend due to unexpected performance regressions introduce by it. With the recent update to HIP SDK 6.3 and HIPRT 2.5, these performance regressions have been resolved and so this commit re-enables point cloud rendering on HIPRT. Pull Request: https://projects.blender.org/blender/blender/pulls/134902	2025-02-27 00:01:35 +01:00
Xavier Hallade	a5d8bd2e29	Cycles: Drop inline hint on light_tree_pdf Dropping the inlining hint for `light_tree_pdf` and reverting to the default inlining thresholds for DPC++ compiler gives a ~4% speedup on classroom and other scenes on Arc B580. Pull Request: https://projects.blender.org/blender/blender/pulls/135042	2025-02-26 20:14:05 +01:00
Weizhen Huang	7b3f7ccae0	Merge branch 'blender-v4.4-release'	2025-02-26 15:51:55 +01:00
Lukas Stockner	9254532b8b	Fix #129306 : Cycles: Principled coat doesn't pass furnace test This implements three improvements to the energy preservation and albedo scaling logic, which help the Principled BSDF pass the white-furnace test when using the coat layers at high roughness. Specifically, at roughness 0.3, the albedo scaling brings it from 60% at the edge to 95%, and with the energy preservation it's 99.8%. Pull Request: https://projects.blender.org/blender/blender/pulls/134620	2025-02-26 15:47:21 +01:00
Weizhen Huang	9ee9a2b789	Fix #135145 : object visible to volume scatter when ray visibility is off The comment was added in `e0857ad152`, and volume scatter visibility is supported since `cdd1d5a93c`. Pull Request: https://projects.blender.org/blender/blender/pulls/135168	2025-02-26 15:45:58 +01:00

1 2 3 4 5 ...

3808 Commits