test2

Author	SHA1	Message	Date
Xavier Hallade	9e9baa9085	Cycles: Upgrade to new Embree 4 while staying compatible with Embree 3 For more information about Embree 3->4 API changes: https://github.com/embree/embree/blob/master/doc/src/api.md#upgrading-from-embree-3-to-embree-4 This is not yet enabling HW RT on Arc GPUs using Embree, which is worked on in https://projects.blender.org/blender/blender/pulls/106266 Co-authored-by: Nikita Sirgienko <nikita.sirgienko@intel.com> Co-authored-by: Stefan Werner <stefan.werner@intel.com> Pull Request: https://projects.blender.org/blender/blender/pulls/105974	2023-04-05 11:03:06 +02:00
Michael Jones	5f61eca7af	Cycles: Exploit non-uniform threadgroup sizes on Metal This patch replaces `dispatchThreadgroups` with `dispatchThreads` which takes care of non-uniform threadgroup bounds. This allows us to remove the bounds guards in the integrator kernel entry points. Pull Request: https://projects.blender.org/blender/blender/pulls/106217	2023-03-29 21:46:11 +02:00
Michael Jones	944a5854c6	Cycles: Fix MetalRT shadow all hit bug This patch fixes a MetalRT issue where viable shadow hits are discounted based on the false assumption that hits are ordered by distance. With this patch, the following unit tests now pass: - openvdb smoke - shadow catcher pt transparent lamp only 0.8 - shadow catcher pt transparent lamp only 1.0 Pull Request: https://projects.blender.org/blender/blender/pulls/106276	2023-03-29 20:20:07 +02:00
Nikita Sirgienko	7ee0bf671e	Cycles: use 8-bit type for number of ray hits when possible INTEGRATOR_SHADOW_ISECT_SIZE is lower than 256 for GPUs, which allows using only an 8-bit type for storing intersection counts there.	2023-03-15 22:01:48 +01:00
Nikita Sirgienko	f9922b7074	Cycles: Use ray->tfar in Embree filter functions This allows telling Embree to stop intersecting beyond that distance once the maximum number of hits is reached.	2023-03-15 22:01:48 +01:00
Nikita Sirgienko	1a580dbfdd	Cycles: Use IntegratorShadowState directly in Embree filter functions	2023-03-15 22:01:48 +01:00
Nikita Sirgienko	b97a6daa9a	Cycles: Use geometryUserPtr from Embree filter functions arguments This saves calls to rtcGetGeometryUserData.	2023-03-15 22:01:47 +01:00
Julian Eisel	30e517c3ca	Merge branch 'blender-v3.5-release'	2023-03-15 13:07:26 +01:00
Michael Jones	089e8a1887	Cycles: Fix Metal API validation error (use uint instead of ushort) This PR fixes an error that is given when Metal API validation is enabled. The compute grid can exceed 65536 threads so `ushort` is not sufficient for `metal_grid_id [[threadgroup_position_in_grid]]`. This PR also fixes OS version warnings ([Cycles Metal: Unguarded access to newer macOS features #105630](https://projects.blender.org/blender/blender/issues/105630)) Pull Request: https://projects.blender.org/blender/blender/pulls/105763	2023-03-14 22:05:55 +01:00
Brecht Van Lommel	9eee008691	Fix Cycles oneAPI build error due to conflicting CONSTANT define	2023-03-06 00:13:21 +01:00
William Leeson	6c03339e48	Cycles: reduce mesh memory usage by unflattening To improve mesh upload speeds and reduce the size of the scene data, which allows larger scenes to be rendered. The meshes in Cycles are currently stored as flattened meshes, where each triangle is stored as a set of 3 vertices. Flattening writes out the vertices in a list according to the index buffer. This uses a lot of memory and for current hardware does not provide a noticeable benefit. This change unflattens the mesh by using the mesh's vertex and index buffers directly and skipping the flattening. This allows for larger scenes and reduces the size of the meshes. It also decreases the amount of time it takes to upload the data to a GPU, which is especially important when multiple GPUs are used in a single machine. Pull Request #105173	2023-02-27 10:39:19 +01:00
Brecht Van Lommel	02c2970983	Cycles: add NanoVDB support for Metal on Apple Silicon Contributed by Yulia Kuznetcova at Apple. NanoVDB is patched to add the address spaces required by Metal. We hope that in the future Metal will support the generic address space. For AMD and Intel this is currently not available since it causes a performance regression even on scenes without volumes. Pull Request #104837	2023-02-21 15:03:52 +01:00
Campbell Barton	91346755ce	Cleanup: use '#' prefix for issues instead of 'T' Match the convention from Gitea instead of Phabricator's T for tasks.	2023-02-12 14:56:05 +11:00
Michael Jones	2d994de77c	Cycles: MetalRT optimisation for subsurface intersection queries This patch optimises subsurface intersection queries on MetalRT. Currently intersect_local traverses from the scene root, retrospectively discarding all non-local hits. Using a lookup of bottom level acceleration structures, we can explicitly query only the relevant instance. On M1 Max, with MetalRT selected, this can give a render speedup of 15-20% for scenes like Monster which make heavy use of subsurface scattering. Patch authored by Marco Giordano. Reviewed By: brecht Differential Revision: https://developer.blender.org/D17153	2023-02-06 19:12:29 +00:00
Brecht Van Lommel	773a36d2f8	Fix Cycles OneAPI build error after recent changes	2023-02-06 15:36:49 +01:00
Brecht Van Lommel	9ad3a85f8b	Fix Cycles GPU binaries build error after recent changes for Metal	2023-02-06 13:17:57 +01:00
Michael Jones	654e1e901b	Cycles: Use local atomics for faster shader sorting (enabled on Metal) This patch adds two new kernels: SORT_BUCKET_PASS and SORT_WRITE_PASS. These replace PREFIX_SUM and SORTED_PATHS_ARRAY on supported devices (currently implemented on Metal, but will be trivial to enable on the other backends). The new kernels exploit sort partitioning (see D15331) by sorting each partition separately using local atomics. This can give an overall render speedup of 2-3% depending on architecture. As before, we fall back to the original non-partitioned sorting when the shader count is "too high". Reviewed By: brecht Differential Revision: https://developer.blender.org/D16909	2023-02-06 11:18:26 +00:00
Campbell Barton	79c82fc1c5	Cleanup: trailing space	2023-01-31 15:49:04 +11:00
Campbell Barton	27b4916b1a	Cleanup: spelling in comments Also minor changes in comments: - Reference BLENDER_HISTORY_FILE instead of the literal file-name (simplifies looking up usage). - Use usernames in tags, as noted in code-style.	2023-01-31 14:22:23 +11:00
Xavier Hallade	1c90f8209d	Cycles: fix rendering with Nishita Sky Texture on Intel Arc GPUs Speckles and missing lights were experienced in scenes with Nishita Sky Texture and a Sun Size smaller than 1.5°, such as in Lone Monk and Attic scenes. Increasing the precision of cosf fixes it.	2023-01-24 09:58:22 +01:00
Brecht Van Lommel	a84a8a528d	Cycles: remove SSE3 and AVX kernel optimization levels While keeping SSE2, SSE4.1 and AVX2. This does not affect hardware support, it only slightly reduces performance for some older CPUs. To reduce maintenance cost and improve compile times. Differential Revision: https://developer.blender.org/D16978	2023-01-16 17:53:36 +01:00
Nikita Sirgienko	858fffc2df	Cycles: oneAPI: add support for SYCL host task This functionality is related only to debugging of SYCL implementation via single-threaded CPU execution and is disabled by default. Host device has been deprecated in SYCL 2020 spec and we removed it in `305b92e05f`. Since this is still very useful for debugging, we're restoring a similar functionality here through SYCL 2020 Host Task.	2023-01-03 20:47:24 +01:00
Hallam Roberts	a501a2dbff	Images: add mirror extension type This adds a new mirror image extension type for shaders and geometry nodes (next to the existing repeat, extend and clip options). See D16432 for a more detailed explanation of `wrap_mirror`. This also adds a new sampler flag `GPU_SAMPLER_MIRROR_REPEAT`. It acts as a modifier to `GPU_SAMPLER_REPEAT`, so any `REPEAT` flag must be set for the `MIRROR` flag to have an effect. Differential Revision: https://developer.blender.org/D16432	2022-12-14 19:27:29 +01:00
Brecht Van Lommel	222b64fcdc	Fix Cycles CUDA crash when building kernels without optimizations (for debug) In this case the block size may not be the one we requested, which was assumed to be the case. Instead get the effective block size from the compiler as was already done for Metal and OneAPI.	2022-11-30 21:46:17 +01:00
Michael Jones	b0e2e45496	Cycles: Enable MetalRT pointclouds & other fixes Code authored by Marco Giordano. This fixes pointcloud rendering on MetalRT and some other subtle MetalRT bugs: - Incorrect kernel hashing - Missing specialisation constants - Incorrect visibility filtering - Missing null pointer check Reviewed By: brecht Differential Revision: https://developer.blender.org/D16499	2022-11-14 16:39:18 +00:00
Patrick Mours	e6b38deb9d	Cycles: Add basic support for using OSL with OptiX This patch generalizes the OSL support in Cycles to include GPU device types and adds an implementation for that in the OptiX device. There are some caveats still, including simplified texturing due to lack of OIIO on the GPU and a few missing OSL intrinsics. Note that this is incomplete and missing an update to the OSL library before being enabled! The implementation is already committed now to simplify further development. Maniphest Tasks: T101222 Differential Revision: https://developer.blender.org/D15902	2022-11-09 15:30:21 +01:00
Brecht Van Lommel	e1b3d91127	Refactor: replace Cycles sse/avx types by vectorized float4/int4/float8/int8 The distinction existed for legacy reasons, to easily port Embree intersection code without affecting the main vector types. However we are now using SIMD for these types as well, so there is no good reason to keep the distinction. Also more consistently pass these vector types by value in inline functions. Previously it was partially changed for functions used by Metal to avoid having to add address space qualifiers; it is simple to do it everywhere. Also removes function declarations for vector math headers, which serve no real purpose. Differential Revision: https://developer.blender.org/D16146	2022-11-08 12:28:40 +01:00
Xavier Hallade	305b92e05f	Cycles: oneAPI: remove use of SYCL host device Host device is deprecated in SYCL 2020 spec, cpu device or standard C++ should be used instead.	2022-10-21 15:36:48 +02:00
Lukas Stockner	e2a93e9c7c	Fix T94136: Cycles: No Hair Shadows with Transparent BSDF	2022-10-20 04:47:21 +02:00
Morteza Mostajab	e6902d19a0	Cycles: Allow Intel GPUs under Metal Known Issues: - Command buffer failures when using binary archives (binary archives is disabled for Intel GPUs as a workaround) - Wrong texture sampler being applied (to be addressed in the future) Ref T92212 Reviewed By: brecht Maniphest Tasks: T92212 Differential Revision: https://developer.blender.org/D16253	2022-10-19 17:09:38 +01:00
Xavier Hallade	5bfce9a822	Cycles: oneAPI: preload kernels only when not using prebuilt binaries sycl::build triggers compilation even if prebuilt binaries are available, we'll have to find a better way in this case.	2022-10-19 16:42:10 +02:00
Xavier Hallade	2943997d2a	Cycles: oneAPI: include sycl/sycl.hpp instead of CL/sycl.hpp Since SYCL 2020 API, sycl/sycl.hpp is the way.	2022-10-19 16:42:10 +02:00
Nikita Sirgienko	58324f0c86	Cycles: oneAPI: Make test kernel more representative Test kernel will now test functionalities related to kernel execution with USM memory allocations instead of with SYCL buffers and accessors as these aren't currently used in the backend.	2022-10-14 11:22:11 +02:00
Nikita Sirgienko	82a5790d2a	Cycles: oneAPI: Trigger compilation of used kernels only JIT compilation of oneAPI kernels now happens during load stage and proper message gets shown in the GUI during compilation. Also, this implementation skips kernels that aren't needed for the used scene, reducing overall (re)compilation time.	2022-10-10 16:38:11 +02:00
Xavier Hallade	7eeeaec6da	Cycles: use direct linking for oneAPI backend This is a minimal set of changes, allowing a lot of cleanup that can happen afterward as it allows sycl method and objects to be used outside of kernel.cpp. Reviewed By: brecht, sergey Differential Revision: https://developer.blender.org/D15397	2022-10-07 09:50:05 +02:00
Michael Jones	2b88ee50fb	Cycles: Tweak inlining policy on Metal This patch optimises the Metal inlining policy. It gives a small speedup (2-3% on M1 Max) with no notable compilation slowdown vs what is already in master. Previously noted compilation slowdowns (as reported in T100102) were caused by forcing inlining for `ccl_device`, but we get better rendering perf by relying on compiler heuristics in these cases. Reviewed By: brecht Differential Revision: https://developer.blender.org/D16081	2022-09-27 17:01:28 +01:00
Sebastian Herhoz	75a6d3abf7	Cycles: add Path Guiding on CPU through Intel OpenPGL This adds path guiding features into Cycles by integrating Intel's Open Path Guiding Library. It can be enabled in the Sampling > Path Guiding panel in the render properties. This feature helps reduce noise in scenes where finding a path to light is difficult for regular path tracing. The current implementation supports guiding directional sampling decisions on surfaces, when the material contains at least one diffuse component, and in volumes with isotropic and anisotropic Henyey-Greenstein phase functions. On surfaces, the guided sampling decision is proportional to the product of the incident radiance and the normal-oriented cosine lobe, and in volumes it is proportional to the product of the incident radiance and the phase function. The incident radiance field of a scene is learned and updated during rendering after each per-frame rendering iteration/progression. At the moment, path guiding is only supported by the CPU backend. Support for GPU backends will be added in future versions of OpenPGL. Ref T92571 Differential Revision: https://developer.blender.org/D15286	2022-09-27 15:56:32 +02:00
Xavier Hallade	125ac1f914	Cycles: increase min-supported driver version for Intel GPUs Windows drivers 101.3430 fix an important GUI-related crash and it's best to prevent users from running into it. Linux drivers weren't affected but still had relevant gpu binary compatibility fixes, so it makes sense to keep the min-supported version aligned across OSes.	2022-09-26 07:41:47 -07:00
Werner, Stefan	0c824837ab	Cycles: Cleanup in oneAPI math includes and definitions Now explicitly including math.h first before #defining functions. This avoids undefined behavior and improves compatibility with different SYCL compilers and backends.	2022-09-22 11:33:57 +02:00
Brecht Van Lommel	6d08ba8a50	Fix T100824: Cycles GPU render broken on macOS 13 Beta and Apple silicon The recent revert of Apple silicon inlining changes to avoid long compile times worked on macOS 12, but in macOS 13 Beta it results in render errors. This may be a compiler bug and may get fixed in time, but try to be on the safe side and ensure Blender 3.3.0 works regardless. This brings part of the inlining back, which brings improved performance but also longer compile times again. Compile time is around 2min now, where the previous full inlining was about 5-7min. Patch by Michael Jones. Differential Revision: https://developer.blender.org/D15897	2022-09-06 19:11:52 +02:00
Campbell Barton	6c6a53fad3	Cleanup: spelling in comments, formatting, move comments into headers	2022-09-06 16:25:20 +10:00
Brecht Van Lommel	cf57624764	Cleanup: refactoring of kernel film function names and organization	2022-09-02 17:13:28 +02:00
Xavier Hallade	3e73afb536	Merge branch 'blender-v3.3-release'	2022-08-31 15:34:44 +02:00
Xavier Hallade	b1231e616a	Cycles: Enforce Windows driver version requirements for sycl sycl/L0 runtime reports compute-runtime version since Intel graphics driver 101.3268 on Windows, when querying driver version from sycl. Prior to this driver, it was 0. Now we can bump minimum requirement to this one and filter-out devices returning 0. Maniphest Tasks: T100648	2022-08-31 15:33:16 +02:00
Nikita Sirgienko	658ff994c5	Merge branch 'blender-v3.3-release'	2022-08-29 19:21:49 +02:00
Nikita Sirgienko	805d1063a0	Cycles: Remove "return" and "assert" from oneAPI kernel code	2022-08-29 19:18:50 +02:00
Nikita Sirgienko	48e1a66af0	Merge branch 'blender-v3.3-release'	2022-08-29 18:21:56 +02:00
Nikita Sirgienko	1cd8ca49f9	Cycles: Increased minimum supported driver for Windows in oneAPI	2022-08-29 18:10:56 +02:00
Sergey Sharybin	d4764a385a	Merge branch 'blender-v3.3-release'	2022-08-25 11:50:55 +02:00
Sergey Sharybin	9c2bc57cbd	Fix Cycles oneAPI for a newer DPC++ compiler version	2022-08-25 11:50:22 +02:00


164 Commits