test2

Author	SHA1	Message	Date
Jeroen Bakker	d271b69c67	Vulkan: Optimize swapchain image selection Improve the number of images that are requested in the swapchain. For mailbox presentation mode we select tripple buffering. For V-Sync presentation mode we select double buffering. In future we might want to make the presentation mode and swapchain images dynamic based on the actual action that the user is performing. When doing action that benefit from low latency (paint cursor) we should select mailbox, otherwise we can use fifo to reduce CPU/GPU usage and safe some energy usage along the way. See #136923 for design task. Pull Request: https://projects.blender.org/blender/blender/pulls/136922	2025-04-03 10:30:09 +02:00
Campbell Barton	58e971138b	Cleanup: quiet warnings from clangd Also remove spaces around field id, following our own convention.	2025-04-03 13:51:52 +11:00
Brecht Van Lommel	1ea89c82d4	Build: Remove OpenMP It's better for performance to use a single thread pool for all areas of Blender, and this gets us closer to that. Bullet, Quadriflow, Mantaflow and Ceres still contain OpenMP code, but it was already disabled. On macOS, our OpenMP libraries are no longer compatible with the latest Xcode 16.3. By removing OpenMP we no longer have to solve that problem. OpenMP was disabled for bpy module builds on Windows ARM64, which also no longer needs to be solved. Pull Request: https://projects.blender.org/blender/blender/pulls/136865	2025-04-02 16:50:50 +02:00
Brecht Van Lommel	da9a9093ec	Refactor: Eigen: Switch from OpenMP to TBB Only the parallel sparse matrix code was updated. This is used by e.g. LSCM and ABF unwrap, and performance seems about the same or better. Parallel GEMM (dense matrix-matrix multiplication) is used by libmv, for example in libmv_keyframe_selection_test for a 54 x 54 matrix. However it appears to harm performance, removing parallelization makes that test run 5x faster on a Apple M3 Max. There has been no new Eigen release since 2021, however there is active development in master and it includes support for a C++ thread pool for GEMM. So we could upgrade, but the algorithm remains the same and looking at the implementation it just does not seem designed for modern many core CPUs. Unless the matrix is much larger, there's too much thread synchronization overhead. So it does not seem useful to enable that thread pool for us. Pull Request: https://projects.blender.org/blender/blender/pulls/136865	2025-04-02 16:50:50 +02:00
Brecht Van Lommel	98b3b36411	Refactor: Build: Add bf::dependencies::eigen target To make adding a dependeny on TBB easier. Additional changes: * Using LIB for libmv tests, as it now brings in includes * Removing Eigen header listing in iTaSC Pull Request: https://projects.blender.org/blender/blender/pulls/136865	2025-04-02 16:50:46 +02:00
Sergey Sharybin	ccb493b805	Libmv: Abstract parallel range in camera intrinsics The goal is to be able to switch away from OpenMP by default. In order to achieve this a new parallel_for() function is added, which follows the same declaration as the one from TBB. For now the simplest formulation is used where range is provided by start and last indices (the last one is excluded from the range). The side effect is that Libmv now expects the threading limits to be imposed by the default thread area (or have an explicit thread area with the limit in the caller). In a way this is similar to other external libraries. Pull Request: https://projects.blender.org/blender/blender/pulls/136834	2025-04-02 10:25:49 +02:00
Campbell Barton	90fd070c28	Cleanup: spelling in comments (make check_spelling_*)	2025-04-02 03:02:01 +00:00
Campbell Barton	f89cf19ba6	Cleanup: indentation for CMake files, strip trailing space	2025-04-02 03:01:59 +00:00
Harley Acheson	8325e7f5e4	UI: Use busyButClickableCursor for MacOS Wait Cursor MacOS is currently using one of our custom "hourglass" cursors during busy times. This PR changes it to an OS-supplied cursor, a pointer arrow with an animated blue spinner, meant for this purpose. Although undocumented it has been used by many applications for many years. Including Firefox for 11 years now. Pull Request: https://projects.blender.org/blender/blender/pulls/136735	2025-04-02 01:30:13 +02:00
Sergey Sharybin	36559fd89f	Fix #136811 : HIP-RT performance regression in 4.5 Reduce the register pressure and branching in the switch() by using subclass and cast from void* to the base class. This ensures intersection functions are not inlined multiple times, bringing performance back. Alternative could be to avoid functions (they are quite large) but that only partially resolves the performance regression. Pull Request: https://projects.blender.org/blender/blender/pulls/136823	2025-04-01 17:59:44 +02:00
Jeroen Bakker	439a4e4687	Vulkan: Partial revert of swap chain refactor. On Windows presenting mode mailbox would reduce lag. We might want to be more selective on which mode to use when.	2025-04-01 16:33:31 +02:00
Jeroen Bakker	aed9f22233	Refactor: Vulkan: swapchain This PR refactors the way how swapchains are used. Allow scaling of the swapchain content to the actual resolution of the swapchain. can reduce artefacts when resizing windows when supported. When frame rate is to fast the previous implementation could use a semaphore that were still in use, leading to unwanted stuttering on certain platforms. Waiting when the rendering has finished (GHOST_Frame.submission_fence), before the next image is acquired from the swap chain. Mailbox has been disabled as it can calculate more frames then actually been presented, leading to a lag and increased power usage on others. Pull Request: https://projects.blender.org/blender/blender/pulls/136603	2025-04-01 16:01:22 +02:00
Xavier Hallade	17e0d88c05	Cycles: oneAPI: Avoid returning 0 from get_max_num_threads_per_multiprocessor Instead of relying on the Intel extensions that may not be implemented, we can use max_work_group_size until there is a better alternative. Thanks to Codeplay for this proposal. Co-authored-by: Georgi Mirazchiyski <georgi.mirazchiyski@codeplay.com>	2025-04-01 11:10:08 +02:00
Campbell Barton	8de112efc5	Cleanup: remove unused variables in mataflow	2025-04-01 13:11:17 +11:00
Campbell Barton	00deec7c66	Cleanup: spelling in comments	2025-04-01 12:37:36 +11:00
Campbell Barton	e3d6051181	Cleanup: suppress cast-function-type warnings for CLANG Extend the existing GCC pragma's and add the warning suppression for Cycles & Freestyle.	2025-04-01 12:06:03 +11:00
Campbell Barton	fc8f6ee853	Cleanup: resolve ignored qualifier warning for CLANG	2025-04-01 01:01:38 +00:00
Weizhen Huang	c2ec66a75b	Fix #136726 : Cycles: Area light visible when size is 0 Area light with a size of zero should not contribute to the scene, so set the light as disabled. This does not only fix the reported bug where such light is visible to the camera, but also a regression in 4.2 where the light contributes to the scene when light tree is off. Pull Request: https://projects.blender.org/blender/blender/pulls/136763	2025-03-31 16:17:59 +02:00
Xavier Hallade	795a76029a	Cycles: oneAPI: Restrict use of experimental copy optimization to L0 This API is not properly implemented in other SYCL backends at the moment and we don't want it to fail at runtime, so we conservatively enable it only for Level-Zero.	2025-03-31 16:14:36 +02:00
Jeroen Bakker	472cc10cea	Fix #136521 : Vulkan: Use mailbox present mode by default Mailbox is lowest latency V-Sync enabled mode. Previously we selected FIFO as that was always available and has more support when using debug tools. Improve the #136521 situation as well. Pull Request: https://projects.blender.org/blender/blender/pulls/136753	2025-03-31 11:06:08 +02:00
Xavier Hallade	e00cc8c100	Cycles: oneAPI: Use default linker on Windows The initial issues that led to the choice of forcing the use of linker.exe seem gone and there is currently no strong reason to use linker.exe explicitly, so let's simplify and use the default setting.	2025-03-28 12:34:16 +01:00
Campbell Barton	5d232486e6	Cleanup: minor clarifications in doc-strings for NDOF & OSKEY	2025-03-28 14:06:36 +11:00
Campbell Barton	38d9d971e5	WM: add a "Hyper" capability flag Avoids the need to check against ghost backends.	2025-03-28 13:46:39 +11:00
Campbell Barton	460b1a805d	Fix #134733 : NDOF jumps when switching between applications Window de-activation wasn't reliably clearing the NDOF motion state. See code-comments for details.	2025-03-28 12:48:30 +11:00
Campbell Barton	403900909a	GHOST/Win32: Set the window "inactive" on window de-activation Match the behavior of other platforms by clearing the active window when it's de-activated. On Win32 Blender always considered the last-active window to be active because de-activation wasn't handled. Part of !136122, needed to fix #134733. Co-authored-by: Kamil Galik <kgalik@3dconnexion.com>	2025-03-28 12:48:30 +11:00
Brecht Van Lommel	378b4e2ca4	Refactor: Improve image buffer save/load functions in GHOST	2025-03-27 22:31:03 +01:00
Brecht Van Lommel	124c0f1692	OpenColorIO: Support using file rules to detect colorspace The Blender config.ocio does not have any rules, but custom ones can. The default file rule is ignored if default_byte or default_float roles exist. These roles are Blender specific, so would not be found in a typical OCIO config. But when they are set appropriately, they help provide better default guesses than what is possible with standard OCIO rules. Pull Request: https://projects.blender.org/blender/blender/pulls/136516	2025-03-27 22:07:50 +01:00
Alex Fuller	a1b7ce1d22	Cycles: Move UV tangent computation into the core This makes it available in Cycles standalone, and the implementation can be shared with Blender. This also makes it possible to compute tangents after tessellation for adaptive subdivision. There is a difference in UV map tangents when there are no UVs. They are now generated from object space coordinates instead of auto texture space coordinates. This is more efficient, and a corner case that we don't have to keep compatible. Co-authored-by: Brecht Van Lommel <brecht@blender.org> Pull Request: https://projects.blender.org/blender/cycles/pulls/25	2025-03-27 22:07:50 +01:00
Alex Fuller	438ac2a653	Cycles: Add OSL metadata for default geometry attributes Add support for OSL parameter metadata named `defaultgeomprop`, whose values are interpreted the same way as the property on MaterialX node inputs. When set to `Tworld` the tangent is then automatically linked to the shader and generated for the mesh. Pull Request: https://projects.blender.org/blender/cycles/pulls/25	2025-03-27 22:07:50 +01:00
Brecht Van Lommel	47e1b24c29	Refactor: Cycles: Apply static transforms later in scene update It makes more sense to do it after geometry processing, in particular for the tangent computation that is coming. Pull Request: https://projects.blender.org/blender/blender/pulls/136576	2025-03-27 22:07:50 +01:00
Brecht Van Lommel	e394fd191b	Refactor: Cycles: Sync various build fixes from the standalone repository Pull Request: https://projects.blender.org/blender/blender/pulls/136576	2025-03-27 22:07:50 +01:00
Pierre Pontier	178b0cbff9	Cleanup: Fix warnings about comparing int and size_t Pull Request: https://projects.blender.org/blender/cycles/pulls/24	2025-03-27 22:07:50 +01:00
Brecht Van Lommel	26ccbaf8b5	Fix: Cycles build broken without OpenSubdiv	2025-03-27 22:07:50 +01:00
Xavier Hallade	c4cf399755	Cycles: oneAPI: Re-enable -ffast-math The initial limitation preventing from using -ffast-math, worked around in `09df1f4caf`, got fixed upstream in LLVM and the fix is part of current DPC++ compiler: `63ecd2a725` We're now able to go back to using -ffast-math, which helps simplifying the set of compiler flags. No performance nor conformance change is expected from this change (most of the gain is achieved already with the use of -cl-fast-relaxed-math since `284b89a0a3`) and this has been verified on Arc B580 under Windows.	2025-03-27 17:18:30 +01:00
Jeroen Bakker	3885a37541	Vulkan: Initial OpenXR support The Blender's VkInstance cannot be shared with OpenXR VkInstance. The reason is a chicken and egg problem where OpenXR needs to be started before Vulkan. OpenXR can add special vulkan specific requirements (instance&device) that are only available when the user starts an OpenXR session. The goal implementation is to share memory between both instances using [VK_KHR_external_memory](https://registry.khronos.org/vulkan/specs/latest/man/html/VK_KHR_external_memory.html) and related extensions. However this seems to be a bridge to far as a initial step. Reason: There are not that many samples/ guides and documentation to be found to handle the workflow that we require. We want to do a smaller step by step approach to gain the needed knowledge. For that reason this PR does the most stupidest thing that can be done to share memory between instances. Download the render result to CPU RAM share the host pointer with the OpenXR instance which copies it to the swap chain. Also the synchronization is done using wait idle commands. <video src="attachments/32a0d69b-c3fa-4272-aea0-d207609afaaf" title="Screencast From 2025-03-18 11-16-17.webm" controls></video> Gaining knowledge - Experiment with `VK_KHR_external_memory_host` extension for uploading vertex buffers (not related to OpenXR). - Import host pointer with `VK_KHR_external_memory_host`. This reduces the additional memcpy on OpenXR side. - Export host pointer from Blender side from a mappable buffer. - Replace host pointers with fd/dmabuf/winhandle - Remove mappable buffer. Ref #133718 Pull Request: https://projects.blender.org/blender/blender/pulls/133824	2025-03-27 16:57:51 +01:00
Brecht Van Lommel	15390b9257	License: Change NanoVDB header to Apache 2, following upstream OpenVDB This license was changed upstream, and it's simpler if we can use the same as most of the Cycles code.	2025-03-27 14:48:06 +01:00
Alaska	2e829ca4cf	Fix #136303 : Normalize the normals on the Ambient Occlusion node This commit simply normalizes the normals of the Ambient occlusion node before computing the output to avoid odd behaviour with unnormalized normals. Pull Request: https://projects.blender.org/blender/blender/pulls/136315	2025-03-27 02:58:19 +01:00
Campbell Barton	42ad772a1f	Cleanup: spelling & repeated terms (make check_spelling_*) Also use comment blocks for English text.	2025-03-27 01:13:34 +00:00
Xavier Hallade	7a257359f8	Cycles: oneAPI: Use max_compute_units in get_num_multiprocessors Instead of returning 0 in case the Intel extension for getting the count of Execution Units isn't available, we now use sycl::info::device::max_compute_units. We keep using the Intel extension in priority since it logically goes with sycl::ext::intel::info::device::gpu_hw_threads_per_eu used in get_max_num_threads_per_multiprocessor(), for which there is no sycl::info::device::max_threads_per_compute_unit replacement yet.	2025-03-26 23:15:49 +01:00
Xavier Hallade	06b03b0fa6	Fix: Windows debug release builds fail at linking The debug set of Embree prebuilt libraries currently lacks SYCL support while the release ones have it. This case was not gracefully handled for debug builds with Embree on GPU enabled, leading to linking errors, trying to resolve rtcNewSYCLDevice and rtcIsSYCLDeviceSupported. We now test for this case to explicitly disable the use of Embree on GPU for debug builds on Windows and print this status from CMake.	2025-03-26 23:01:39 +01:00
Sergey Sharybin	42cbc52b07	Fix: Warning in Cycles motion blur kernel features expression This fixes the following warning with MSVC: device_impl.cpp(287): warning C4805: '\|=': unsafe mix of type 'bool' and type 'ccl::uint' in operation The similar fix is applied to Metal code as well. There is no short-circuiting boolean operator \|\|=, so expand the expression. Pull Request: https://projects.blender.org/blender/blender/pulls/136561	2025-03-26 17:20:33 +01:00
Gon Solo	eaee5fc658	Fix: Spelling mistake in Vulkan error print Pull Request: https://projects.blender.org/blender/blender/pulls/136535	2025-03-26 14:25:49 +01:00
Sergey Sharybin	2ab231d802	Refactor: Pass proper KernelGlobals HIP-RT functions do have access to kg, and it was used inconsistently: some functions were passed actual kg, other were passed nullptr. This change makes it consistent and passes kg everywhere. Pull Request: https://projects.blender.org/blender/blender/pulls/136503	2025-03-26 11:07:06 +01:00
Sergey Sharybin	709371b278	Refactor: Avoid creation of local copy of RaySelfPrimitives	2025-03-26 11:07:04 +01:00
Sergey Sharybin	888c7e1df9	Cleanup: Avoid redundant data fetch	2025-03-26 11:07:04 +01:00
Sergey Sharybin	3d882acee2	Cleanup: Else after return	2025-03-26 11:07:04 +01:00
Sergey Sharybin	b2dd523d0d	Cleanup: Avoid default hit initialization The entire object is assigned later on, no need to initialize it.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	323e27d825	Cleanup: Remove redundant assignment The payload stores pointers, no need to restore pointer of the function argument to the same value.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	e92a8042c3	Refactor: Payload for shadow intersection and filter in HIP-RT The code before this change was relying on the ShadowPayload have the same "header" as RayPayload for some of the primitive types (curve, motion triangle, point): intersection functions were shared between "regular" and shadow rays (shadow in this case is shadow_all), but extra filter function was used for shadow rays. This is fragile if someone changes one of these structures. What is worse is that compiler might actually decide to shuffle things in some structs, or remove unused fields. This change also solves confusion about ShadowPayload::prim_type seemingly only being assigned to PRIMITIVE_NONE. With time it is not impossible that compiler will also see this, and constant-fold some checks, or even remove the field. If that happens then the render result will be wrong. Maybe it is already happening as there are some GPU and driver and optimization flag specific bugs in the area. It is unclear whether it was causing any actual problem: W7800 seems to render all hair correctly on Linux.	2025-03-26 11:07:04 +01:00
Sergey Sharybin	cdb3f34944	Cleanup: Use full name for the primitive_type Makes it extra clear locally type of what the variable contains: primitive, ray, or something else.	2025-03-26 11:07:04 +01:00

1 2 3 4 5 ...

14851 Commits