griefith/test

Author	SHA1	Message	Date
Nikita Sirgienko	54766b6a54	Cycles: Introducing the code for adoption of Embree 4.4 Embree 4.4 introduces an improvement in the Embree GPU implementation by dropping shared memory usage in favor of direct controllable memory transfers. This should allow addressing several problems spotted in Blender regarding multithreading and memory corruption when BVH and rendering happen at the same time. However, to implement such improvements, the API has changed for several functions, and this commit adopts Blender code to these changes, making Blender buildable and functional with all existing Embree 4.X versions, before and after 4.4. No functional changes in Blender behavior are expected if using Embree versions below 4.4. Pull Request: https://projects.blender.org/blender/blender/pulls/139061	2025-05-19 11:25:50 +02:00
Brecht Van Lommel	4d7bd22beb	Refactor: Cycles: Graphics interop changes * Add GraphicsInteropDevice to check if interop is possible with device * Rename GraphcisInterop to GraphicsInteropBuffer * Include display device type and memory size in GraphicsInteropBuffer * Unnest graphics interop class to make forward declarations possible Pull Request: https://projects.blender.org/blender/blender/pulls/137363	2025-04-28 11:38:56 +02:00
Alaska	0a7a12f873	Cycles: Print additional warnings about unsupported oneAPI driver versions to terminal This commit adds some extra prints to terminal related to oneAPI driver information in the situation that the driver version is considered incompatible with the current version of Cycles. Pull Request: https://projects.blender.org/blender/blender/pulls/137272	2025-04-15 09:03:45 +02:00
Xavier Hallade	17e0d88c05	Cycles: oneAPI: Avoid returning 0 from get_max_num_threads_per_multiprocessor Instead of relying on the Intel extensions that may not be implemented, we can use max_work_group_size until there is a better alternative. Thanks to Codeplay for this proposal. Co-authored-by: Georgi Mirazchiyski <georgi.mirazchiyski@codeplay.com>	2025-04-01 11:10:08 +02:00
Xavier Hallade	795a76029a	Cycles: oneAPI: Restrict use of experimental copy optimization to L0 This API is not properly implemented in other SYCL backends at the moment and we don't want it to fail at runtime, so we conservatively enable it only for Level-Zero.	2025-03-31 16:14:36 +02:00
Xavier Hallade	7a257359f8	Cycles: oneAPI: Use max_compute_units in get_num_multiprocessors Instead of returning 0 in case the Intel extension for getting the count of Execution Units isn't available, we now use sycl::info::device::max_compute_units. We keep using the Intel extension in priority since it logically goes with sycl::ext::intel::info::device::gpu_hw_threads_per_eu used in get_max_num_threads_per_multiprocessor(), for which there is no sycl::info::device::max_threads_per_compute_unit replacement yet.	2025-03-26 23:15:49 +01:00
Sean Stirling	5372346978	Cycles: oneAPI: Use linear USM memory for 1D images Rewrite the ONEAPI Blender texture allocation code to make use of 1D images backed by linear USM memory. This increases parity with the CUDA implementation and sets the ground work for enabling host USM allocations in Blender. By enabling this functionality, previously failing benchmarks are now passing. Together with the previous commit, no functional changes are expected.	2025-02-28 17:52:41 +01:00
Nikita Sirgienko	dcbc7c1623	Cycles: oneAPI: Remove some texture code from the squished bindless texture commit This code will be reintroduced back shortly, but under proper credentials. No functional changes are expected along with the next commit.	2025-02-28 17:51:35 +01:00
Brecht Van Lommel	c87a269021	Fix #133953 : Cycles oneAPI texture randomly renders black * Do oneAPI copy optimization as part of host memory alloc and free, so it is properly released before host memory is freed. * Synchronize after loading texture info, like CUDA and HIP. https://projects.blender.org/blender/blender/pulls/134412	2025-02-13 19:58:56 +01:00
Brecht Van Lommel	f99f958c47	Refactor: Cycles: Add host_alloc/free to device API This may be used for device to do host memory allocation in a way that is more efficient for copy the host memory to the device. Also rename and group device memory allocation functions for clarity. Pull Request: https://projects.blender.org/blender/blender/pulls/134412	2025-02-13 19:58:56 +01:00
Campbell Barton	c83c62439e	Cleanup: correct typo	2025-02-13 11:14:50 +11:00
Nikita Sirgienko	2bab4ae370	Cycles: oneAPI: Optimize texture access by using GPU HW sampler The current usage of software-based texture operations in the oneAPI implementation puts additional register pressure on the GPU compiler during register allocation. And it also creates code that requires maintenance. This commit is intended to address this situation by utilizing a recently productized SYCL bindless texture API to enable HW-based texture operations using Intel GPUs' hardware sampler. This currently translates to 1-11% rendering speedups (scene-specific) on my Arc A770 and Arc B580. At the moment, there are small performance regressions with NanoVDB texture operations on Arc B580 and small performance regressions in shade surface MNEE and Raytrace kernels on Arc A770, but they look recoverable and will be handled in the future. Pull Request: https://projects.blender.org/blender/blender/pulls/133457	2025-02-12 21:47:34 +01:00
Nikita Sirgienko	bee534eea5	Build: Upgrade Intel Graphics Compiler to 2.1.14 on Linux This corresponds the latest rolling 2448.13 release: https://dgpu-docs.intel.com/releases/packages.html?release=Rolling+2448.13&os=Ubuntu+24.04 Graphics compiler upgrades require increasing the minimum required driver (compute-runtime) version to the corresponding one to guarantee compatibility, which is XX.XX.31740.15 in this release, so we bump this requirement accordingly. Co-authored-by: Xavier Hallade <me@ph0b.com> Pull Request: https://projects.blender.org/blender/blender/pulls/134051	2025-02-05 15:00:04 +01:00
Xavier Hallade	e7589f8973	Fix: Cycles: Missing texture transfers in oneAPI backend Since `2cfe2e0bfe`, textures were not being allocated nor transfered to device. This fix improves the situation reported in https://projects.blender.org/blender/blender/issues/133953 but is not enough to make all unit tests pass.	2025-02-03 20:20:21 +01:00
Brecht Van Lommel	e8ebcb3ee3	Fix: Cycles: Check if memory is host mapped without access to device_mem_map This avoids concurrency issues. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:23 +01:00
Brecht Van Lommel	cd3d3b2646	Refactor: Cycles: Delay load_texture_info() to enqueue Doing it immediately after moving textures to the host is less efficient, and interacts in confusing ways. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:06 +01:00
Brecht Van Lommel	fec593ec3b	Fix: Cycles: Avoid unnecessary move to host with multi-device If one of the devices already used host happed memory but another not, it would previously realloc both. Thanks to Jorn Visser for investigating and finding this problem. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:02 +01:00
Brecht Van Lommel	2cfe2e0bfe	Fix: Cycles: Re-copy memory from host to device without realloc Should be a bit more efficient, and it fixes host memory fallback bugs, where host memory was incorrectly freed during re-copy. For the case where memory should get reallocated on the host, a new mem_move_to_host was added. Thanks to Jorn Visser for investigating and finding this problem. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:11:50 +01:00
Xavier Hallade	ce463bd6b1	Cycles: oneAPI: optimize device<->host copies There is a large overhead when doing copies between a device and non-USM host memory. Using the prepare/release API avoids it, as presented in the optimization guide: https://www.intel.com/content/www/us/en/docs/oneapi/optimization-guide-gpu/2025-0/optimizing-data-transfers.html This currently translates to a 4-5% overall rendering speedups on my Arc B580 in most scenes. Pull Request: https://projects.blender.org/blender/blender/pulls/132859	2025-01-09 21:00:12 +01:00
Stefan Werner	a79d95099f	Cycles: Fix OneAPI crash after unique_ptr refactor Memory was freed too early, probably a typo.	2025-01-07 09:37:47 +01:00
Brecht Van Lommel	9971648783	Refactor: Cycles: Replace new/delete by unique_ptr, in simple cases Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:30 +01:00
Brecht Van Lommel	57ff24cb99	Refactor: Cycles: Add const keyword to more function parameters Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:24 +01:00
Brecht Van Lommel	dd51c8660b	Refactor: Cycles: Add const keyword where possible, using clang-tidy Check was misc-const-correctness, combined with readability-isolate-declaration as suggested by the docs. Temporarily clang-format "QualifierAlignment: Left" was used to get consistency with the prevailing order of keywords. Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:20 +01:00
Brecht Van Lommel	60bec183cb	Refactor: Cycles: Replace foreach() by range based for loops Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:05 +01:00
Brecht Van Lommel	d0c2e68e5f	Refactor: Cycles: Automated clang-tidy fixups in Cycles * Use .empty() and .data() * Use nullptr instead of 0 * No else after return * Simple class member initialization * Add override for virtual methods * Include C++ instead of C headers * Remove some unused includes * Use default constructors * Always use braces * Consistent names in definition and declaration * Change typedef to using Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:22:55 +01:00
Xavier Hallade	2cfe69c07d	Cycles: Fix error handling of BVH transfer to device Previously, in case of a failure during BVH transfer, when running out of memory for example, we could get an error such as "BVH failed to migrate to the GPU due to Embree library error (no error)", because embree error status was actually reset before being queried. This commit fixes its propagation. Pull Request: https://projects.blender.org/blender/blender/pulls/129022	2024-10-15 10:31:30 +02:00
Xavier Hallade	a1182e07b1	Build: upgrade Intel Graphics Compiler and ocloc on Linux IGC 1.0.17384, ocloc 24.31.30508, which: - add support for Battlemage and Lunar Lake GPUs - recover from recent performance regression on Linux - allow to drop older work-around (`9d5164d472`) and need for a patched version on Windows - ocloc now needs "dg2,mtl" naming for fat binaries. opencl-clang patches don't get applied anymore by igc build scripts when llvm is not a git repository, hence I could also drop we can drop current patch disabling patching. I've only slightly pushed min-driver-version updates after carefull testing, instead of jumping to the same version as ocloc as we use to. Pull Request: https://projects.blender.org/blender/blender/pulls/127251	2024-09-12 09:11:56 +02:00
Xavier Hallade	56db2d393d	Cycles: oneAPI: use ocloc 101.5972 on Windows This new version of the graphics compiler solves a performance regression on Arc, adds support for Battlemage and Lunar Lake GPUs, and allows to drop older patch to build fat binaries with broad compatibility. This latter change requires using -device dg2,mtl naming instead of passing architecture ids. Pull Request: https://projects.blender.org/blender/blender/pulls/127371	2024-09-11 17:34:13 +02:00
Nikita Sirgienko	94c9898f41	Fix #124811 : Cycles: oneAPI: no hair strands in viewport with Embree oneAPI kernels preloading logic was letting un-needed kernels to be compiled without features, which would then miss when these kernels were needed later. Pull Request: https://projects.blender.org/blender/blender/pulls/127114	2024-09-04 11:08:00 +02:00
Nikita Sirgienko	74c09b2e63	Cycles: oneAPI: Fix undefined behavior when embree fails initializing Embree device pointer can end up being nullptr even when Embree on GPU is expected to be used. Previous implementation overlooked this possibility, leading to a completely silent fallback to the non-Hardware ray-tracing path, this commit fixes it. We've noticed this as now Embree relies on a driver component: https://github.com/intel/level-zero-raytracing-support that can potentially be missing from a system. Pull Request: https://projects.blender.org/blender/blender/pulls/124085	2024-07-03 14:13:01 +02:00
Xavier Hallade	4477641467	Cycles: oneAPI: Fix driver version check for future Intel GPU drivers SYCL runtime currently relies on an internal driver behavior that will break the driver version string returned by SYCL if it changes: https://github.com/oneapi-src/unified-runtime/issues/1777 This will be fixed at SYCL runtime level but until we use a new enough one, we need to add additional verifications to avoid blocking execution on a driver that will change this internal behavior. Pull Request: https://projects.blender.org/blender/blender/pulls/124084	2024-07-03 14:12:16 +02:00
Sergey Sharybin	b803d7fabb	Fix: Command line Cycles render crash on multi-CUDA device Since #118841 there are more cases where Cycles would check for the graphics interop support. This could lead to a crash when graphics interop functions are called without having active graphics context. This change makes it so there is no graphics interop calls when doing headless render. In order to achieve this the device creation is now aware of the headless mode. Pull Request: https://projects.blender.org/blender/blender/pulls/122844	2024-06-07 17:53:44 +02:00
Nikita Sirgienko	8ee8d01711	Cycles: oneAPI: Fix Out-Of-Memory errors on some integrated GPUs	2024-05-29 21:57:13 +02:00
Campbell Barton	c5a27f011e	Cleanup: spelling in comments	2024-05-29 12:49:07 +10:00
Nikita Sirgienko	759bb6c768	Cycles: oneAPI: Enable host memory migration This enables scenes with all textures not fitting in GPU memory to finally render. For scenes that are fitting, no functional change or performance change is expected. Pull Request: https://projects.blender.org/blender/blender/pulls/122385	2024-05-28 19:04:19 +02:00
Attila Áfra	26c93c8359	Cycles: Enable OIDN 2.3 lazy device module loading This enables the new lazy module loading behavior introduced in OIDN 2.3, without breaking compatibility with older versions of OIDN (using separate code paths). Also, the detection of OIDN support for devices is now much cleaner, and devices do not need to be matched by PCI address or device name anymore. Pull Request: https://projects.blender.org/blender/blender/pulls/121362	2024-05-07 14:07:39 +02:00
Xavier Hallade	cbc7962a73	Cycles: Tune kernel sizes for oneAPI device This brings a 1-3% performance improvement depending on the scenes, on the Arc A770.	2024-04-04 16:04:13 +02:00
Xavier Hallade	98343c0c17	Build: Upgrade Intel Graphics Compiler to 1.0.15468 on Linux This corresponds the latest stable LTS release: https://dgpu-docs.intel.com/releases/LTS_803.29_20240131.html Graphics compiler upgrades require increasing the mininum required driver (compute-runtime) version to the corresponding one to guarantee compatibility, which is XX.XX.27642.38 in this release, so we bump this requirement accordingly. Fixes #118713 Pull Request: https://projects.blender.org/blender/blender/pulls/118814	2024-02-28 18:24:30 +01:00
Xavier Hallade	18a20f373f	Cycles: increase min Intel GPU driver version on Windows With drivers 101.4972 to 101.5085, some Arc and Meteor Lake devices ignore the prebuilt GPU binaries and since the addition of Meteor Lake binaries, fail caching newly generated ones on Windows. This got fixed in drivers 101.5186 so it's preferable to require these new drivers to be used.	2024-01-31 12:49:30 +01:00
Stefan Werner	8a6f7640d6	Cycles: Make OIDN on GPU use the existing SYCL queue There's already a queue from the Cycles rendering device, so let OIDN use the same instead of creating a new one. Co-authored-by: Werner, Stefan <stefan.werner@intel.com> Pull Request: https://projects.blender.org/blender/blender/pulls/115650	2023-12-07 14:16:21 +01:00
Brecht Van Lommel	e06561a27a	Build: replace Blender specific DEBUG by standard NDEBUG NDEBUG is part of the C standard and disables asserts. Only this will now be used to decide if asserts are enabled. DEBUG was a Blender specific define, that has now been removed. _DEBUG is a Visual Studio define for builds in Debug configuration. Blender defines this for all platforms. This is still used in a few places in the draw code, and in external libraries Bullet and Mantaflow. Pull Request: https://projects.blender.org/blender/blender/pulls/115774	2023-12-06 16:05:14 +01:00
Xavier Hallade	e317831a82	Cycles: oneAPI: adjust reported device capabilities It notably adds driver version as it can be helpful when triaging end-user issues.	2023-11-09 21:46:27 +01:00
Xavier Hallade	1161455084	Build: upgrade Intel Graphics Compiler on Linux In order to speedup compilation, we upgrade IGC to 1.0.14828.26 along with ocloc and the associated dependencies. We also bump min-driver version accordingly to 26918. Ref !114341	2023-11-02 22:27:38 +11:00
Xavier Hallade	ef80392721	Cycles: oneAPI: update windows ocloc to 101.4723 The first public Windows driver version with a higher number is 101.4824, so we bump the min-required driver version on Windows to this one to ensure compatibility.	2023-10-11 14:47:48 +02:00
Nikita Sirgienko	56ff55ed09	Cycles: oneAPI: Resolve an compilation error in assert	2023-09-04 15:41:05 +02:00
Xavier Hallade	6a301cf07f	Cycles: oneAPI: increase windows min driver requirement to 101.4644 101.4644 has been released 2023-08-15 and fixes the Windows specific issue #109282.	2023-08-31 10:22:51 +02:00
Bastien Montagne	7c4f5b7dc6	Temp 'Fix' broken compilation from `abab47a805`. Proper actual fix is obviously needed, but at least now Blender builds again with oneAPI enabled...	2023-08-23 10:39:21 +02:00
Campbell Barton	f66fa32ada	Cleanup: spelling in comments	2023-08-23 13:30:55 +10:00
Nikita Sirgienko	abab47a805	Cycles: oneAPI: Refactoring of local size choice logic	2023-08-22 19:04:16 +02:00
Xavier Hallade	1489c5a57b	Merge branch 'blender-v3.6-release'	2023-06-23 13:12:58 +02:00

1 2

81 Commits