griefith/test

Author	SHA1	Message	Date
Sergey Sharybin	11d311e300	Fix: Cycles assert in device consistency check A regression since #118841. It is possible that the selected preference device is not found, in which case a default-initialized DeviceInfo would have added to the list. This device is set to CPU, but with differnet other fields (such as description) compared to the actual CPU device. Pull Request: https://projects.blender.org/blender/blender/pulls/122701	2024-06-04 12:49:30 +02:00
Brecht Van Lommel	a1d52ee950	Fix: Cycles CUDA runtime compilation should mark CUDA 12 as supported	2024-06-03 14:04:30 +02:00
Xavier Hallade	db8021d61a	Cycles: oneAPI: explicitly enable/disable SYSMAN ZES_ENABLE_SYSMAN is supposed to be set for free_memory queries to be available. These queries are then optionally used since `759bb6c768`, for the host memory fallback feature. Setting SYCL_ENABLE_PCI was leading ZES_ENABLE_SYSMAN to be set by DPCPP 2022-12 but it's not used by newer versions of DPCPP. We however temporarily disable SYSMAN by default on Linux as builds with JEMALLOC enabled currently lead to driver runtime issues. These can be worked around by using LD_PRELOAD=libigsc.so.	2024-05-30 12:16:16 +02:00
Xavier Hallade	0b3157dc93	Cleanup: Cycles: remove unused SYCL environment variable SYCL_PI_LEVEL_ZERO_USE_COPY_ENGINE_FOR_IN_ORDER_QUEUE has been removed from SYCL runtime years ago.	2024-05-30 11:43:26 +02:00
Nikita Sirgienko	8ee8d01711	Cycles: oneAPI: Fix Out-Of-Memory errors on some integrated GPUs	2024-05-29 21:57:13 +02:00
Campbell Barton	c5a27f011e	Cleanup: spelling in comments	2024-05-29 12:49:07 +10:00
Nikita Sirgienko	759bb6c768	Cycles: oneAPI: Enable host memory migration This enables scenes with all textures not fitting in GPU memory to finally render. For scenes that are fitting, no functional change or performance change is expected. Pull Request: https://projects.blender.org/blender/blender/pulls/122385	2024-05-28 19:04:19 +02:00
Michael Jones	e82d69daa1	Cycles: Disambiguate shadow integrator state buffer names This patch adds a "shadow" prefix & array index suffixes to the shadow integrator state buffer names. This eliminates confusion when looking at GPU traces etc. Pull Request: https://projects.blender.org/blender/blender/pulls/121745	2024-05-15 23:19:24 +02:00
Michael Jones	5508b41a40	Cycles: MetalRT optimisations (scene_intersect_shadow + random_walk) This PR contains optimisations and a general tidy-up of the MetalRT backend. - Currently `scene_intersect` is used for both normal and (opaque) shadow rays, however the usage patterns are different enough to warrant specialisation. Shadow intersection tests (flagged with `PATH_RAY_SHADOW_OPAQUE`) only need a bool result, but need a larger "self" payload in order to exclude hits against target lights. By specialising we can minimise the payload size in each case (which is helps performance) and avoid some dynamic branching. This PR introduces a new `scene_intersect_shadow` function which is specialised in Metal, and currently redirects to `scene_intersect` in the other backends. - Currently `scene_intersect_local` is implemented for worst-case payload requirements as demanded by `subsurface_disk` (where `max_hits` is 4). The random_walk case only demands 1 hit result which we can retrieve directly from the intersector object (rather than stashing it in the payload). By specialising, we significantly reduce the payload size for random_walk queries, which has a big impact on performance. Additionally, we only need to use a custom intersection function for the first ray test in a random walk (for self-primitive filtering), so this PR forces faster `opaque` intersection testing for all but the first random walk test. - Currently `scene_intersect_volume` has a lot of redundant code to handle non-triangle primitives despite volumes only being enclosed by trimeshes. This PR removes this code. Additionally, this PR tidies up the convoluted intersection function linking code, removes some redundant intersection handlers, and uses more consistent naming of intersection functions. On a M3 MacBook Pro, these changes give 2-3% performance increase on typical scenes with opaque trimesh materials (e.g. barbershop, classroom junkshop), but can give over 15% performance increase for certain scenes using random walk SSS (e.g. monster). Pull Request: https://projects.blender.org/blender/blender/pulls/121397	2024-05-10 16:38:02 +02:00
Attila Áfra	26c93c8359	Cycles: Enable OIDN 2.3 lazy device module loading This enables the new lazy module loading behavior introduced in OIDN 2.3, without breaking compatibility with older versions of OIDN (using separate code paths). Also, the detection of OIDN support for devices is now much cleaner, and devices do not need to be matched by PCI address or device name anymore. Pull Request: https://projects.blender.org/blender/blender/pulls/121362	2024-05-07 14:07:39 +02:00
Attila Áfra	2a0a6f18cc	Cycles: Add OpenImageDenoise quality option This adds a new "Quality" option for OIDN to switch between the existing "High" and "Balanced" modes and the new "Fast" mode introduced in OIDN 2.3. Pull Request: https://projects.blender.org/blender/blender/pulls/121374	2024-05-06 18:56:16 +02:00
Michael Jones	9b833fdeba	Cycles: Use more accurate GPU counter timestamps for profiling in Metal This PR replaces the existing CPU wall-clock based profiling mechanism with more precise GPU counter based timestamps. As before, it is enabled by setting the env var `CYCLES_METAL_PROFILING=1`. Original implementation by Morteza Mostajabodaveh. Pull Request: https://projects.blender.org/blender/blender/pulls/121208	2024-04-29 15:25:32 +02:00
Alaska	1dede89eee	Refactor: Allow get_apple_gpu_architecture to report non Apple GPUs get_apple_gpu_architecture will now report if the GPU being checked is not an Apple GPU. At the moment this has no functional changes. But it reduces the chances of mistakes in the future where a developer tries to enable a feature on newer Apple GPUs using get_apple_gpu_architecture, and accidentally enables it on unsupported AMD and Intel GPUs. Pull Request: https://projects.blender.org/blender/blender/pulls/120448	2024-04-15 15:04:23 +02:00
Patrick Mours	33d7fa8cb3	Fix #119959 : Enabling "Distribute memory between devices" for Cycles results in error With the switch to using the primary CUDA context it became possible for peer access between CUDA devices to already have been enabled for that context, either by a previous Cycles session or third-party library, thus causing the call to `cuCtxEnablePeerAccess` to return `CUDA_ERROR_PEER_ACCESS_ALREADY_ENABLED`. This is not a failure state however, so just needs to be handled like a success return value. Pull Request: https://projects.blender.org/blender/blender/pulls/120255	2024-04-15 12:17:32 +02:00
Alaska	eff4fe24cf	Cycles: Properly default to Metal-RT off unless GPU is a M3 or newer Ever since commit [1], `use_metalrt_by_default` will be True if the GPU being used is not a M1 or M2 based system. The intention of this was to enable MetalRT by default for M3 and newer devices that have hardware for ray traversal. However the side effect of this change was that all AMD GPUs would have `use_metalrt_by_default` set to True. Which appears to be the main culprit causing crashes on older AMD GPUs in #120126. Since these GPUs don't support MetalRT. This commit fixes this issue by only setting `use_metalrt_by_default` to True if the GPU is not M1 or M2 based, and the GPU is Apple Silicon based. Which equates to M3 or newer. Which is the original intent of this code. This resolves the issue where AMD GPUs were being told to use MetalRT by default, when they shouldn't be. [1] `322a2f7b12` Pull Request: https://projects.blender.org/blender/blender/pulls/120299	2024-04-09 16:19:24 +02:00
Xavier Hallade	cbc7962a73	Cycles: Tune kernel sizes for oneAPI device This brings a 1-3% performance improvement depending on the scenes, on the Arc A770.	2024-04-04 16:04:13 +02:00
Brecht Van Lommel	bd1f4343c3	Build: Improve OSL library dependency handling in Cycles Might fix some missing symbols when the OSL library gets updated. Pull Request: https://projects.blender.org/blender/blender/pulls/119391	2024-03-29 15:24:30 +01:00
Brecht Van Lommel	53e9fb6b78	Fix #117566 : Cycles persistent data not updated by device preferences Pull Request: https://projects.blender.org/blender/blender/pulls/119970	2024-03-27 18:55:46 +01:00
Sergey Sharybin	bffcb000e8	Fix: Cycles crash on Metal GPU with ASAN builds Running a very simple files when Blender is built with the WITH_COMPILER_ASAN=ON and WITH_CYCLES_KERNEL_ASAN=ON CMake options leads to ASAN reporting an unknown-crash at line where the worker pool is being filled in. It is not entirely clear if it is a real issue in the code, since placing debug prints with `this` address report proper addresses, however there is no harm on capturing `this` pointer by value and it does solve the ASAN reporting issues. It is possible to reproduce the ASAN crash with the following steps: - Start with --factory-startup - Enable Metal device in User Preferences - Switch render device to GPU Compute - Switch viewport more to Rendered Pull Request: https://projects.blender.org/blender/blender/pulls/119867	2024-03-25 11:36:15 +01:00
Campbell Barton	57dd9c21d3	Cleanup: spelling in comments	2024-03-21 10:02:53 +11:00
Weizhen Huang	b81b0308fd	Fix: `WITH_CYCLES_DEBUG` flag not enabled on Metal seems to be enabled on other GPUs already Pull Request: https://projects.blender.org/blender/blender/pulls/119701	2024-03-20 16:42:42 +01:00
Brecht Van Lommel	433d91fca8	Merge branch 'blender-v4.1-release'	2024-03-18 11:00:49 +01:00
Brecht Van Lommel	f57e4c5b98	Fix #119551 : Cycles denoising crash canceling tiled render with MetalRT The BVH has been freed at this point, but the Metal queue sets it on every invocation. Make sure it's null so it doesn't get used anymore. Pull Request: https://projects.blender.org/blender/blender/pulls/119581	2024-03-18 11:00:21 +01:00
Sergey Sharybin	f3f79ef4bd	Merge branch 'blender-v4.1-release'	2024-03-15 09:53:25 +01:00
Alaska	7ec0ebf30c	Cycles: Fix grammar issues in OIDN GPU command line reporting Pull Request: https://projects.blender.org/blender/blender/pulls/119492	2024-03-15 09:52:47 +01:00
Brecht Van Lommel	335ff6efab	Cycles: Disable OpenImageDenoise support for AMD GPUs in Blender 4.1 In older drivers with an integrated GPU, this may crash. This not only affects HIP, but also can crash when using Cycles with an NVIDIA or Intel GPU in combination with an AMD CPU. Fixes for this are expected to be coming, but there will not be enough time for user testing, and it is difficult to be certain that the fix is complete. So to be careful, this is postponed until it has had more testing. Pull Request: https://projects.blender.org/blender/blender/pulls/119476	2024-03-14 18:18:18 +01:00
Brecht Van Lommel	92f6ba5a5f	Merge branch 'blender-v4.1-release'	2024-03-11 15:09:55 +01:00
Brecht Van Lommel	c388ed1e53	Fix #118709 : Crash in OIDN GPU detection for unsupported HIP device Pull Request: https://projects.blender.org/blender/blender/pulls/119315	2024-03-11 15:09:24 +01:00
Miguel Pozo	a53e8d6d24	Merge branch 'blender-v4.1-release'	2024-03-11 12:27:39 +01:00
Attila Afra	60e8b56bcd	Fix: CUDA module memory leak since using primary context Previously the CUDA context was always destroyed and the module along with it. Now that this no longer happens, the missing module free became a memory leak. Also fix the same issue for HIP, though this is destroying the context so it's not a problem yet. Fix part of #119035 Co-authored-by: Brecht Van Lommel <brecht@blender.org>	2024-03-11 10:39:24 +01:00
Campbell Barton	e33f5e36ac	Cleanup: spacing around C-style comment blocks	2024-03-09 23:40:57 +11:00
Hans Goudey	04a9790035	Merge branch 'blender-v4.1-release'	2024-03-08 16:35:33 -05:00
Brecht Van Lommel	898187cfab	Fix #118466 : Cycles renders black on Metal + AMD Global built-ins appear to not work on AMD cards. Also add a tweak to avoid a performance regression, similar to what was done before. Disable adaptive subdivision kernel code if not used. Pull Request: https://projects.blender.org/blender/blender/pulls/119175	2024-03-08 16:41:27 +01:00
Brecht Van Lommel	44d418143e	Merge branch 'blender-v4.1-release'	2024-03-05 19:55:07 +01:00
Sahar A. Kashi	3e09fbf062	Fix #112983 : Cycles HIP-RT crash on deleting all objects Pull Request: https://projects.blender.org/blender/blender/pulls/118944	2024-03-05 19:52:58 +01:00
Brecht Van Lommel	6788b7e87f	Merge branch 'blender-v4.1-release'	2024-02-29 17:55:37 +01:00
Brecht Van Lommel	36c11ee482	Fix #118514 : Cycles MetalRT crash with empty scene Pull Request: https://projects.blender.org/blender/blender/pulls/118907	2024-02-29 17:28:13 +01:00
Brecht Van Lommel	1355285c0e	Merge branch 'blender-v4.1-release'	2024-02-29 13:52:19 +01:00
Alaska	659f05ef28	Fix: Cycles HIP incorrect rendering of clip image textures This was fixed in the driver quite a while ago: https://github.com/ROCm/HIP/pull/2229 Ref: #91571 Pull Request: https://projects.blender.org/blender/blender/pulls/118540	2024-02-29 13:49:29 +01:00
Alaska	0a173b942b	Cycles: Improve reporting of HIP texture allocation failures HIP fails to allocate textures, typically when they are too large. This commit lets the user know what might be causing the issue rather than providing a confusing internal error message. Pull Request: https://projects.blender.org/blender/blender/pulls/118239	2024-02-29 13:49:11 +01:00
Xavier Hallade	b8fdef965d	Merge branch 'blender-v4.1-release'	2024-02-28 18:25:21 +01:00
Xavier Hallade	98343c0c17	Build: Upgrade Intel Graphics Compiler to 1.0.15468 on Linux This corresponds the latest stable LTS release: https://dgpu-docs.intel.com/releases/LTS_803.29_20240131.html Graphics compiler upgrades require increasing the mininum required driver (compute-runtime) version to the corresponding one to guarantee compatibility, which is XX.XX.27642.38 in this release, so we bump this requirement accordingly. Fixes #118713 Pull Request: https://projects.blender.org/blender/blender/pulls/118814	2024-02-28 18:24:30 +01:00
Thomas Dinges	2b095c97fa	Cycles: Increase minimum target on x86 to SSE4.2 * Compile regular host code with SSE4.2 * Remove the SSE2 kernel, only the SSE4.2 and AVX2 kernel remain Pull Request: https://projects.blender.org/blender/blender/pulls/118471	2024-02-26 14:49:19 +01:00
Brecht Van Lommel	0f2064bc3b	Revert changes from main commits that were merged into blender-v4.1-release The last good commit was `4bf6a2e564`.	2024-02-19 15:59:59 +01:00
Brecht Van Lommel	7453c5ed67	Merge branch 'blender-v4.1-release' into main	2024-02-16 19:31:31 +01:00
Raul Fernandez	324ff4ddef	macOS: Remove unnecessary checks now that minimum version is macOS 11.2 MacOS minimum version is now 11.2 we no longer need to check for lower API versions. Pull Request: https://projects.blender.org/blender/blender/pulls/118388	2024-02-16 19:03:23 +01:00
Campbell Barton	156fffbfde	Merge branch 'blender-v4.1-release'	2024-02-14 14:29:55 +11:00
Brecht Van Lommel	dd382be067	Fix #118020 : Cycles OptiX OSL crashes Turns out we were not building OSL with OptiX enabled anymore. Also check now if the OSL builds has OptiX support and if not disable it in Cycles. Building OSL with support for this (still) does not require either the OptiX SDK or CUDA, it only needs LLVM. Pull Request: https://projects.blender.org/blender/blender/pulls/118234	2024-02-14 03:40:01 +01:00
Campbell Barton	3dbbc013de	Cleanup: spelling in comments	2024-02-10 22:35:35 +11:00
Thomas Dinges	30a22b92ca	Cycles: Rename SSE4.1 kernel to SSE4.2 This commit updates all defines, compiler flags and cleans up some code for unused CPU capabilities. There should be no functional change, unless it's run on a CPU that supports sse41 but not sse42. It will fallback to the SSE2 kernel in this case. In preparation for the new SSE4.2 minimum in Blender 4.2. Pull Request: https://projects.blender.org/blender/blender/pulls/118043	2024-02-09 17:25:58 +01:00

1 2 3 4 5 ...

1312 Commits