griefith/test

Author	SHA1	Message	Date
Michael Jones	5be30b7d2b	Cycles: "Struct-of-array-of-packed-structs" for parts of the integrator state On a M3 MacBook Pro, this change increases the benchmark score by 8% (with classroom seeing a path-tracing speedup of 15%). The integrator state is currently store using struct-of-arrays, with one array per field. Such fine grained separation can result in poor GPU cache utilisation in cases where multiple fields of the same parent struct are accessed together. This PR changes the layout of the `ray`, `isect`, `subsurface`, and `shadow_ray` structs so that the data is interleaved (per parent struct) instead of separate. To try and keep this change localised, I encapsulated the layout change by extending the integrator state access macros, however maybe we want to do this more explicitly? (e.g. by updating every bit of code that accesses these parts of the state). Feedback welcome. Pull Request: https://projects.blender.org/blender/blender/pulls/122015	2024-06-04 14:53:30 +02:00
Sergey Sharybin	6e0a49265a	Fix #122622 : Cycles GPU performance regression in certain scenes This is an oversight of #122543, for which benchmarking was done in the headless mode. The solution is to tweak policy a little bit, and keep refresh intervals low for the first 10 seconds of render, after which increase updates to every 15 seconds. Doing so allows: - Have quick cancel of complex files when the error is noticed during the first few samples. - Have more predictable cancel time after long render. - Mitigate the performance regression. This does not fully solve the regression, but it makes it much more manageable. There are some compromises to be done from the performance for the UI renders. The interactivity is also not as fantastic, but it could be solved later by introducing some "Instant Cancel" operations which would be able to also stop render in the middle of a sample. Performance measured with the Spring file (path tracing time in seconds): Samples: 300 1024 2048 Base (prior to #122543): 29.1 85.4 174.1 This patch: 37.0 95.7 180.2 This is measured on M2 Ultra GPU render. The penalty is close to a constant time (the time within which a more interactive cancel is possible. Pull Request: https://projects.blender.org/blender/blender/pulls/122658	2024-06-04 14:45:46 +02:00
Brad Smith	b0fdca60cd	Cleanup: Mikktspace: use sqrtf() function with float type Pull Request: https://projects.blender.org/blender/blender/pulls/121703	2024-06-04 14:27:17 +02:00
Sergey Sharybin	11d311e300	Fix: Cycles assert in device consistency check A regression since #118841. It is possible that the selected preference device is not found, in which case a default-initialized DeviceInfo would have added to the list. This device is set to CPU, but with differnet other fields (such as description) compared to the actual CPU device. Pull Request: https://projects.blender.org/blender/blender/pulls/122701	2024-06-04 12:49:30 +02:00
Lukas Stockner	e8c83aed17	Cleanup: Formatting	2024-06-04 02:01:44 +02:00
Lukas Stockner	fbc770d58b	Cycles: Fallback to area PDF when spherical rectangles are too small This fixes #69535 and #98930. We use a equi-solid-angle sampling algorithm for rectangular area lights, but it is not particularly robust for small area lights (either small in general and/or small because it's being viewed from grazing angles). The actual sampling part is fine since it just gets clamped into the valid area anyways, and the difference isn't notable for small lights. However, we also need to compute the solid angle to get the sampling PDF, and that computation is quite sensitive to numerical issues for small values. Therefore, this commit adds a fallback path for small values, which instead uses the classic equi-area sampling PDF term times the area-to-solid-angle Jacobian term. This approximation assumes that all points on the light have the same distance and angle to the sampling point, which is of course not strictly the case, but it's close enough for small area lights and better than failing altogether. Pull Request: https://projects.blender.org/blender/blender/pulls/122323	2024-06-04 01:55:26 +02:00
Lukas Stockner	a5dc233aa2	Cycles: Tweak area light sampling to improve numerical robustness Reformulates some terms in the equi-solid-angle rectangle sampling code to handle small area lamps better, and allows for some rounding error in the check whether the sampled position is inside the area light. Pull Request: https://projects.blender.org/blender/blender/pulls/122323	2024-06-04 01:55:25 +02:00
Lukas Stockner	0566bf11b3	Cycles: Add debugging option to load custom OIDN weights Pull Request: https://projects.blender.org/blender/blender/pulls/122591	2024-06-04 01:22:44 +02:00
Weizhen Huang	296ac0e9ef	Cycles: improve light tree with large spot blend In the original paper, the falloff inside `bcone.theta_e` is assumed to be `pi/2`, which is too large for spot light and resulted in an overestimation near the cone boundary. To address this issue, attenuate the energy of a spot light using the minimal possible angle formed by the light axis and the shading point when traversing the light tree. Ref: #122362 Pull Request: https://projects.blender.org/blender/blender/pulls/122667	2024-06-03 23:33:29 +02:00
Nikita Sirgienko	4651f8a08f	Fix: Respect Blender Cycles setting for GPU denoising Previously, GPU denoisers were ignoring settings about render configuration and were using any available GPU. With these changes, GPU denoisers will use the device selected in Blender Cycles settings. This allows any GPU denoiser to be used with CPU rendering. Pull Request: https://projects.blender.org/blender/blender/pulls/118841	2024-06-03 22:41:25 +02:00
Brecht Van Lommel	a1d52ee950	Fix: Cycles CUDA runtime compilation should mark CUDA 12 as supported	2024-06-03 14:04:30 +02:00
Nikita Sirgienko	e0ecf443d4	Cycles: Fix compile error in `1465076a0a`	2024-06-03 11:33:28 +02:00
Nikita Sirgienko	6d87a5d9ed	Cleanup: Cycles: Fix type in the OpenImageDenoise name. There are no functional changes expected.	2024-06-03 11:11:20 +02:00
Nikita Sirgienko	1465076a0a	Refactor: Cycles: Unify error reporting code from denoiser's classes There are no functional changes expected.	2024-06-03 11:11:19 +02:00
Nikita Sirgienko	cb8d2bf4ad	Fix: Cycles: Add missing queue init in run_film_convert_kernels	2024-06-03 11:11:19 +02:00
Campbell Barton	7b30f657a5	Fix memory leak on exit with tablet seat data on Wayland	2024-06-01 22:54:52 +10:00
Sergey Sharybin	5f11ae70ea	Fix: Cycles is not responsive to cancel render in complex scenes In some of the complex scenes it could a very long time for Cycles to respond to cancel request. This is because Cycles only cancels render at a consistent state of render buffer: when all scheduled samples are rendered. This was caused by the render scheduler over-scheduling the number of samples in an attempt to improve occupancy of the GPU. This fix makes it so the scheduler only compensates for the low occupancy if rendering can happen within a desired update time. There is no visible difference in the benchmark scenes with this change. Pull Request: https://projects.blender.org/blender/blender/pulls/122543	2024-05-31 15:54:14 +02:00
Sergey Sharybin	c4d413fdad	Fix: Cycles assert when having volumes and objects with displacement Delay calculation of volume step until bounds are known, avoiding access to non-valid bounds and fixing assert in debug builds. This seems to be an oversight from the #121042 Pull Request: https://projects.blender.org/blender/blender/pulls/122549	2024-05-31 15:53:12 +02:00
Sergey Sharybin	6359464507	Fix #122263 : Pause Preview icon does not update Happens when opening a file saved file with preview paused. This fix covers the typical use-case when the property is modified from the space it comes from. Pull Request: https://projects.blender.org/blender/blender/pulls/122502	2024-05-31 11:46:15 +02:00
Xavier Hallade	db8021d61a	Cycles: oneAPI: explicitly enable/disable SYSMAN ZES_ENABLE_SYSMAN is supposed to be set for free_memory queries to be available. These queries are then optionally used since `759bb6c768`, for the host memory fallback feature. Setting SYCL_ENABLE_PCI was leading ZES_ENABLE_SYSMAN to be set by DPCPP 2022-12 but it's not used by newer versions of DPCPP. We however temporarily disable SYSMAN by default on Linux as builds with JEMALLOC enabled currently lead to driver runtime issues. These can be worked around by using LD_PRELOAD=libigsc.so.	2024-05-30 12:16:16 +02:00
Xavier Hallade	0b3157dc93	Cleanup: Cycles: remove unused SYCL environment variable SYCL_PI_LEVEL_ZERO_USE_COPY_ENGINE_FOR_IN_ORDER_QUEUE has been removed from SYCL runtime years ago.	2024-05-30 11:43:26 +02:00
Jeroen Bakker	12a5b2977f	Vulkan: Do not enable layers Vulkan layers should not be controlled from application, but should be enabled via `vkconfig`. There are configurations in the layers that are mutual exclusive and hard to maintain from within an application. An example is that currently we are not able to use renderdoc efficient, because there is an validation error triggered from within renderdoc. This PR removes all vulkan layer configuration from Blender. Pull Request: https://projects.blender.org/blender/blender/pulls/122473	2024-05-30 11:11:21 +02:00
Nikita Sirgienko	8ee8d01711	Cycles: oneAPI: Fix Out-Of-Memory errors on some integrated GPUs	2024-05-29 21:57:13 +02:00
Sergey Sharybin	fa3eaac0ac	Cycles: Limit the number of bounces in the shadow linking intersect kernel Since the previous fix to properly support volumes and transparent objects it became very easy to make it so the intersection loop takes all 1024 iterations to find intersections. This change makes it so the number of intersection is limited by the max number of volume/transparent bounces. This should minimize possible performance impact of the previous fix. Pull Request: https://projects.blender.org/blender/blender/pulls/122448	2024-05-29 21:53:12 +02:00
Brecht Van Lommel	133d6b1a33	Cleanup: Compiler warnings	2024-05-29 20:41:51 +02:00
Brecht Van Lommel	5d213d78e3	Fix #122362 : Shadow linking with volumes and transparency misses light We can't do the optimization to shorten the ray when we might still need to go through transparent surfaces or volumes to reach the light. This issue was not light tree specific, however in the test file it was more noticable because the light tree poorly handles some areas. This in in turn causes MIS weights for forward path tracing to become higher, which is where the error was. Pull Request: https://projects.blender.org/blender/blender/pulls/122404	2024-05-29 17:53:32 +02:00
Campbell Barton	c5a27f011e	Cleanup: spelling in comments	2024-05-29 12:49:07 +10:00
Nikita Sirgienko	759bb6c768	Cycles: oneAPI: Enable host memory migration This enables scenes with all textures not fitting in GPU memory to finally render. For scenes that are fitting, no functional change or performance change is expected. Pull Request: https://projects.blender.org/blender/blender/pulls/122385	2024-05-28 19:04:19 +02:00
Campbell Barton	0cbadd00a3	Cleanup: various non functional changes	2024-05-28 14:09:40 +10:00
Campbell Barton	ca9b105e6c	Cleanup: spelling in comments	2024-05-28 13:47:43 +10:00
Hoshinova	de6ff6eb0a	Fix #121969 : Noise texture outputs 0 with coordinates beyond 8.40*10^6 One of the properties of Perlin noise is that it always evaluates to 0.0 when not normalized (or 0.5 when normalized) when the input consists of only whole integers in all vector components. Blender's Perlin noise implementation uses single precision floats with a machine epsilon of 1.19e-07 meaning that for numbers that are greater than 1/(1.19e-07) = 8.40e6 there mantissa doesn't have any bits left to store a rational part of the number, effectively meaning that any number greater than 8.40e6 is a whole integer as far as Blender is concerned. Therefore when evaluating Perlin noise for any coordinates greater than that it always results in 0.0 (or 0.5 when normalized). This fix works as follows: If the original input number is larger than 1.0e6 it is offset by 0.5 after it underwent modulo, which always outputs numbers in a [0.0, 1.0e5) range leaving the mantissa room for a rational part. This way the quantization error still persists however the outputs are random again instead of a constant 0.0. Pull Request: https://projects.blender.org/blender/blender/pulls/122112	2024-05-27 12:48:15 +02:00
Amine Bensalem	4708e9ec6d	Fix #114780 : Cycles: Principled Chiang Hair importance sampling correction Principled Chiang hair longitudinal importance sampling correction, according to the new pbrt fix here : https://github.com/mmp/pbrt-v3/pull/256 Pull Request: https://projects.blender.org/blender/blender/pulls/115241	2024-05-27 09:35:35 +02:00
Lukas Stockner	896ef010f6	Fix #122031 : Cycles: OSL implementation of Thin Film is missing	2024-05-24 23:55:12 +02:00
Jeroen Bakker	b6002872a1	Vulkan: Optional device extensions maintenance 4 When VK_EXT_maintenance4 extension is used at device creation its extension should also be part of the device extension list. This wasn't the case and would trigger UB in the validation layers. Pull Request: https://projects.blender.org/blender/blender/pulls/122208	2024-05-24 13:39:04 +02:00
Jesse Yurkovich	1f243f1d36	Cleanup: fix -Wpessimizing-move warning in GHOST_XrGraphicsBinding	2024-05-24 07:28:55 +02:00
Campbell Barton	765ab695d1	Cleanup: remove light preferences operator This only avoided setting the "section" property for SCREEN_OT_userpref_show.	2024-05-24 14:08:31 +10:00
Lukas Stockner	392b84f879	Fix #114515 : Cycles: Numerical precision issues in triangle light sampling The refactor in `97d9bbbc97` changed the way q is computed in the spherical triangle sampling code. While the new approach is more efficient and saves a few operations, it introduces numerical precision issues for skinny/small (spherical) triangles. Therefore, this change moves the computation of q back to the method from the paper, while keeping the more efficient solid angle computation. Pull Request: https://projects.blender.org/blender/blender/pulls/119224	2024-05-23 02:27:48 +02:00
Brecht Van Lommel	3cbb3b05cd	Fix #121003 : Cycles OptiX error with noise textures Missing math functions used by changes in noise texture code.	2024-05-21 19:31:45 +02:00
Michael Jones	e82d69daa1	Cycles: Disambiguate shadow integrator state buffer names This patch adds a "shadow" prefix & array index suffixes to the shadow integrator state buffer names. This eliminates confusion when looking at GPU traces etc. Pull Request: https://projects.blender.org/blender/blender/pulls/121745	2024-05-15 23:19:24 +02:00
Clément Foucault	1036d9bdb2	Cycles: Use blender's `use_shadow` RNA property This unify Cycles and EEVEE setting. We always copy the Cycles setting in versionning except if the first scene is using EEVEE as renderer. Note that this currently breaks importers addons who will try to `cycles.cast_shadow`property on the light. Pull Request: https://projects.blender.org/blender/blender/pulls/121804	2024-05-15 22:00:55 +02:00
Campbell Barton	c4a0bbb1f4	Extensions: Support online extensions and move add-ons outside Blender The extensions system allows to extend Blender with connectivity to the internet. Right now it means Blender can discover and install add-ons and themes directly from the internet, and notify users about their updates. By default this is disabled (opt-in), and users can enable it the first time they try to install an extension or visit the Prefences > Extensions tab. If this is enabled, Blender will automatically check for updates for extensions.blender.org upon startup. When will Blender access the remote repositories: * Every time you open the Preferences → Extensions: ALL the enabled repositories get checked for the latest info (json) * Every time you try to install by dragging: ALL the enabled repositories get checked for the latest info (json). * Every time you start Blender: selected repositories get checked for the latest info (json). ------------------ From the Blender code point of view, this means that most of the add-ons and themes originally bundled with Blender will now be available from the online platform, instead of bundled with Blender. The exception are add-ons which are deemed core functionality which just happened to be written as Python add-ons. Links: * Original Extenesions Platform Announcement: https://code.blender.org/2022/10/blender-extensions-platform/ * Extensions website: https://extensions.blender.org/ * User Manual: https://docs.blender.org/manual/en/4.2/extensions/index.html#extensions-index * Technical specifications: https://developer.blender.org/docs/features/extensions/ * Changes on add-ons bundling: https://devtalk.blender.org/t/changes-to-add-on-bundling-4-2-onwards/34593 ------------------ This PR does the following: * Move extensions out of experimental. * No longer install `scripts/addons` & `scripts/addons_contrib`. * Add `scripts/addons_core` to blender's repository. These add-ons will still be bundled with Blender and will be always enabled in the future, with their preferences moved to be more closely integrated with the rest of Blender. This will happen during the remaining bcon2 period. For more details, see #121830 From scripts/addons: * copy_global_transform.py * hydra_storm * io_anim_bvh * io_curve_svg * io_mesh_uv_layout * io_scene_fbx * io_scene_gltf2 * pose_library * ui_translate * viewport_vr_preview Extra: bl_pkg (scripts/addons_contrib) Note: The STL (legacy) add-on is going to be moved to the extensions platform. There is already a C++ version on core which is enabled by default. All the other add-ons are already available at extensions.blender.org. To use them you need to: * Go to User Preferences > Extensions * You will be greated with an "Online Extensions" message, click on "Enable Repository". * Search the add-on you are looking for (e.g, Import Images as Planes). * Click on Install Over time their maintaince will be transferred over to the community so their development can carry on. If you used to help maintain a bundled add-on please read: https://devtalk.blender.org/t/changes-to-add-on-bundling-4-2-onwards/34593 Ref: !121825	2024-05-15 19:26:29 +02:00
Sergey Sharybin	727a90a0f1	Compositor: Make GPU compositor an official feature Effectively, make GPU compositor available without need to enable an experimental feature set. The compositor device is now exposed in the Performance panel of Render Buttons. It is also still available in the compositor's N-panel, together with some other options which are more about how editing works, and not exactly related to render performance. Pull Request: https://projects.blender.org/blender/blender/pulls/121398	2024-05-14 15:49:20 +02:00
Weizhen Huang	68de483bf3	Fix: Ray Portal BSDF changing `sd->flag` when closure is not allocated when ray exceeds `max_bounce`, we do not allocate any closure at intersection. However, Ray Portal BSDF still added `SD_BSDF` flag, resulting in undefined behavior in `integrate_surface_bsdf_bssrdf_bounce()`. This part of code was similar to Transparent BSDF, however, Transparent closure was still allocated in this case. To fix the undefined behavior, add `SD_BSDF` flag only when the Ray Portal closure was allocated.	2024-05-13 19:10:02 +02:00
nutti	0d34f59854	Cleanup: Mark method as static to fix pylint error Pull Request: https://projects.blender.org/blender/blender/pulls/121688	2024-05-13 16:02:56 +02:00
Lukas Stockner	12bf5ff7d4	Cycles: Cleanup: Remove unused hash function Also move the remaining one to util/hash.h.	2024-05-13 02:48:54 +02:00
Michael Jones	09ba1486f8	Cycles: Select Metal compute device by default on Apple Silicon machines _No response_ Pull Request: https://projects.blender.org/blender/blender/pulls/121672	2024-05-11 09:32:55 +02:00
Michael Jones	5508b41a40	Cycles: MetalRT optimisations (scene_intersect_shadow + random_walk) This PR contains optimisations and a general tidy-up of the MetalRT backend. - Currently `scene_intersect` is used for both normal and (opaque) shadow rays, however the usage patterns are different enough to warrant specialisation. Shadow intersection tests (flagged with `PATH_RAY_SHADOW_OPAQUE`) only need a bool result, but need a larger "self" payload in order to exclude hits against target lights. By specialising we can minimise the payload size in each case (which is helps performance) and avoid some dynamic branching. This PR introduces a new `scene_intersect_shadow` function which is specialised in Metal, and currently redirects to `scene_intersect` in the other backends. - Currently `scene_intersect_local` is implemented for worst-case payload requirements as demanded by `subsurface_disk` (where `max_hits` is 4). The random_walk case only demands 1 hit result which we can retrieve directly from the intersector object (rather than stashing it in the payload). By specialising, we significantly reduce the payload size for random_walk queries, which has a big impact on performance. Additionally, we only need to use a custom intersection function for the first ray test in a random walk (for self-primitive filtering), so this PR forces faster `opaque` intersection testing for all but the first random walk test. - Currently `scene_intersect_volume` has a lot of redundant code to handle non-triangle primitives despite volumes only being enclosed by trimeshes. This PR removes this code. Additionally, this PR tidies up the convoluted intersection function linking code, removes some redundant intersection handlers, and uses more consistent naming of intersection functions. On a M3 MacBook Pro, these changes give 2-3% performance increase on typical scenes with opaque trimesh materials (e.g. barbershop, classroom junkshop), but can give over 15% performance increase for certain scenes using random walk SSS (e.g. monster). Pull Request: https://projects.blender.org/blender/blender/pulls/121397	2024-05-10 16:38:02 +02:00
Jeroen Bakker	79dfe8d37f	Vulkan: Enable device feature dynamic rendering In the near future the legacy framebuffer/renderpass/pipeline drawing will be replaced by dynamic rendering. Dynamic rendering provide a flexible API to reuse pipelines between framebuffers if they share the same image formats. Dynamic rendering is provided by `VK_KHR_dynamic_rendering` extension and is supported by all platforms we support (Intel since HD4000, NVIDIA since 700, AMD since GCN2 and llvmpipe). Functions provided by extensions are loaded in a struct inside `VKDevice`. Pull Request: https://projects.blender.org/blender/blender/pulls/121642	2024-05-10 10:22:58 +02:00
Attila Áfra	26c93c8359	Cycles: Enable OIDN 2.3 lazy device module loading This enables the new lazy module loading behavior introduced in OIDN 2.3, without breaking compatibility with older versions of OIDN (using separate code paths). Also, the detection of OIDN support for devices is now much cleaner, and devices do not need to be matched by PCI address or device name anymore. Pull Request: https://projects.blender.org/blender/blender/pulls/121362	2024-05-07 14:07:39 +02:00
Attila Áfra	2a0a6f18cc	Cycles: Add OpenImageDenoise quality option This adds a new "Quality" option for OIDN to switch between the existing "High" and "Balanced" modes and the new "Fast" mode introduced in OIDN 2.3. Pull Request: https://projects.blender.org/blender/blender/pulls/121374	2024-05-06 18:56:16 +02:00

1 2 3 4 5 ...

13968 Commits