griefith/test

Author	SHA1	Message	Date
Campbell Barton	d98a7a7756	Merge branch 'blender-v4.2-release'	2024-06-06 10:23:16 +10:00
Campbell Barton	7f7648c6ed	Cleanup: spelling in code comments & minor edits - Use uppercase NOTE: tags. - Correct bNote -> bNode. - Use colon after parameters. - Use doxy-style doc-strings.	2024-06-06 09:55:13 +10:00
Lukas Stockner	db991d8ac0	Cycles: Only store per-pixel seed for vertex color baking Currently, during baking each pixel stores a seed input that comes from the Blender side. This is only needed for vertex color baking, however - for regular image baking, we can just as well hash the pixel coordinates. Therefore, we can save some memory (4 byte per pixel) by splitting the seed info out into a separate pass and only storing it when needed. Pull Request: https://projects.blender.org/blender/blender/pulls/122806	2024-06-06 00:45:38 +02:00
Lukas Stockner	f3f05f945c	Cycles: Add missing make_uintX definitions for Metal	2024-06-05 03:04:04 +02:00
Lukas Stockner	79ddd07e76	Cleanup: Formatting	2024-06-05 03:04:03 +02:00
Lukas Stockner	5246fb5a57	Cycles: Implement blue-noise dithered sampling This patch implements blue-noise dithered sampling as described by Nathan Vegdahl (https://psychopath.io/post/2022_07_24_owen_scrambling_based_dithered_blue_noise_sampling), which in turn is based on "Screen-Space Blue-Noise Diffusion of Monte Carlo Sampling Error via Hierarchical Ordering of Pixels"(https://repository.kaust.edu.sa/items/1269ae24-2596-400b-a839-e54486033a93). The basic idea is simple: Instead of generating independent sequences for each pixel by scrambling them, we use a single sequence for the entire image, with each pixel getting one chunk of the samples. The ordering across pixels is determined by hierarchical scrambling of the pixel's position along a space-filling curve, which ends up being pretty much the same operation as already used for the underlying sequence. This results in a more high-frequency noise distribution, which appears smoother despite not being less noisy overall. The main limitation at the moment is that the improvement is only clear if the full sample amount is used per pixel, so interactive preview rendering and adaptive sampling will not receive the benefit. One exception to this is that when using the new "Automatic" setting, the first sample in interactive rendering will also be blue-noise-distributed. The sampling mode option is now exposed in the UI, with the three options being Blue Noise (the new mode), Classic (the previous Tabulated Sobol method) and the new default, Automatic (blue noise, with the additional property of ensuring the first sample is also blue-noise-distributed in interactive rendering). When debug mode is enabled, additional options appear, such as Sobol-Burley. Note that the scrambling distance option is not compatible with the blue-noise pattern. Pull Request: https://projects.blender.org/blender/blender/pulls/118479	2024-06-05 02:29:47 +02:00
Weizhen Huang	2ddc936cbb	Cleanup: simplify computation in Cycles area light sampling `diff` in numerator and denominator cancels out, except for the signs, which we know beforehand since `x1 > x0`, `y1 > y0` Pull Request: https://projects.blender.org/blender/blender/pulls/122740	2024-06-04 23:37:48 +02:00
Michael Jones	5be30b7d2b	Cycles: "Struct-of-array-of-packed-structs" for parts of the integrator state On a M3 MacBook Pro, this change increases the benchmark score by 8% (with classroom seeing a path-tracing speedup of 15%). The integrator state is currently store using struct-of-arrays, with one array per field. Such fine grained separation can result in poor GPU cache utilisation in cases where multiple fields of the same parent struct are accessed together. This PR changes the layout of the `ray`, `isect`, `subsurface`, and `shadow_ray` structs so that the data is interleaved (per parent struct) instead of separate. To try and keep this change localised, I encapsulated the layout change by extending the integrator state access macros, however maybe we want to do this more explicitly? (e.g. by updating every bit of code that accesses these parts of the state). Feedback welcome. Pull Request: https://projects.blender.org/blender/blender/pulls/122015	2024-06-04 14:53:30 +02:00
Lukas Stockner	fbc770d58b	Cycles: Fallback to area PDF when spherical rectangles are too small This fixes #69535 and #98930. We use a equi-solid-angle sampling algorithm for rectangular area lights, but it is not particularly robust for small area lights (either small in general and/or small because it's being viewed from grazing angles). The actual sampling part is fine since it just gets clamped into the valid area anyways, and the difference isn't notable for small lights. However, we also need to compute the solid angle to get the sampling PDF, and that computation is quite sensitive to numerical issues for small values. Therefore, this commit adds a fallback path for small values, which instead uses the classic equi-area sampling PDF term times the area-to-solid-angle Jacobian term. This approximation assumes that all points on the light have the same distance and angle to the sampling point, which is of course not strictly the case, but it's close enough for small area lights and better than failing altogether. Pull Request: https://projects.blender.org/blender/blender/pulls/122323	2024-06-04 01:55:26 +02:00
Lukas Stockner	a5dc233aa2	Cycles: Tweak area light sampling to improve numerical robustness Reformulates some terms in the equi-solid-angle rectangle sampling code to handle small area lamps better, and allows for some rounding error in the check whether the sampled position is inside the area light. Pull Request: https://projects.blender.org/blender/blender/pulls/122323	2024-06-04 01:55:25 +02:00
Weizhen Huang	296ac0e9ef	Cycles: improve light tree with large spot blend In the original paper, the falloff inside `bcone.theta_e` is assumed to be `pi/2`, which is too large for spot light and resulted in an overestimation near the cone boundary. To address this issue, attenuate the energy of a spot light using the minimal possible angle formed by the light axis and the shading point when traversing the light tree. Ref: #122362 Pull Request: https://projects.blender.org/blender/blender/pulls/122667	2024-06-03 23:33:29 +02:00
Sergey Sharybin	fa3eaac0ac	Cycles: Limit the number of bounces in the shadow linking intersect kernel Since the previous fix to properly support volumes and transparent objects it became very easy to make it so the intersection loop takes all 1024 iterations to find intersections. This change makes it so the number of intersection is limited by the max number of volume/transparent bounces. This should minimize possible performance impact of the previous fix. Pull Request: https://projects.blender.org/blender/blender/pulls/122448	2024-05-29 21:53:12 +02:00
Brecht Van Lommel	5d213d78e3	Fix #122362 : Shadow linking with volumes and transparency misses light We can't do the optimization to shorten the ray when we might still need to go through transparent surfaces or volumes to reach the light. This issue was not light tree specific, however in the test file it was more noticable because the light tree poorly handles some areas. This in in turn causes MIS weights for forward path tracing to become higher, which is where the error was. Pull Request: https://projects.blender.org/blender/blender/pulls/122404	2024-05-29 17:53:32 +02:00
Nikita Sirgienko	759bb6c768	Cycles: oneAPI: Enable host memory migration This enables scenes with all textures not fitting in GPU memory to finally render. For scenes that are fitting, no functional change or performance change is expected. Pull Request: https://projects.blender.org/blender/blender/pulls/122385	2024-05-28 19:04:19 +02:00
Campbell Barton	ca9b105e6c	Cleanup: spelling in comments	2024-05-28 13:47:43 +10:00
Hoshinova	de6ff6eb0a	Fix #121969 : Noise texture outputs 0 with coordinates beyond 8.40*10^6 One of the properties of Perlin noise is that it always evaluates to 0.0 when not normalized (or 0.5 when normalized) when the input consists of only whole integers in all vector components. Blender's Perlin noise implementation uses single precision floats with a machine epsilon of 1.19e-07 meaning that for numbers that are greater than 1/(1.19e-07) = 8.40e6 there mantissa doesn't have any bits left to store a rational part of the number, effectively meaning that any number greater than 8.40e6 is a whole integer as far as Blender is concerned. Therefore when evaluating Perlin noise for any coordinates greater than that it always results in 0.0 (or 0.5 when normalized). This fix works as follows: If the original input number is larger than 1.0e6 it is offset by 0.5 after it underwent modulo, which always outputs numbers in a [0.0, 1.0e5) range leaving the mantissa room for a rational part. This way the quantization error still persists however the outputs are random again instead of a constant 0.0. Pull Request: https://projects.blender.org/blender/blender/pulls/122112	2024-05-27 12:48:15 +02:00
Amine Bensalem	4708e9ec6d	Fix #114780 : Cycles: Principled Chiang Hair importance sampling correction Principled Chiang hair longitudinal importance sampling correction, according to the new pbrt fix here : https://github.com/mmp/pbrt-v3/pull/256 Pull Request: https://projects.blender.org/blender/blender/pulls/115241	2024-05-27 09:35:35 +02:00
Lukas Stockner	896ef010f6	Fix #122031 : Cycles: OSL implementation of Thin Film is missing	2024-05-24 23:55:12 +02:00
Lukas Stockner	392b84f879	Fix #114515 : Cycles: Numerical precision issues in triangle light sampling The refactor in `97d9bbbc97` changed the way q is computed in the spherical triangle sampling code. While the new approach is more efficient and saves a few operations, it introduces numerical precision issues for skinny/small (spherical) triangles. Therefore, this change moves the computation of q back to the method from the paper, while keeping the more efficient solid angle computation. Pull Request: https://projects.blender.org/blender/blender/pulls/119224	2024-05-23 02:27:48 +02:00
Brecht Van Lommel	3cbb3b05cd	Fix #121003 : Cycles OptiX error with noise textures Missing math functions used by changes in noise texture code.	2024-05-21 19:31:45 +02:00
Weizhen Huang	68de483bf3	Fix: Ray Portal BSDF changing `sd->flag` when closure is not allocated when ray exceeds `max_bounce`, we do not allocate any closure at intersection. However, Ray Portal BSDF still added `SD_BSDF` flag, resulting in undefined behavior in `integrate_surface_bsdf_bssrdf_bounce()`. This part of code was similar to Transparent BSDF, however, Transparent closure was still allocated in this case. To fix the undefined behavior, add `SD_BSDF` flag only when the Ray Portal closure was allocated.	2024-05-13 19:10:02 +02:00
Lukas Stockner	12bf5ff7d4	Cycles: Cleanup: Remove unused hash function Also move the remaining one to util/hash.h.	2024-05-13 02:48:54 +02:00
Michael Jones	5508b41a40	Cycles: MetalRT optimisations (scene_intersect_shadow + random_walk) This PR contains optimisations and a general tidy-up of the MetalRT backend. - Currently `scene_intersect` is used for both normal and (opaque) shadow rays, however the usage patterns are different enough to warrant specialisation. Shadow intersection tests (flagged with `PATH_RAY_SHADOW_OPAQUE`) only need a bool result, but need a larger "self" payload in order to exclude hits against target lights. By specialising we can minimise the payload size in each case (which is helps performance) and avoid some dynamic branching. This PR introduces a new `scene_intersect_shadow` function which is specialised in Metal, and currently redirects to `scene_intersect` in the other backends. - Currently `scene_intersect_local` is implemented for worst-case payload requirements as demanded by `subsurface_disk` (where `max_hits` is 4). The random_walk case only demands 1 hit result which we can retrieve directly from the intersector object (rather than stashing it in the payload). By specialising, we significantly reduce the payload size for random_walk queries, which has a big impact on performance. Additionally, we only need to use a custom intersection function for the first ray test in a random walk (for self-primitive filtering), so this PR forces faster `opaque` intersection testing for all but the first random walk test. - Currently `scene_intersect_volume` has a lot of redundant code to handle non-triangle primitives despite volumes only being enclosed by trimeshes. This PR removes this code. Additionally, this PR tidies up the convoluted intersection function linking code, removes some redundant intersection handlers, and uses more consistent naming of intersection functions. On a M3 MacBook Pro, these changes give 2-3% performance increase on typical scenes with opaque trimesh materials (e.g. barbershop, classroom junkshop), but can give over 15% performance increase for certain scenes using random walk SSS (e.g. monster). Pull Request: https://projects.blender.org/blender/blender/pulls/121397	2024-05-10 16:38:02 +02:00
Campbell Barton	5529a94f14	Cleanup: spelling in comments	2024-05-03 11:33:18 +10:00
David Murmann	aa70c687e4	Fix: Ray Portal BSDF unconnected direction wrong for OSL Pull Request: https://projects.blender.org/blender/blender/pulls/121348	2024-05-02 19:35:02 +02:00
Lukas Stockner	17f2cdd104	Cycles: Add thin film iridescence to Principled BSDF This is an implementation of thin film iridescence in the Principled BSDF based on "A Practical Extension to Microfacet Theory for the Modeling of Varying Iridescence". There are still several open topics that are left for future work: - Currently, the thin film only affects dielectric Fresnel, not metallic. Properly specifying thin films on metals requires a proper conductive Fresnel term with complex IOR inputs, any attempt of trying to hack it into the F82 model we currently use for the Principled BSDF is fundamentally flawed. In the future, we'll add a node for proper conductive Fresnel, including thin films. - The F0/F90 control is not very elegantly implemented right now. It fundamentally works, but enabling thin film while using a Specular Tint causes a jump in appearance since the models integrate it differently. Then again, thin film interference is a physical effect, so of course a non-physical tweak doesn't play nicely with it. - The white point handling is currently quite crude. In short: The code computes XYZ values of the reflectance spectrum, but we'd need the XYZ values of the product of the reflectance spectrum and the neutral illuminant of the working color space. Currently, this is addressed by just dividing by the XYZ values of the illuminant, but it would be better to do a proper chromatic adaptation transform or to use the proper reference curves for the working space instead of the XYZ curves from the paper. Pull Request: https://projects.blender.org/blender/blender/pulls/118477	2024-05-02 14:28:44 +02:00
Michael Jones	99f5433445	Cycles: Dormant fixes for adaptive feature compilation This PR fixes the (currently unused) scene-based selective feature compilation macros. These feature based macros haven't been used for a few years, and enabling them currently results in compilation errors. The only functional change in this PR is in geom/primitive.h where undef-ing `__HAIR__` had exposed an inconsistency in how pointcloud attributes were being fetched. Using the more general `primitive_surface_attribute_float4` (instead of `curve_attribute_float4`) fixed a compilation error that occurred when rendering pointcloud unit test scenes with adaptive compilation enabled. Pull Request: https://projects.blender.org/blender/blender/pulls/121216	2024-04-30 12:56:22 +02:00
David Murmann	ee51f643b0	Cycles: Ray Portal BSDF Transport rays that enter to another location in the scene, with specified ray position and normal. This may be used to render portals for visual effects, and other production rendering tricks. This acts much like a Transparent BSDF. Render passes are passed through, and this is affected by light path max transparent bounces. Pull Request: https://projects.blender.org/blender/blender/pulls/114386	2024-04-29 12:37:51 +02:00
Alaska	f8c0050a25	Shaders: Add Roughness to Subsurface Scattering Node Matching the Principled BSDF. Pull Request: https://projects.blender.org/blender/blender/pulls/114499	2024-04-29 10:54:29 +02:00
Alaska	afa66fc628	Shader: Clamp invalid inputs of various BSDF nodes Clamp some of the inputs of the Glossy BSDF, Glass BSDF, Sheen BSDF, and Subsurface Scattering nodes to improve consistency between render engines and to avoid unexpected results. * Clamp roughness to 0..1 * Clamp subsurface radius to 0..inf * Clamp colors to 0..inf Pull Request: https://projects.blender.org/blender/blender/pulls/120390	2024-04-26 17:39:39 +02:00
Alaska	03cb588529	Refactor: Cycles: Replace fixed Tangent input with custom input This replaces the fixed Tangent input in BsdfNode::compile with a custom input. This is done because very few nodes actually use the tangent input and it would be better to have this slot available for other inputs on different nodes in the future. Pull Request: https://projects.blender.org/blender/blender/pulls/119042	2024-04-26 15:25:14 +02:00
Weizhen Huang	8e5840359f	Refactor: deduplicate assignment of background light group by ensuring `KernelLight.lightgroup` is properly assigned in `device_update_light()`. The value is later retrieved via `lamp_lightgroup(kg, lamp)`. `light_manager->device_update()` is called after `background->device_update()`, so the background light group should already been assigned when `device_update_lights()` is called. Pull Request: https://projects.blender.org/blender/blender/pulls/120714	2024-04-17 18:49:03 +02:00
Weizhen Huang	a6330cad7f	Refactor: wrap block of codes into a function	2024-04-17 18:49:00 +02:00
Weizhen Huang	45d2e46752	Cleanup: remove unnecessary shader flag in Chiang Hair BSDF lcg is not needed since `5f9b518a8b`	2024-04-17 18:49:00 +02:00
Weizhen Huang	481b50e06d	Cleanup: pass shader flag as parameter instead of ShaderData	2024-04-17 18:49:00 +02:00
Weizhen Huang	2e1380fa0c	Cleanup: use available function to compute render buffer pointer use available `film_pass_pixel_render_buffer()` to access the pointer to the render buffer. For shadow state, a similar function `film_pass_pixel_render_buffer_shadow()` is created, because `shadow_path` instead of `path` is needed.	2024-04-17 18:49:00 +02:00
Weizhen Huang	0c9ce4ba4f	Refactor: handle MIS weight in lower-level functions it is difficult to keep in mind when MIS weight is needed, better to handle this logic in the lower-level functions. This reduces code duplication in many places.	2024-04-17 18:49:00 +02:00
Weizhen Huang	418acfe8bb	Cleanup: remove unused function parameters This is not a complete list of all the unused parameters in kernel, but those I touch often, so I am more confident that it's safe to delete them.	2024-04-17 18:49:00 +02:00
Sergey Sharybin	56cbde568b	Fix #120046 : Broken render on AMD Metal GPUs This is a regression since `fdc2962beb` The size of state can not be different between CPU and GPU. This change replaces compile-time condition with a kernel feature check, which solves the render regression on AMD Metal. It also minimizes the state size on other GPUs when Light Tree is disabled. Pull Request: https://projects.blender.org/blender/blender/pulls/120476	2024-04-11 10:51:45 +02:00
Campbell Barton	6b14b67a29	Cleanup: spelling in comments	2024-04-08 16:23:25 +10:00
Sergey Sharybin	f6bff96ac5	Fix #120119 : Area lamp artifacts in Cycles with light trees This is a regression in 4.1, caused by `36e603c430`. For unbiased MIS weight in light tree, we should use sd->N for mis_origin_n, since sc->N is not available in NEE. The change also makes it so we do not sample lights below sd->N even when bump map correction is disabled. This diverges from the original idea of giving full control to artists, but ensures the internal math is happy. Pull Request: https://projects.blender.org/blender/blender/pulls/120216	2024-04-05 10:11:33 +02:00
Campbell Barton	7e9f7320e4	Cleanup: spelling in comments & comment blocks	2024-04-04 11:26:28 +11:00
Xavier Hallade	891d71a4d4	Cycles: Drop noinline keyword for oneAPI device `fdc2962beb` indirectly introduced a change in inlining (light_tree_pdf started getting inlined) that led to a 5-10% drop in performance for most scenes. Dropping the noinline keyword for oneAPI device recovers it. It however brings another performance regression to MNEE and Raytrace kernels, that we'll look into separately.	2024-04-02 18:29:35 +02:00
Campbell Barton	2be407fc82	Cleanup: spelling in comments	2024-04-01 16:47:57 +11:00
Hoshinova	c78c6b0bdf	Fix #119797 : Noise Texture Precision Issues The Perlin noise algorithms suffer from precision issues when a coordinate is greater than about 250000. To fix this the Perlin noise texture is repeated every 100000 on each axis. This causes discontinuities every 100000, however at such scales this usually shouldn't be noticeable. Pull Request: https://projects.blender.org/blender/blender/pulls/119884	2024-03-29 16:12:23 +01:00
Weizhen Huang	68253b4937	Fix: Cycles volume light tree using wrong point to compute uncertainty angle it is not clear from which point the `cos_theta_u` should be computed in volume segment, so the original implementation was mixing the closest point and the point where the minimal angle is formed. Use the closest point on segment as a conservative measure. Pull Request: https://projects.blender.org/blender/blender/pulls/119965	2024-03-27 15:06:05 +01:00
Weizhen Huang	72780c3769	Fix: Cycles light tree assertion failing on Intel Mac in the test scene `all_light_types_in_volume.blend`, `theta - theta_o - theta_u` is slightly above the threshold. Even if we do a strict check with `acos` on the failing cases, it will go to the other branch and deliver a result which is also 1.0f. Better to relax the threshold.	2024-03-27 13:46:57 +01:00
Weizhen Huang	1bf467aff6	Cleanup: make format	2024-03-26 09:39:22 +01:00
Alaska	a4f1a52a5c	Fix #119903 : Cycles compilation on device that don't use the light tree During the some of the shading for volumetrics, Cycles would try to write to a variable that does not exist if the device has the light tree disabled. At the moment this only impacts AMD GPUs with the Metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/119906	2024-03-26 08:20:09 +01:00
Weizhen Huang	fdc2962beb	Fix #114634 : correlated samples in volume when using equiangular sampling and light tree The same random number was used when sampling from the volume segment and from the direct scattering position, causing correlation issues with light tree. To solve this problem, we ensure the same light is picked for volume segment/direct scattering, equiangular/distance sampling by sampling the light tree only once in volume segment. From the direct scattering position in volume, we sample a position on the picked light as usual. If sampling from the light tree fails, we continue with indirect scattering. For unbiased MIS weight for forward sampling, we retrieve the `P`, `D` and `t` used in volume segment for traversing the light tree. The main changes are: 1. `light_tree_sample()` and `light_distribution_sample()` now only pick lights. Sampling a position on light is done separately via `light_sample()`. 2. `light_tree_sample()` is now only called only once from volume segment. For direct lighting we call `light_sample()`. 3. `light_tree_pdf()` now has a template `<in_volume_segment>`. 4. A new field `emitter_id` is added to struct `LightSample`, which just stores the picked emitter index. 5. Additional field `previous_dt = ray->tmax - ray->tmin` is added to `state->ray`, because we need this quantity for computing the pdf. 6. Distant/Background lights are also picked by light tree in volume segment now, because we have no way to pick them afterwards. The direct sample event for these lights will be handled by `VOLUME_SAMPLE_DISTANCE`. 7. Original paper suggests to use the maximal importance, this results in very poor sampling probability for distant and point lights therefore excessive noise. We have a minimal importance for surface to balance, we could do the same for volume but I do not want to spend much time on this now. Just doing `min_importance = 0.0f` seems to do the job okayish. This way we still won't sample the light with zero `max_importance`. The current solution might perform worse with distance sampling, because the light tree measure is biased towards equiangular sampling. However, it is difficult to perform MIS between equiangular and distance sampling if different lights are picked for each method. This is something we can look into in the future if proved to be a serious regression. Pull Request: https://projects.blender.org/blender/blender/pulls/119389	2024-03-25 18:50:52 +01:00

1 2 3 4 5 ...

3533 Commits