test2

Author	SHA1	Message	Date
Sergey Sharybin	30b962b3d8	Cycles: Optimize 3d and 4d noise The goal is to reduce the effect of the fmod() used in the noise code, which was initially reported in the comment: https://projects.blender.org/blender/blender/pulls/119884#issuecomment-1258902 Basic idea is to benefit from SIMD vectorization on CPU. Tested on Linux i9-11900K and macOS on M2 Ultra, in both cases performance after this change is very close to what it could be with the fmod() commented out (the call itself, `p = p + precision_correction`). On macOS the penalty of fmod() was about 10%, on Linux it was closer to 30% when built with GCC-13. With Linux builds from the buildbot it is more like 18%. The optimization is only done for 3d and 4d noise. It might be possible to gain some performance improvement for 1d and 2d cases, but the approach would need to be different: we'd need to optimize scalar version fmodf(). Maybe tricks with integer cast will be faster (since we are a bit optimistic in the kernel and do not guarantee exact behavior in extreme cases such as NaN inputs). Pull Request: https://projects.blender.org/blender/blender/pulls/137109	2025-04-09 13:40:10 +02:00
Sergey Sharybin	5b0ed683a0	Cycles: Make select() and mask() for vectorized float work on CPU and GPU Pull Request: https://projects.blender.org/blender/blender/pulls/137148	2025-04-08 17:04:18 +02:00
Brecht Van Lommel	07b60c189b	Cycles: Perform attribute subdivision on the host side * Add SubdAttributeInterpolation class for linear attribute interpolation. * Dicing computes ptex UV and face ID for interpolation. * Simplify mesh storage of subd primitive counts * Remove kernel code for subd attribute interpolation * Remove patch table packing and upload The old optimization adds a fair amount of complexity to the kernel, affecting performance even when not using the feature. It's also not that useful as it does not work for UVs that need special interpolation. With this simpler code it should be easier to make it feature complete. Pull Request: https://projects.blender.org/blender/blender/pulls/135681	2025-03-11 20:58:07 +01:00
Lukas Stockner	8cb5e05c48	Cleanup: Cycles: Deduplicate kernel attribute code using templating The attribute handling code in the kernel is currently highly duplicated since it needs to handle five different data types and we couldn't use templates back then. We can now, so might as well make use of it and get rid of ~1000 lines. There are also some small fixes for the GPU OSL code: - Wrong derivative for .w component when converting float2/float3->float4 - Different conversion for float2->float (CPU averages, GPU used to take .x) - Removed useless code for converting to float2, not used by OSL Pull Request: https://projects.blender.org/blender/blender/pulls/134694	2025-02-20 19:28:45 +01:00
Brecht Van Lommel	f80f97ca0d	Refactor: Cycles: Rename rcp to reciprocal To avoid symbol conflicts with upcoming HIP changes. Also remove unused implementations for float4 and float8. Pull Request: https://projects.blender.org/blender/blender/pulls/134045	2025-02-04 18:59:24 +01:00
Brecht Van Lommel	612cb61199	Cleanup: Cycles: Use simpler make_float3 for single value	2025-01-13 10:07:37 +01:00
Brecht Van Lommel	57ff24cb99	Refactor: Cycles: Add const keyword to more function parameters Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:24 +01:00
Brecht Van Lommel	dd51c8660b	Refactor: Cycles: Add const keyword where possible, using clang-tidy Check was misc-const-correctness, combined with readability-isolate-declaration as suggested by the docs. Temporarily clang-format "QualifierAlignment: Left" was used to get consistency with the prevailing order of keywords. Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:20 +01:00
Brecht Van Lommel	7db0bc2e64	Refactor: Cycles: Make math and type headers work by themselves Remove separate impl.h headers, shuffle around some code and add more includes so that individual header files compile without errors. Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:22:47 +01:00
Brecht Van Lommel	f53e13411b	Refactor: Cycles: Use #pragma once Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:22:45 +01:00
Weizhen Huang	2ddc936cbb	Cleanup: simplify computation in Cycles area light sampling `diff` in numerator and denominator cancels out, except for the signs, which we know beforehand since `x1 > x0`, `y1 > y0` Pull Request: https://projects.blender.org/blender/blender/pulls/122740	2024-06-04 23:37:48 +02:00
Lukas Stockner	fbc770d58b	Cycles: Fallback to area PDF when spherical rectangles are too small This fixes #69535 and #98930. We use an equi-solid-angle sampling algorithm for rectangular area lights, but it is not particularly robust for small area lights (either small in general and/or small because it's being viewed from grazing angles). The actual sampling part is fine since it just gets clamped into the valid area anyways, and the difference isn't notable for small lights. However, we also need to compute the solid angle to get the sampling PDF, and that computation is quite sensitive to numerical issues for small values. Therefore, this commit adds a fallback path for small values, which instead uses the classic equi-area sampling PDF term times the area-to-solid-angle Jacobian term. This approximation assumes that all points on the light have the same distance and angle to the sampling point, which is of course not strictly the case, but it's close enough for small area lights and better than failing altogether. Pull Request: https://projects.blender.org/blender/blender/pulls/122323	2024-06-04 01:55:26 +02:00
Hoshinova	c78c6b0bdf	Fix #119797 : Noise Texture Precision Issues The Perlin noise algorithms suffer from precision issues when a coordinate is greater than about 250000. To fix this the Perlin noise texture is repeated every 100000 on each axis. This causes discontinuities every 100000, however at such scales this usually shouldn't be noticeable. Pull Request: https://projects.blender.org/blender/blender/pulls/119884	2024-03-29 16:12:23 +01:00
Thomas Dinges	30a22b92ca	Cycles: Rename SSE4.1 kernel to SSE4.2 This commit updates all defines, compiler flags and cleans up some code for unused CPU capabilities. There should be no functional change, unless it's run on a CPU that supports sse41 but not sse42. It will fallback to the SSE2 kernel in this case. In preparation for the new SSE4.2 minimum in Blender 4.2. Pull Request: https://projects.blender.org/blender/blender/pulls/118043	2024-02-09 17:25:58 +01:00
Campbell Barton	c12994612b	License headers: use SPDX-FileCopyrightText in intern/cycles	2023-06-14 16:53:23 +10:00
Hoshinova	144ad4d20b	Nodes: add Fractal Voronoi Noise Fractal noise is the idea of evaluating the same noise function multiple times with different input parameters on each layer and then mixing the results. The individual layers are usually called octaves. The number of layers is controlled with a "Detail" slider. The "Lacunarity" input controls a factor by which each successive layer gets scaled. The existing Noise node already supports fractal noise. Now the Voronoi Noise node supports it as well. The node also has a new "Normalize" property that ensures that the output values stay in a [0.0, 1.0] range. That is except for the F2 feature where in rare cases the output may be outside that range even with "Normalize" turned on. How the individual octaves are mixed depends on the feature and output socket: - F1/Smooth F1/F2: - Distance/Color output: The individual Distance/Color octaves are first multiplied by a factor of `Roughness ^ (#layers - 1.0)` then added together to create the final output. - Position output: Each Position octave gets linearly interpolated with the combined output of the previous octaves. The Roughness input serves as an interpolation factor with 0.0 resulting in only using the combined output of the previous octaves and 1.0 resulting in only using the current highest octave. - Distance to Edge: - Distance output: The Distance octaves are mixed exactly like the Position octaves for F1/Smooth F1/F2. It should be noted that Voronoi Noise is a relatively slow noise function, especially at higher dimensions. Increasing the "Detail" makes it even slower. Therefore, when optimizing a scene one should consider trying to use simpler noise functions instead of Voronoi if the final result is close enough. Pull Request: https://projects.blender.org/blender/blender/pulls/106827	2023-06-13 09:18:12 +02:00
Brecht Van Lommel	63dfbdc187	Fix mistake in Cycles float4 pow Not currently used anywhere, so should be no user visible change.	2023-04-25 20:19:43 +02:00
Brecht Van Lommel	e1b3d91127	Refactor: replace Cycles sse/avx types by vectorized float4/int4/float8/int8 The distinction existed for legacy reasons, to easily port Embree intersection code without affecting the main vector types. However we are now using SIMD for these types as well, so no good reason to keep the distinction. Also more consistently pass these vector types by value in inline functions. Previously it was partially changed for functions used by Metal to avoid having to add address space qualifiers, simple to do it everywhere. Also removes function declarations for vector math headers, serves no real purpose. Differential Revision: https://developer.blender.org/D16146	2022-11-08 12:28:40 +01:00
Andrii Symkin	f00d9e80ae	Cycles: add more math functions for float4 Add more math functions for float4 to make them on par with float3 ones. It makes it possible to change the types of float3 variables to float4 without additional work. Differential Revision: https://developer.blender.org/D15318	2022-06-30 16:25:21 +02:00
Andrii Symkin	c2a2f3553a	Cycles: unify math functions names This patch unifies the names of math functions for different data types and uses overloading instead. The goal is to make it possible to swap out all the float3 variables containing RGB data with something else, with as few as possible changes to the code. It's a requirement for future spectral rendering patches. Differential Revision: https://developer.blender.org/D15276	2022-06-23 15:02:53 +02:00
Brecht Van Lommel	51380b9346	Fix Cycles Metal build error and GCC warning after recent changes Function overloading of make_float4() doesn't work since it's a macro, just don't do this minor cleanup then.	2022-03-23 23:25:31 +01:00
Ethan-Hall	f5066d43ae	Cleanup: use make_float4(f) zero_float4() to simplify code Differential Revision: https://developer.blender.org/D14426	2022-03-23 20:06:12 +01:00
Brecht Van Lommel	9cfc7967dd	Cycles: use SPDX license headers * Replace license text in headers with SPDX identifiers. * Remove specific license info from outdated readme.txt, instead leave details to the source files. * Add list of SPDX license identifiers used, and corresponding license texts. * Update copyright dates while we're at it. Ref D14069, T95597	2022-02-11 17:47:34 +01:00
Michael Jones	d19e35873f	Cycles: several small fixes and additions for MSL This patch contains many small leftover fixes and additions that are required for Metal-enablement: - Address space fixes and a few other small compile fixes - Addition of missing functionality to the Metal adapter headers - Addition of various scattered `__KERNEL_METAL__` blocks (e.g. for atomic support & maths functions) Ref T92212 Differential Revision: https://developer.blender.org/D13263	2021-11-18 14:38:02 +01:00
Brecht Van Lommel	fd25e883e2	Cycles: remove prefix from source code file names Remove prefix of filenames that is the same as the folder name. This used to help when #includes were using individual files, but now they are always relative to the cycles root directory and so the prefixes are redundant. For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:37:04 +02:00
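
The fallback described in commit fbc770d58b (equi-area PDF times the area-to-solid-angle Jacobian) can be illustrated with a minimal sketch. This is not the Cycles kernel code: the function name `area_light_pdf_fallback` and its parameters are hypothetical, and the sketch assumes, as the commit message does, that every point on the light shares one distance and one angle to the shading point.

```cpp
#include <cmath>

// Hypothetical helper, not the Cycles implementation.
// Approximates the solid-angle PDF of a small area light as
//   pdf_solid_angle = (1 / area) * distance^2 / |cos_theta|
// where 1 / area is the uniform equi-area PDF and
// distance^2 / |cos_theta| is the area-to-solid-angle Jacobian.
// cos_theta is the cosine between the light normal and the direction
// from the light towards the shading point.
inline float area_light_pdf_fallback(const float area,
                                     const float distance,
                                     const float cos_theta)
{
  const float pdf_area = 1.0f / area;
  const float jacobian = (distance * distance) / std::fabs(cos_theta);
  return pdf_area * jacobian;
}
```

For a light that faces the shading point directly, the result is the reciprocal of the approximate solid angle `area / distance^2`, which is the value a uniform solid-angle sampler would be expected to report for a very small light.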

25 Commits
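
The coordinate wrapping described in commit c78c6b0bdf (repeating the Perlin noise every 100000 units per axis) can be sketched in scalar form as follows. This is an illustration under assumptions, not the kernel code: `wrap_noise_coordinate` and `float_spacing` are hypothetical names, and the `precision_correction` term mentioned in commit 30b962b3d8 is deliberately left out because the log does not define it.

```cpp
#include <cmath>
#include <limits>

// Repeat period taken from the commit message of c78c6b0bdf.
constexpr float kNoiseRepeat = 100000.0f;

// Hypothetical helper: folds a coordinate into (-kNoiseRepeat, kNoiseRepeat)
// so that the value passed to the noise function stays small enough to
// keep fractional precision. std::fmod keeps the sign of its first argument.
inline float wrap_noise_coordinate(const float x)
{
  return std::fmod(x, kNoiseRepeat);
}

// Hypothetical helper: distance from |x| to the next representable float,
// i.e. the smallest coordinate step that can still be distinguished at x.
inline float float_spacing(const float x)
{
  const float ax = std::fabs(x);
  return std::nextafter(ax, std::numeric_limits<float>::infinity()) - ax;
}
```

Comparing `float_spacing(250000.0f)` with `float_spacing(wrap_noise_coordinate(250000.0f))` shows why the wrap helps: the representable step is smaller after wrapping, at the cost of a discontinuity at each multiple of the period, as the commit message notes.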