test2

Author	SHA1	Message	Date
Jeroen Bakker	a407186dbf	GPU: Make shader cache clearing backend independent Parallel shader compilation introduced `GPU_shader_cache_dir_clear_old`. The implementation was specific to OpenGL and could not be overwritten by other backends. This PR improves the implementation so the backend can have its own implementation. This is needed for upcoming changes to the Vulkan backend where we want to use similar mechanisms to speed up shader compilation and caching. Pull Request: https://projects.blender.org/blender/blender/pulls/127680	2024-09-16 14:03:14 +02:00
Jeroen Bakker	529c720457	Fix #126412 : Metal default alpha value Metal doesn't support RGB textures and the backend converts them to RGBA textures. During the conversion missing RGB components should be set to 0 and missing A component should be set to 1. In the current implementation this was not the case and A components where also set to 0. PR should be backported to 4.2 Pull Request: https://projects.blender.org/blender/blender/pulls/126630	2024-08-22 11:52:34 +02:00
Clément FOUCAULT	d712f91881	DRW: Primitive Expansion This PR introduces the concept of primitive expansion draws. This allows to create a drawcall that will generate N amount of new primitive for an original primitive in a `gpu::Batch`. The intent is to phase out the use of geometry shader for this purpose. This adds a new `Frequency::GEOMETRY` only available for SSBOs. The resources using this will be fed the current `gpu::Batch` VBOs using name matching. A dedicated slot is reserved for the index buffer, which has its own internal lib to decode the index buffer content. A new attribute lib is added to ease the loading of unaligned attribute. This should be revisited and made obsolete once more refactor lands. It is similar to the Metal backend SSBO vertex fetch path but it is defined on a different level. The main difference is that this PR is backend independant and modify the draw module instead of the GPU module. However, it doesn't cover all possible attribute conversion cases. This will only be added if needed. This system is less automatic than the Metal backend one and needs more care to make sure the data matches what the shader expects. The Metal system will be removed once all its usage have been converted. This PR only shows example usage for workbench shadows. Cleanup PRs will follow this one. Rel #105221 Pull Request: https://projects.blender.org/blender/blender/pulls/125782	2024-08-03 11:06:17 +02:00
Hans Goudey	79416a8b96	Refactor: GPU: Simplify access to vertex buffer data Add a `.data<T>()` method that retrieves a mutable span. This is useful more and more as we change to filling in vertex buffer data arrays directly, and compared to raw pointers it's safer too because of asserts in debug builds. Pull Request: https://projects.blender.org/blender/blender/pulls/123338	2024-06-18 21:10:45 +02:00
Jeroen Bakker	b598bd4a6f	Merge branch 'blender-v4.2-release'	2024-06-18 10:55:24 +02:00
Jeroen Bakker	c525c0354f	EEVEE: Film accumulation workaround for Metal/Intel iGPUs EEVEE Film accumulation workaround for Metal/Intel iGPUs. On Metal the Intel iGPUs do not support image read write on array textures. However this limitation doesn't show any artifacts when using the compute shader. This PR is a work around that uses the film_comp shader to process the film samples, but uses a separate film_copy_frag shader to read the result and copy them to the frame buffer. I deliberately didn't include the fix to the film_frag shader as that would change the read/write resources and could lead to performance issues for other platforms. Writable resources are typically slower compared to read only resources. Some code needed to be duplicated (and not added to `*_lib.glsl`) as compilers would still raise compilation errors due to imageStore/Load on incompatible resource access. The Metal/Intel iGPU is also marked to have limited support as raytracing and probes still produces big artifacts. This workaround can be tested on any platform just by setting `use_compute_ = true` in `Film::sync` Related to #122361 Pull Request: https://projects.blender.org/blender/blender/pulls/123330	2024-06-18 10:53:53 +02:00
Jacques Lucke	8b7cde3efc	Merge branch 'blender-v4.2-release'	2024-06-14 20:19:03 +02:00
Miguel Pozo	a22a4810c7	GPU: Use size_t for GPU buffer sizes Update all GPU buffer size-related functions to use `size_t` for consistency. Pull Request: https://projects.blender.org/blender/blender/pulls/123240	2024-06-14 19:27:33 +02:00
Jeroen Bakker	f13e51543c	Cleanup: Fix spelling mistake Attachement -> Attachment Pull Request: https://projects.blender.org/blender/blender/pulls/122988	2024-06-10 09:57:15 +02:00
Miguel Pozo	83db9cc7b4	Merge branch 'blender-v4.2-release'	2024-06-07 18:47:47 +02:00
Miguel Pozo	22652b305e	GPU: Add GPU_shaders_precompile_specializations Allow precompiling specialization constants variations in parallel. Only supported in OpenGL as the rest of the batch compilation API, on the other backends the function is a no-op. This also moves the `SpecializationConstant` from `gpu_shader_create_info` (private API) into`GPU_common_types` (public API). Pull Request: https://projects.blender.org/blender/blender/pulls/122796	2024-06-07 18:45:31 +02:00
Campbell Barton	d98a7a7756	Merge branch 'blender-v4.2-release'	2024-06-06 10:23:16 +10:00
Campbell Barton	7f7648c6ed	Cleanup: spelling in code comments & minor edits - Use uppercase NOTE: tags. - Correct bNote -> bNode. - Use colon after parameters. - Use doxy-style doc-strings.	2024-06-06 09:55:13 +10:00
Lukas Stockner	5891a73785	Merge branch 'blender-v4.2-release'	2024-06-05 20:25:50 +02:00
Hans Goudey	84c4ddbbb9	Cleanup: GPU: Use references for some vertex buffer functions Pull Request: https://projects.blender.org/blender/blender/pulls/122784	2024-06-05 18:47:22 +02:00
Miguel Pozo	74224b25a5	GPU: Add GPU_shader_batch_create_from_infos This is the first commit of the several required to support subprocess-based parallel compilation on OpenGL. This provides the base API and implementation, and exposes the max subprocesses setting on the UI, but it's not used by any code yet. More information and the rest of the code can be found in #121925. This one includes: - A new `GPU_shader_batch` API that allows requesting the compilation of multiple shaders at once, allowing GPU backed to compile them in parallel and asynchronously without blocking the Blender UI. - A virtual `ShaderCompiler` class that backends can use to add their own implementation. - A `ShaderCompilerGeneric` class that implements synchronous/blocking compilation of batches for backends that don't have their own implementation yet. - A `GLShaderCompiler` that supports parallel compilation using subprocesses. - A new `BLI_subprocess` API, including IPC (required for the `GLShaderCompiler` implementation). - The implementation of the subprocess program in `GPU_compilation_subprocess`. - A new `Max Shader Compilation Subprocesses` option in `Preferences > System > Memory & Limits` to enable parallel shader compilation and the max number of subprocesses to allocate (each subprocess has a relatively high memory footprint). Implementation Overview: There's a single `GLShaderCompiler` shared by all OpenGL contexts. This class stores a pool of up to `GCaps.max_parallel_compilations` subprocesses that can be used for compilation. Each subprocess has a shared memory pool used for sending the shader source code from the main Blender process and for receiving the already compiled shader binary from the subprocess. This is synchronized using a series of shared semaphores. The subprocesses maintain a shader cache on disk inside a `BLENDER_SHADER_CACHE` folder at the OS temporary folder. Shaders that fail to compile are tried to be compiled again locally for proper error reports. Hanged subprocesses are currently detected using a timeout of 30s. Pull Request: https://projects.blender.org/blender/blender/pulls/122232	2024-06-05 18:45:57 +02:00
Hans Goudey	da1ea4cdd1	Revert "Draw: Avoid temporary copy for mesh triangulation index buffer" This reverts commit `108ab1df2d`. This causes issues when duplicating objects that I don't have time to investigate right now.	2024-05-23 23:43:34 -04:00
Hans Goudey	108ab1df2d	Draw: Avoid temporary copy for mesh triangulation index buffer The mesh triangulation data is stored in CPU memory with the same format as the triangles GPU index buffer. Because of that we can skip creating a temporary copied owned by the GPU API. One way to do that is to just upload the data directly and avoid keeping a reference to it. However, we can only upload GPU data from the main thread with OpenGL, so instead reference the data and keep track of whether to free it. When drawing a mesh with a single material and 1.8 million faces, this change gives a 12-15% improvement in framerate, from about 32 to 37 FPS. Part of #116901. Pull Request: https://projects.blender.org/blender/blender/pulls/122175	2024-05-23 19:59:36 +02:00
Clément Foucault	e16a0b869b	Cleanup: Metal: Use `MTLContext::get()` instead of static casts	2024-05-18 14:43:45 +02:00
Jason Fielder	47ada34324	Metal: Remove redundant synchronization operations Remove both compute barriers and useResource calls as explicit resources bound via setTexture and setComputeBuffer are implicitly tracked by the Metal API anyway, so these calls increase complexity, without altering correctness Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/121598	2024-05-17 13:38:55 +02:00
Jason Fielder	405be29540	Metal: Fix texture update data sizing for compressed textures Resolves issue for texture data upload sizing not taking compressed texture input data size into account. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/121210	2024-05-13 09:35:08 +02:00
Clément Foucault	414854a09f	Metal: Change error assert into system backtrace This matches the behavior of the GL backend debug.	2024-05-11 12:47:23 +02:00
Campbell Barton	4f5f0040c0	Cleanup: back-tick quote file extensions in code-comments	2024-05-04 15:06:46 +10:00
Campbell Barton	9918488bb1	Cleanup: use uppercase tags, following own style guide	2024-05-03 11:33:21 +10:00
Clément Foucault	ce768d43a1	MTL: Increase log level to errors for missing texture binds This avoid missing them as they are true user level errors.	2024-04-23 23:08:19 +02:00
Jeroen Bakker	0c2085a316	GPU: Remove GPU_compute_shader_support Compute shaders are required since 4.0. There was one occasion where an older AMD driver failed and support was turned off. This driver is now marked unsupported. This PR includes: - removing the check in viewport compositing - remove properties from system info - always construct draw manager. - remove unused pass logic in draw hair/curves - add deprecation warning when accessed from python Pull Request: https://projects.blender.org/blender/blender/pulls/120909	2024-04-22 13:28:10 +02:00
Clément Foucault	f2ae04db10	GPU: Implement missing UBO/SSBO bind tracking This PR adds a context function to consider all buffer bindings obsolete. This is in order to track missing binds and invalid lingering states accross `draw::Pass`es. The functions `GPU_storagebuf_debug_unbind_all` and `GPU_uniformbuf_debug_unbind_all` do nothing more than resetting the internal debug slot bits to zero. This is what OpenGL backend does as it doesn't track the bindings themselves. Other backends might have other way to detect missing bindings. If not they should be implemented separately anyway. I renamed the function to `debug_unbind_all` to denote that it actually does something related to debugging. This also add SSBO binding check for OpenGL as it was also missing. #### Future This error checking logic is pretty much backend agnostic. While it would be nice to move it at `gpu::Context` level, we don't have the resources for that now. Pull Request: https://projects.blender.org/blender/blender/pulls/120716	2024-04-17 11:06:39 +02:00
Campbell Barton	6e3eaae299	Cleanup: spelling in comments	2024-04-14 12:13:55 +10:00
Campbell Barton	43179864f4	Cleanup: remove strcpy usage	2024-04-14 11:58:14 +10:00
Clément Foucault	b07d392b5a	MTL: Remove warning in debug build	2024-04-13 09:12:22 +02:00
Jason Fielder	5f86faf3a5	Metal: Fix write-only qualifier on shader generation A small logic issue caused all write-only image resources to be tagged as read-write in all cases. This caused correctness issues on Intel and AMD GPUs which are resolved through this change. Change also yields a small performance uplift due to enabling improved non-dependent workload scheduling. Authored by Apple: Michael Parkin-White Co-authored-by: Michael Parkin-White <mparkinwhite@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/120528	2024-04-11 18:17:40 +02:00
Jason Fielder	be32bc5b72	Metal: Add AMD support for subpass transition Adds support for subpass transition for AMD/Intel IMR GPUs. This enables correct functioning of EEVEE Next deferred lighting pass on AMD platforms. The emulation is consistent with the OpenGL approach of generating additional texture bindings in the shader for subpass inputs, and splitting render passes across sub-pass boundaries. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119784	2024-04-11 15:23:53 +02:00
Jason Fielder	5753a27624	Metal: Disable usage attachment for atomic buffer textures Removes the implicit USAGE_ATTACHMENT flag from atomic fallback textures which are buffer-backed. This usage flag results in a validation failure, and is not required by these textures as they are cleared via the backing buffer. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119785	2024-04-11 10:46:20 +02:00
Campbell Barton	7e9f7320e4	Cleanup: spelling in comments & comment blocks	2024-04-04 11:26:28 +11:00
Campbell Barton	362d381a5a	Cleanup: pass GPUStateMutable as a const reference	2024-03-28 18:10:49 +11:00
Campbell Barton	155dae94d7	Cleanup: code-comments, use doxygen formatting & spelling corrections Also move some function doc-strings from the implementation to their declarations.	2024-03-26 17:55:20 +11:00
Hans Goudey	893130e6fe	Refactor: Remove unnecessary C wrapper for GPUBatch class Similar to `fe76d8c946` Pull Request: https://projects.blender.org/blender/blender/pulls/119898	2024-03-26 03:06:25 +01:00
Aras Pranckevicius	26337b9fb4	Metal: implement support for compressed textures Noticed lack of it via #119793. Now DDS images using BC1/BC2/BC3 (aka DXT1/DXT3/DXT5) formats can keep on being GPU compressed on Metal too, just like e.g. on OpenGL. Pull Request: https://projects.blender.org/blender/blender/pulls/119835	2024-03-25 11:40:20 +01:00
Hans Goudey	b54d9875ba	Fix: Another Metal build error after recent refactor Sorry for the noise, I misread the output from the PR build.	2024-03-24 13:24:03 -04:00
Hans Goudey	aa87b747c5	Fix: Additional macOS metal build error	2024-03-24 12:37:36 -04:00
Hans Goudey	e201b5e553	Fix: Debug build error after previous commit	2024-03-24 12:17:44 -04:00
Hans Goudey	fe76d8c946	Refactor: Remove unnecessary C wrappers for vertex and index buffers Now that all relevant code is C++, the indirection from the C struct `GPUVertBuf` to the C++ `blender::gpu::VertBuf` class just adds complexity and necessitates a wrapper API, making more cleanups like use of RAII or other C++ types more difficult. This commit replaces the C wrapper structs with direct use of the vertex and index buffer base classes. In C++ we can choose which parts of a class are private, so we don't risk exposing too many implementation details here. Pull Request: https://projects.blender.org/blender/blender/pulls/119825	2024-03-24 16:38:30 +01:00
Hans Goudey	8b514bccd1	Cleanup: Move remaining GPU headers to C++ Pull Request: https://projects.blender.org/blender/blender/pulls/119807	2024-03-23 01:24:18 +01:00
Campbell Barton	57dd9c21d3	Cleanup: spelling in comments	2024-03-21 10:02:53 +11:00
Jason Fielder	661d12aef7	Fix #119195 : Ensure Metal uses correct attribute conversion mode Resolves custom attribute types for ints and booleans by ensuring conversion mode is correct. Previously, the attribute declarations were assumed to be linear. However, patch ensures the correct attribute index is now fetched, ensuring the conversion mode is correctly specified for non-linear attribute ID's. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119569	2024-03-18 13:38:09 +01:00
Jason Fielder	6768ded895	Fix #118868 : Metal render pass output for EEVEE Next Resolves render pass export for EEVEE Next on Metal. Reads from texture views was previously utilising the root texture rather than the view variant, resulting in views into texture arrays being incorrectly sampled. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119563	2024-03-16 20:16:37 +01:00
Jason Fielder	ecffea86b1	Metal: Fix Storage buffer read sync affecting surfels Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119093	2024-03-14 09:40:59 +01:00
laurynas	aa3ffca8dc	Fix #119247 : Curves: Extra point in evaluated spline of Curves geometry In `bf17fc8d79` after extending buffer to multiple of 4 there appeared trailing space in buffer not covered by shader's `for` loop. Pull Request: https://projects.blender.org/blender/blender/pulls/119346	2024-03-12 15:01:10 +01:00
Jason Fielder	06ac33bdd2	Metal: Fix SSBO from VBO size assertion Resolves assertion firing when creating an SSBO from a VBO which is not aligned to 16 bytes. Required to ensure API validation is satisfied. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119298	2024-03-11 08:28:14 +01:00
Jason Fielder	703353b5da	Metal: Fix uniform upload for small types This patch adds special cases to Shader::uniform_int routine to allow writing of small types (1 bytes, 2 bytes) to the push constant buffer. This previously interpreted all incoming push constant data as integer components only, resulting in rendering artifacts such as bad SRGB mode selection and shader editor not rendering due to mis-aligned overlay parameter, as the uniform assignment would overflow consecutive small types. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119285	2024-03-10 19:36:30 +01:00

1 2 3 4 5 ...

373 Commits