test2

Author	SHA1	Message	Date
Clément Foucault	9990273d04	GPU: Change `Type` enum to use lower case values This is to help for future resource declaration using macros. Rel #137261 Pull Request: https://projects.blender.org/blender/blender/pulls/137367	2025-04-11 22:39:01 +02:00
Clément Foucault	bb52754652	GPU: Use `f` suffix for float literals There are actually already some literals with the `f` suffix in our shader codebase and we have never had a problem in the past 5 years (or even 8 years). So I think it is safe to do and improves convergence of code styles. Pull Request: https://projects.blender.org/blender/blender/pulls/137352	2025-04-11 18:28:45 +02:00
Miguel Pozo	a5ed5dc4bf	GPU: Support deferred compilation in ShaderCompilerGeneric Update the `ShaderCompilerGeneric` to support deferred compilation using the batch compilation API, so we can get rid of `drw_manager_shader`. This approach also allows supporting non-blocking compilation for static shaders. This shouldn't cause any behavior changes at the moment, since batch compilation is not yet used when parallel compilation is disabled. This adds a `GPUWorker` and a `GPUSecondaryContext` as an easy to use wrapper for managing secondary GPU contexts. (Part of #133674) Pull Request: https://projects.blender.org/blender/blender/pulls/136518	2025-04-07 15:26:25 +02:00
Clément Foucault	3562433ae7	pyGPU: Deprecate `Shader.program` getter This is getting in the way of making the GPUShader API more threadsafe. This getter already doesn't work for Vulkan and Metal, and has very limited usage. Keeping the Python function to avoid errors and display a deprecation warning. Pull Request: https://projects.blender.org/blender/blender/pulls/136983	2025-04-04 14:23:09 +02:00
Omar Emara	56b0b709ea	Compositor: Support GPU OIDN denoising This patch supports GPU OIDN denoising in the compositor. A new compositor performance option was added to allow choosing between CPU, GPU, and Auto device selection. Auto will use whatever the compositor is using for execution. The change is twofold. First, the denoising code was adapted to use buffers as opposed to passing pointers to filters directly, which is needed to support GPU devices. Second, device creation is now a bit more involved: it tries to choose the device that is being used by the compositor for execution. Matching GPU devices is done by choosing the OIDN device that matches the UUID or LUID of the active GPU platform. We need both UUID and LUID because not all platforms support both. UUID is supported on all platforms except macOS Metal, while LUID is only supported on Windows and macOS Metal. If there is no active GPU device or matching is unsuccessful, we let OIDN choose the best device, which is typically the fastest. To support this case, UUID and LUID identifiers were added to the GPUPlatformGlobal and are initialized by the GPU backend if supported. OpenGL now requires GL_EXT_memory_object and GL_EXT_memory_object_win32 to support this use case, but it should function without them. Pull Request: https://projects.blender.org/blender/blender/pulls/136660	2025-04-04 11:17:08 +02:00
Clément Foucault	f8de6c31bc	EEVEE: Move Object ID storage to gbuffer header layer This allows storing the full object ID inside a `uint32` buffer. This makes it possible to get the per-object data in deferred passes and avoids storing object data inside the Gbuffer. This data is only written if needed. This required modifying the implementation of subpass input for all backends to be able to bind layered textures. This currently works because only layer 0 is bound to the framebuffer. This is fragile but I don't see a good builtin way to fix it. Rel #135935 #### Tasks - [x] Replace light linking bits in Gbuffer - [x] Replace Object ID in GBuffer for SSS - [x] Conditional storage - [x] Dummy storage if not needed Pull Request: https://projects.blender.org/blender/blender/pulls/136428	2025-04-03 14:00:55 +02:00
Campbell Barton	d616c87d03	Cleanup: spelling in comments (make check_spelling_*)	2025-03-21 11:51:50 +11:00
Jeroen Bakker	32999913ef	SubDiv: Enable GPU subdivision on Metal This PR enables GPU-based subdivision on Metal. Most work is done in #135296. - Metal max storage bindings for compute shaders were never set. Some performance figures (Suzanne, 6 subdivision levels; CPU subdivision vs GPU subdivision): M1 Studio Ultra 7 fps vs 12 fps; M2 Air 3 fps vs 11 fps. Pull Request: https://projects.blender.org/blender/blender/pulls/135628	2025-03-11 11:12:01 +01:00
Jason Fielder	ff4b6c033d	Metal: Fix framebuffers being cleared during subpasses. Stops clearing the framebuffer when we split the scene into multiple renders. Fixes default cube rendering as black on some Mac systems. Authored by Apple: James McCarthy Co-authored-by: James McCarthy <jamesmccarthy@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/135099	2025-03-11 00:10:33 +01:00
Campbell Barton	d951428422	Cleanup: spelling in comments Address warnings from check_spelling.py	2025-03-06 10:49:51 +11:00
Brecht Van Lommel	3dab100860	Fix: ASAN errors after addition of texture pool Same fix as #132504. Free the texture pool before the derived GPU context class, as that one is used as part of freeing the texture pool. Pull Request: https://projects.blender.org/blender/blender/pulls/135444	2025-03-04 16:54:05 +01:00
Jeroen Bakker	3b5c3e70b1	SubDiv: Use shader create info for stretch overlays This PR migrates subdiv_vbo_edituv_strech_*_comp.glsl to use shader create info. Pull Request: https://projects.blender.org/blender/blender/pulls/135038	2025-02-24 13:32:53 +01:00
Jeroen Bakker	b34bc67f67	Metal: Add support for packed_float3 as storage buffers Subdivision shaders currently fail to compile using Metal as it doesn't recognize packed_float3 as an internal data type. This PR includes packed_float3 as an internal data type. Without this `blender --debug-gpu-compile-shaders` will fail as it includes a namespace. ``` ERROR (gpu.shader): subdiv_normals_accumulate Compute Shader: \| \| source/blender/gpu/metal/mtl_shader_generator.mm:971:9: Error: no type named 'packed_float3' in 'MTLShaderComputeImpl'; did you mean simply 'packed_float3'? \| \| device MTLShaderComputeImpl::packed_float3* normals[[buffer(MTL_storage_buffer_base_index+4)]], \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \| packed_float3 \| \| /System/Library/PrivateFrameworks/GPUCompiler.framework/Versions/32023/Libraries/lib/clang/32023.196/include/metal/metal_packed_vector:145:58: Note: 'packed_float3' declared here \| \| typedef __attribute__((__packed_vector_type__(3))) float packed_float3; \| ^ ``` Pull Request: https://projects.blender.org/blender/blender/pulls/134925	2025-02-21 13:46:10 +01:00
Clément Foucault	b73c06ada0	Fix: Metal: Avoid overriding GPU matrices after shader bind This was a bad usage of the Matrix API. This fixes 2D and 3D cursor being broken on Metal.	2025-02-17 14:47:15 +01:00
Clément Foucault	ad7b8d5b4c	Metal: Ensure that storage buffer reads are synchronized on Intel Macs There seems to be a pattern where this commonly failed. This patch adds the async flush (which is effectively not async) when there was no previous call to `async_flush_to_host`. This is only done on Intel Macs (or any Mac that has a non-unified memory architecture). Pull Request: https://projects.blender.org/blender/blender/pulls/134216	2025-02-10 20:44:08 +01:00
Clément Foucault	86b70143d5	Cleanup: GPU: Remove unused Transform Feedback implementation Most of the cleanup is inside the metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/134349	2025-02-10 17:30:42 +01:00
Jeroen Bakker	dda23c53f8	Metal: Add native tile input to workarounds Native tile input wasn't part of the MTLCapability struct, but stored locally in the shader generator and checked in MTLFramebuffer. This PR moves it to the MTLCapability struct and disables it when workarounds are forced. Pull Request: https://projects.blender.org/blender/blender/pulls/133818	2025-02-03 16:36:15 +01:00
Clément Foucault	976ed42533	Cleanup: GPU: Use functional cast for scalar casting	2025-01-31 18:26:44 +01:00
Brecht Van Lommel	c7502b092d	Cleanup: Various clang-tidy warnings in gpu Pull Request: https://projects.blender.org/blender/blender/pulls/133734	2025-01-31 17:03:18 +01:00
Clément Foucault	636147053d	Metal: Add support for repeating byte sequence for buffer clearing This allows running with the --debug-gpu option (which does NAN and 0xF0F0F0F0 clearing) without asserts even when the texture atomic workaround is enabled.	2025-01-31 16:13:56 +01:00
Clément Foucault	6ab4e99cf7	Fix #133645 : Metal: Crash when activating EEVEE on MacOS 13.7.2 with AMD This was caused by the subpass input workaround for non-tile-based GPUs using `texelFetch` on an `image`. This was supported before the cleanup `9c0321ae9b`, but it is against the GLSL specification and was removed in the cleanup. Using `imageLoad` instead of `texelFetch` fixes the crash. However, rendering seems to be broken for other reasons.	2025-01-30 15:32:17 +01:00
Jeroen Bakker	efff379ea5	Metal: Add support to force workarounds. Recently it came to our attention that macOS 13 doesn't always work because texture atomics are not supported by that version of the OS. Development happens most of the time on newer versions of the OS, without the ability to check whether Blender still works on older versions. This PR makes it possible to disable some Metal capabilities to better check how Blender works on those OS versions. The capabilities that will be disabled are texture gathering and texture atomics. It doesn't disable the capabilities that are required to start Blender, which are still part of the `MTLCapabilities` struct. This allows us to reproduce issues like #129571 Pull Request: https://projects.blender.org/blender/blender/pulls/133636	2025-01-27 11:07:20 +01:00
Brecht Van Lommel	24e5226ff0	Fix #128186 : Invalid GPU framebuffer free from context Framebuffers are getting freed in the GPUContext base class destructor. But the framebuffer destructors use the MTL/VK/GLContext derived class, whose destructor has already completed at this point. So these contexts are no longer valid to use. Now free the framebuffers earlier. This caused ASAN warnings, it's not known to cause actual bugs. Pull Request: https://projects.blender.org/blender/blender/pulls/132504	2025-01-06 11:32:02 +01:00
Campbell Barton	d2d754be3f	Cleanup: spelling in comments (make check_spelling*) - Back-tick quote math expressions to differentiate them from English. - Use doxygen code blocks for TEX expressions.	2025-01-04 16:26:39 +11:00
Brecht Van Lommel	63b4d7ba03	Fix: Uninitialized variable in MTLSafeFreeList Found by address sanitizer. My understanding is that this bug could cause too much flushing, but not wrong behavior or crashes. See `aca9c131fc`. Pull Request: https://projects.blender.org/blender/blender/pulls/132141	2024-12-20 20:03:41 +01:00
Jason Fielder	81f9df606a	Fix #130700 : Release Metal resources after each frame when rendering Python animations. Rendering animations from Python scripts via `bpy.ops.render.opengl()` did not trigger any of the notifications in the Metal back-end to indicate a frame had been rendered and that the associated resources could be released. This adds a call to GPU_render_step() after each render. For the original asset in the bug report this reduces the high memory watermark from 30 GB to 13 GB for 500 frames. 13 GB is likely still too high, so there are probably additional leaks that need to be addressed, and this should only be considered a partial fix. Authored by Apple: James McCarthy Co-authored-by: James McCarthy <jamesmccarthy@apple.com> Co-authored-by: Clément Foucault <foucault.clem@gmail.com> Pull Request: https://projects.blender.org/blender/blender/pulls/131085	2024-12-12 20:30:18 +01:00
Weizhen Huang	0b954d7777	Cleanup: make format	2024-12-10 17:51:29 +01:00
Clément Foucault	94b7035311	Fix: GPU: Broken compilation on Mac	2024-12-10 17:42:13 +01:00
Miguel Pozo	e24eadbb42	GPU: Add assert_framebuffer_shader_compatibility Ensure all the framebuffer color attachments are written to. Pull Request: https://projects.blender.org/blender/blender/pulls/130995	2024-12-10 17:13:06 +01:00
Aras Pranckevicius	074df4ceeb	GPU: ensure viewport does not use uninitialized images (#119685 and others) GPUViewport is creating a bunch of framebuffer textures for itself, but some space types never initialize/use them. E.g. Sequencer, Nodes etc. only ever use the "overlay" texture. Eventually when viewport is "drawn", it combines this uninitialized texture data and then only by luck it happens that most of the time it is black. But not always! The textures were only cleared (right now) on Metal backend, under GPU_clear_viewport_workaround as if it was some driver workaround. Stop doing that, and just clear them always. However, there was seemingly a performance issue on OpenGL, when this clear was being done. At least on my machine (Win10, Geforce RTX 3080Ti), the overhead of doing the clears is measurable, and is caused by usage of GL4.4 glClearTexImage instead of a framebuffer clear. As if glClearTexImage makes "pixel data to exist" on the CPU side and then later on binding this framebuffer sends off that data to the GPU, or somesuch. More details in the PR. Pull Request: https://projects.blender.org/blender/blender/pulls/131518	2024-12-09 13:23:18 +01:00
Clément Foucault	7b6cc57215	Metal: Fix race condition in msl_patch_default_get The string `msl_patch_default` could be read while partially initialized, or initialized multiple times and read incomplete, during multithreaded compilation. This should fix the GPU tests randomly failing on Mac. While this would never fail when Blender runs from the UI (since UI shaders are initialized in a single-threaded manner and always compile before EEVEE shaders), this race condition could happen when running EEVEE through background rendering or running tests. Pull Request: https://projects.blender.org/blender/blender/pulls/131580	2024-12-08 19:15:56 +01:00
Clément Foucault	52463a5f0b	GPU: Remove unused GPUDrawList API This was only used by the legacy draw manager. This one has already been removed.	2024-12-05 23:26:29 +01:00
Clément Foucault	994c43413a	Metal: Remove SSBO Vertex Fetch This API was used as a workaround for the lack of geometry shaders. It has been rendered redundant since the introduction of #125782.	2024-12-05 22:58:52 +01:00
Clément Foucault	4bfaecc340	Fix #131212 : Metal: Non-aligned circular buffer allocation logic The new buffer size could be non-aligned when using the fractional growing heuristic. This non-aligned allocation would then trigger an assert in the SSBO constructor. Aligning the allocation size fixes the issue.	2024-12-02 17:34:35 +01:00
Clément Foucault	ec84fe5fdb	Fix #131091 : GL: Weird Lines appearing in Gizmo Overlays This happened because NVIDIA GPUs require higher alignment for SSBO binds than for vertex inputs. This is related to #131103 which fixed it for Vulkan. Add a common capability option for that.	2024-11-28 17:22:12 +01:00
Clément Foucault	00a8d006fe	GPU: Move Polyline shader to primitive expansion This port is not so straightforward. This shader is used in different configurations and is available to python bindings. So we need to keep compatibility with different attribute configurations. This is why attributes are loaded per component and a uniform sets the length of the component. Since this shader can be used from both the imm and batch API, we need to inject some workarounds to bind the buffers correctly. The end result is still less versatile than the previous metal workaround (i.e.: more attribute fetch mode supported), but it is also way less code. ### Limitations: The new shader has some limitations: - Both `color` and `pos` attributes need to be `F32`. - Each attribute needs to be 4-byte aligned. - Fetch type needs to be `GPU_FETCH_FLOAT`. - Primitive type needs to be `GPU_PRIM_LINES`, `GPU_PRIM_LINE_STRIP` or `GPU_PRIM_LINE_LOOP`. - If drawing using an index buffer, it must contain no primitive restart. Rel #127493 Co-authored-by: Jeroen Bakker <jeroen@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/129315	2024-11-27 17:37:04 +01:00
Campbell Barton	b9f055459a	Cleanup: ensure trailing space around comment blocks	2024-11-27 19:01:00 +11:00
Campbell Barton	9e9598877e	Cleanup: balance braces in pre-processor checks While it's correct, unbalanced braces confuse some editing operations.	2024-11-26 12:41:29 +11:00
Sergey Sharybin	f70ec20ab8	Fix: Leak of GHOST GPU contexts in Metal backend Need to respect the ownership and lifetime of objects. Pull Request: https://projects.blender.org/blender/blender/pulls/130282	2024-11-14 17:18:55 +01:00
Jason Fielder	4fc2e1c842	Fix #129661 : Wait for GPU to complete to avoid use-after-free issues. In some cases the MTLContext was being destroyed before all GPU work was completed, causing the (outstanding) command buffer completion event handler to update a command buffer that had already been freed. This behaviour was introduced by [this](https://projects.blender.org/blender/blender/commit/6da42e9c951b) change which updated the event handler to track the number of outstanding command buffers per context as well as system-wide. Reproduced the issue with ASAN enabled and confirmed that waiting for the GPU to complete fixes the issue. Also contains a minor fix for uninitialized values in MTLAttachments identified by ASAN. Authored by Apple: James McCarthy Co-authored-by: James McCarthy <jamesmccarthy@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/129686	2024-11-12 16:48:43 +01:00
Clément Foucault	642933ffe6	Metal: Guard advanced vertex format against newer osx version Fixes a build error on older macOS versions.	2024-11-12 13:06:15 +01:00
Clément Foucault	e79f12cf43	Fix: GPU: Broken static shader tests Caused by `5d162719ba`	2024-11-10 15:16:34 +01:00
Clément Foucault	f67ea33993	Cleanup: Metal: Fix clang-tidy warnings Replace != "" by more semantically friendly `is_empty`.	2024-11-10 00:29:03 +01:00
Clément Foucault	6f4b106da3	Metal: Refactor format conversion logic Simplify the logic and handle all cases.	2024-11-09 21:50:37 +01:00
Clément Foucault	5d162719ba	Cleanup: Metal: Fix clang-tidy warning Replace size > 0 by more semantically friendly `is_empty`.	2024-11-09 21:50:37 +01:00
Clément Foucault	510f97865a	Fix: Metal: Address ASAN errors Fix several errors reported by ASAN when just launching Blender.	2024-11-08 16:09:09 +01:00
Clément Foucault	72b24fa336	Cleanup: Metal: Simplify mtl_convert_vertex_format Pull Request: https://projects.blender.org/blender/blender/pulls/130036	2024-11-08 16:07:58 +01:00
Bastien Montagne	e156a422cd	Merge branch 'blender-v4.3-release'	2024-11-07 16:01:04 +01:00
Jason Fielder	658700ddff	Fix #126364 : Metal: modified texture usage flags causing cache misses For Metal we can change the texture usage flags to get more optimal behaviour - one example is adding the attachment flag so we can utilise renders to do texture clears. However, these usage flags are used as part of the match criteria when trying to reuse released textures in the texture pool. The modifications mean a request for the same type of texture will fail, causing a cache miss. When we render to an image-view the texture pool is not released until the final sample has been rendered, as we consider the entire render to be a single frame (as opposed to normal viewport rendering when we are presenting the intermediate results). This causes the texture pool to grow and grow and grow, hence the large memory usage. This fix splits the usage flags into two sets, the internal ones we use to create the MTLTexture (which we may modify) and the originally requested ones. The originally requested ones are used for the texture pool matching. This fix also improves memory efficiency for normal viewport rendering. Mr Elephant scene (before -> after): load scene in viewport: 13.04 GB -> 9.15 GB; viewport render image: 78.69 GB -> 16.61 GB. Authored by Apple: James McCarthy Pull Request: https://projects.blender.org/blender/blender/pulls/129951	2024-11-07 15:53:09 +01:00
Clément Foucault	8c0bd61342	Merge branch 'blender-v4.3-release'	2024-11-05 17:40:49 +01:00

446 Commits