test2

Author	SHA1	Message	Date
Brecht Van Lommel	3dab100860	Fix: ASAN errors after addition of texture pool Same fix as #132504. Free the texture pool before the derived GPU context class, as that one is used as part of freeing the texture pool. Pull Request: https://projects.blender.org/blender/blender/pulls/135444	2025-03-04 16:54:05 +01:00
Jeroen Bakker	3b5c3e70b1	SubDiv: Use shader create info for stretch overlays This PR migrates subdiv_vbo_edituv_strech_*_comp.glsl to use shader create info. Pull Request: https://projects.blender.org/blender/blender/pulls/135038	2025-02-24 13:32:53 +01:00
Jeroen Bakker	b34bc67f67	Metal: Add support for packed_float3 as storage buffers Subdivision shaders currently fail to compile using Metal as it doesn't recognize packed_float3 as an internal data type. This PR includes packed_float3 as an internal data type. Without this `blender --debug-gpu-compile-shaders` will fail as it includes a namespace. ``` ERROR (gpu.shader): subdiv_normals_accumulate Compute Shader: \| \| source/blender/gpu/metal/mtl_shader_generator.mm:971:9: Error: no type named 'packed_float3' in 'MTLShaderComputeImpl'; did you mean simply 'packed_float3'? \| \| device MTLShaderComputeImpl::packed_float3* normals[[buffer(MTL_storage_buffer_base_index+4)]], \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \| packed_float3 \| \| /System/Library/PrivateFrameworks/GPUCompiler.framework/Versions/32023/Libraries/lib/clang/32023.196/include/metal/metal_packed_vector:145:58: Note: 'packed_float3' declared here \| \| typedef __attribute__((__packed_vector_type__(3))) float packed_float3; \| ^ ``` Pull Request: https://projects.blender.org/blender/blender/pulls/134925	2025-02-21 13:46:10 +01:00
Clément Foucault	b73c06ada0	Fix: Metal: Avoid overriding GPU matrices after shader bind This was a bad usage of the Matrix API. This fixes 2D and 3D cursor being broken on Metal.	2025-02-17 14:47:15 +01:00
Clément Foucault	ad7b8d5b4c	Metal: Ensure that storage buffer reads are synchronized on Intel Macs There seems to be a pattern where this commonly failed. This patch adds the async flush (which is effectively not async) when there were no previous call to `async_flush_to_host`. This is only done on Intel Macs (or any mac that has non unified memory arch). Pull Request: https://projects.blender.org/blender/blender/pulls/134216	2025-02-10 20:44:08 +01:00
Clément Foucault	86b70143d5	Cleanup: GPU: Remove unused Transform Feedback implementation Most of the cleanup is inside the metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/134349	2025-02-10 17:30:42 +01:00
Jeroen Bakker	dda23c53f8	Metal: Add native tile input to workarounds Native tile input wasn't part of the MTLCapability struct, but stored locally in the shader generator and checked in MTLFramebuffer. This PR moves it to the MTLCapability struct and disables it when workarounds are forced. Pull Request: https://projects.blender.org/blender/blender/pulls/133818	2025-02-03 16:36:15 +01:00
Clément Foucault	976ed42533	Cleanup: GPU: Use functional cast for scalar casting	2025-01-31 18:26:44 +01:00
Brecht Van Lommel	c7502b092d	Cleanup: Various clang-tidy warnings in gpu Pull Request: https://projects.blender.org/blender/blender/pulls/133734	2025-01-31 17:03:18 +01:00
Clément Foucault	636147053d	Metal: Add support for repeating byte sequence for buffer clearing This allows to run with the --debug-gpu option (which does NAN and 0xF0F0F0F0 clearing) without asserts even when the texture atomic workaround is enabled.	2025-01-31 16:13:56 +01:00
Clément Foucault	6ab4e99cf7	Fix #133645 : Metal: Crash when activating EEVEE on MacOS 13.7.2 with AMD This was caused by the subpass input workaround for non-tilebased GPU using `texelFetch` on an `image`. This was supported before the cleanup `9c0321ae9b`. But is against the GLSL specification and was removed inside the cleanup. Using `imageLoad` instead of `texelFetch` fixes the crash. However rendering seems to be broken for other reasons.	2025-01-30 15:32:17 +01:00
Jeroen Bakker	efff379ea5	Metal: Add support to force workarounds. Recently it came to out attention that macOs13 doesn't always work due to texture atomics not supported by that version of the OS. Development happens most of the time on newer versions of the OS without ability to check if it still works on the older versions. This PR enables to disable some Metal capabilities to better check how Blender works on those OS's. The capabilities that will be disabled are texture gathering and texture atomics. It doesn't disable the capabilities that are required to start Blender, which are still part of the `MTLCapabilities` struct. This allows us to reproduce issues like #129571 Pull Request: https://projects.blender.org/blender/blender/pulls/133636	2025-01-27 11:07:20 +01:00
Brecht Van Lommel	24e5226ff0	Fix #128186 : Invalid GPU framebuffer free from context Framebuffers are getting freed in the GPUContext base class destructor. But the framebuffer destructors use the MTL/VK/GLContext derived class, whose destructor has already completed at this point. So these contexts are no longer valid to use. Now free the framebuffers earlier. This caused ASAN warnings, it's not known to cause actual bugs. Pull Request: https://projects.blender.org/blender/blender/pulls/132504	2025-01-06 11:32:02 +01:00
Campbell Barton	d2d754be3f	Cleanup: spelling in comments (make check_spelling*) - Back-tick quote math expressions so differentiate them from English. - Use doxygen code blocks for TEX expressions.	2025-01-04 16:26:39 +11:00
Brecht Van Lommel	63b4d7ba03	Fix: Uninitialized variable in MTLSafeFreeList Found by address sanitizer. My understanding is that this bug could cause too much flushing, but not wrong behavior or crashes. See `aca9c131fc`. Pull Request: https://projects.blender.org/blender/blender/pulls/132141	2024-12-20 20:03:41 +01:00
Jason Fielder	81f9df606a	Fix #130700 : Release Metal resources after each frame when rendering Python animations. Rendering animations from Python scripts via `bpy.ops.render.opengl()` did not trigger any of the notifications in the Metal back-end to indicate a frame had been rendered and that the associated resources could be released. This adds a call to GPU_render_step() after each render. For the original asset in the bug report this reduces the high memory watermark from 30gb to 13gb for 500 frames. 13gb is likely still too high and therefore it is likely there are additional leaks that need to be addressed so this should only be considered a partial fix. Authored by Apple: James McCarthy Co-authored-by: James McCarthy <jamesmccarthy@apple.com> Co-authored-by: Clément Foucault <foucault.clem@gmail.com> Pull Request: https://projects.blender.org/blender/blender/pulls/131085	2024-12-12 20:30:18 +01:00
Weizhen Huang	0b954d7777	Cleanup: make format	2024-12-10 17:51:29 +01:00
Clément Foucault	94b7035311	Fix: GPU: Broken compilation on Mac	2024-12-10 17:42:13 +01:00
Miguel Pozo	e24eadbb42	GPU: Add assert_framebuffer_shader_compatibility Ensure all the framebuffer color attachments are written to. Pull Request: https://projects.blender.org/blender/blender/pulls/130995	2024-12-10 17:13:06 +01:00
Aras Pranckevicius	074df4ceeb	GPU: ensure viewport does not use uninitialized images (#119685 and others) GPUViewport is creating a bunch of framebuffer textures for itself, but some space types never initialize/use them. E.g. Sequencer, Nodes etc. only ever use the "overlay" texture. Eventually when viewport is "drawn", it combines this uninitialized texture data and then only by luck it happens that most of the time it is black. But not always! The textures were only cleared (right now) on Metal backend, under GPU_clear_viewport_workaround as if it was some driver workaround. Stop doing that, and just clear them always. However, there was seemingly a performance issue on OpenGL, when this clear was being done. At least on my machine (Win10, Geforce RTX 3080Ti), the overhead of doing the clears is measurable, and is caused by usage of GL4.4 glClearTexImage instead of a framebuffer clear. As if glClearTexImage makes "pixel data to exist" on the CPU side and then later on binding this framebuffer sends off that data to the GPU, or somesuch. More details in the PR. Pull Request: https://projects.blender.org/blender/blender/pulls/131518	2024-12-09 13:23:18 +01:00
Clément Foucault	7b6cc57215	Metal: Fix race condition in msl_patch_default_get The string `msl_patch_default` can have been read partially uninitialized or initialized multiple time and read uncomplete during multithreaded compilation. This should fix the GPU tests randomly failing on mac. While this would never fail when blender runs from the UI (since UI shaders are init in single threaded manner and always compile before EEVEE shaders), this race condition could happen when running EEVEE through background rendering or running tests. Pull Request: https://projects.blender.org/blender/blender/pulls/131580	2024-12-08 19:15:56 +01:00
Clément Foucault	52463a5f0b	GPU: Remove unused GPUDrawList API This was only used by the legacy draw manager. This one has already been removed.	2024-12-05 23:26:29 +01:00
Clément Foucault	994c43413a	Metal: Remove SSBO Vertex Fetch This API was used as a workaround to the lack of geometry shader. It has been rendered redundant since the introduction of #125782.	2024-12-05 22:58:52 +01:00
Clément Foucault	4bfaecc340	Fix #131212 : Metal: Non-aligned circular buffer allocation logic The new buffer size could have been non aligned when using the fractional growing heuristic. This non aligned allocation would then trigger an assert at the SSBO constructor. Aligning the alocation size fixes the issue.	2024-12-02 17:34:35 +01:00
Clément Foucault	ec84fe5fdb	Fix #131091 : GL: Weird Lines appearing in Gizmo Overlays This happened because NVidia GPUs require higher alignment for SSBO binds than for vertex inputs. This is related to #131103 which fixed it for vulkan. Add a common capability option for that.	2024-11-28 17:22:12 +01:00
Clément Foucault	00a8d006fe	GPU: Move Polyline shader to primitive expansion This port is not so straightforward. This shader is used in different configurations and is available to python bindings. So we need to keep compatibility with different attributes configurations. This is why attributes are loaded per component and a uniform sets the length of the component. Since this shader can be used from both the imm and batch API, we need to inject some workarounds to bind the buffers correctly. The end result is still less versatile than the previous metal workaround (i.e.: more attribute fetch mode supported), but it is also way less code. ### Limitations: The new shader has some limitation: - Both `color` and `pos` attributes need to be `F32`. - Each attribute needs to be 4byte aligned. - Fetch type needs to be `GPU_FETCH_FLOAT`. - Primitive type needs to be `GPU_PRIM_LINES`, `GPU_PRIM_LINE_STRIP` or `GPU_PRIM_LINE_LOOP`. - If drawing using an index buffer, it must contain no primitive restart. Rel #127493 Co-authored-by: Jeroen Bakker <jeroen@blender.org> Pull Request: https://projects.blender.org/blender/blender/pulls/129315	2024-11-27 17:37:04 +01:00
Campbell Barton	b9f055459a	Cleanup: ensure trailing space around comment blocks	2024-11-27 19:01:00 +11:00
Campbell Barton	9e9598877e	Cleanup: balance braces in pre-processor checks While it's correct, unbalanced braces confuses some editing operations.	2024-11-26 12:41:29 +11:00
Sergey Sharybin	f70ec20ab8	Fix: Leak of GHOST GPU contexts in Metal backend Need to respect the ownership and lifetime of objects. Pull Request: https://projects.blender.org/blender/blender/pulls/130282	2024-11-14 17:18:55 +01:00
Jason Fielder	4fc2e1c842	Fix #129661 : Wait for GPU to complete to avoid use-after-free issues. In some cases the MTLContext was being destroyed before all GPU work was completed causing the (outstanding) command buffer completion event handler to update a command buffer that had already been freed. This behaviour was introduced by [this](https://projects.blender.org/blender/blender/commit/6da42e9c951b) change which updated the event handler to track the number of outstanding command buffers per context as well as system-wide. Reproduced the issue with ASAN enabled and confirmed that waiting for the GPU to complete fixes the issue. Also contains a minor fix for unitiiliased values in MTLAttachments identified by ASAN. Authored by Apple: James McCarthy" Co-authored-by: James McCarthy <jamesmccarthy@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/129686	2024-11-12 16:48:43 +01:00
Clément Foucault	642933ffe6	Metal: Guard advanced vertex format against newer osx version Fixes a build error on older macos.	2024-11-12 13:06:15 +01:00
Clément Foucault	e79f12cf43	Fix: GPU: Broken static shader tests Caused by `5d162719ba`	2024-11-10 15:16:34 +01:00
Clément Foucault	f67ea33993	Cleanup: Metal: Fix clang-tidy warnings Replace != "" by more semanticaly friendly `is_empty`.	2024-11-10 00:29:03 +01:00
Clément Foucault	6f4b106da3	Metal: Refactor format conversion logic Simplify the logic and handle all cases.	2024-11-09 21:50:37 +01:00
Clément Foucault	5d162719ba	Cleanup: Metal: Fix clang-tidy warning Replace size > 0 by more semanticaly friendly `is_empty`.	2024-11-09 21:50:37 +01:00
Clément Foucault	510f97865a	Fix: Metal: Address ASAN errors Fix several error reported by asan when just launching blender.	2024-11-08 16:09:09 +01:00
Clément Foucault	72b24fa336	Cleanup: Metal: Simplify mtl_convert_vertex_format _No response_ Pull Request: https://projects.blender.org/blender/blender/pulls/130036	2024-11-08 16:07:58 +01:00
Bastien Montagne	e156a422cd	Merge branch 'blender-v4.3-release'	2024-11-07 16:01:04 +01:00
Jason Fielder	658700ddff	Fix #126364 : Metal: modified texture usage flags causing cache misses For Metal we can change the texture usage flags to get more optimal behaviour - one example is adding the attachment flag so we can utilise renders to do texture clears. However these usage flags are used as the part of the match-criteria when trying to reuse released textures in the texture pool. The modifications means a request for the same type of texture will fail causing a cache miss. When we render to an image-view the texture pool is not released until the final sample has been rendered as we consider the entire render to be a single frame (as opposed to normal viewport rendering when we are presenting the intermediate results). This causes the texture pool to grow and grow and grow hence the large memory usage. This fix splits the usage flags into two sets, the internal ones we use to create the MTLTexture (which we may modify) and the originally requested ones. The originally requested ones are used for the texture pool matching. This fix also improves memory efficiency for normal viewport rendering. Mr Elephant Scene Before -> After Load scene in viewport: 13.04Gb -> 9.15 Gb Viewport Render Image: 78.69Gb -> 16.61Gb Authored by Apple: James McCarthy Pull Request: https://projects.blender.org/blender/blender/pulls/129951	2024-11-07 15:53:09 +01:00
Clément Foucault	8c0bd61342	Merge branch 'blender-v4.3-release'	2024-11-05 17:40:49 +01:00
Clément Foucault	750a9af518	Fix #129705 : EEVEE: Light Probe RAM Pool Crash on MacOS Crash manifested after the inclusion of #128877. The very tall 3D texture tested by the new code were not supported / tested by the Metal Backend. Simply adding the appropriate upfront checks fixes the issue. Needs to be backported to 4.2	2024-11-05 17:37:02 +01:00
Campbell Barton	e46c58df7c	Merge branch 'blender-v4.3-release'	2024-11-02 15:44:22 +11:00
Campbell Barton	99387c0749	Cleanup: spelling in comments, docs & error	2024-11-02 15:43:27 +11:00
Clément Foucault	e311c6dd4f	Cleanup: Metal: Fix clang tidy warnings _No response_ Pull Request: https://projects.blender.org/blender/blender/pulls/129656	2024-11-01 20:23:18 +01:00
Hans Goudey	9b97ba1462	Cleanup: GPU: Avoid raw pointers for shader API strings Avoid measuring the length of strings repeatedly by passing their length along with their data with `StringRefNull`. Null termination seems to be necessary still for passing the shader sources to OpenGL. Though I doubt this is a bottleneck, it's still nice to avoid overhead from string operations and this helps move in that direction. Pull Request: https://projects.blender.org/blender/blender/pulls/127702	2024-11-01 20:00:31 +01:00
Clément Foucault	47f7aaa2cc	Merge branch 'blender-v4.3-release'	2024-11-01 12:16:38 +01:00
Jason Fielder	7fbc9e9428	Fix: Metal: Memory leaks identified by Instruments and Xcode memory graph. Running Xcode memory graphs and the Instruments tools revealed memory leaks caused, in the main, by over-retained objects. This removes the unnecessary 'retains' and adds some asserts to guard against over-retaining in the future. There are a few memory leaks remaining involving PyUnicode_DecodeUTF8 but I am unable to identify the cause of these at this time. Authored by Apple: James McCarthy Pull Request: https://projects.blender.org/blender/blender/pulls/129117	2024-11-01 11:56:51 +01:00
Clément Foucault	324517fd78	Cleanup: GPU: Fix clang tidy warnings Removes some other things like: - `TRUST_NO_ONE` which was the same as `#ifndef NDEBUG`. - Replace `reinterpret_cast` by `unwrap` Pull Request: https://projects.blender.org/blender/blender/pulls/129631	2024-10-31 15:18:29 +01:00
Clément Foucault	1b130f651a	Fix: Metal: Remove some more warning & errors Fix by either changing the user level code, or by removing the warnings that are not helpful.	2024-10-30 14:58:26 +01:00
Clément Foucault	7c979d6d40	Fix: Metal: Error caused by missing const_cast Error introduced by `7dc43b7dd2`	2024-10-29 23:55:24 +01:00

1 2 3 4 5 ...

436 Commits