test2

Author	SHA1	Message	Date
Clément Foucault	ca88983af2	EEVEE: Reverse-Z implementation This feature greatly increase depth buffer precision. This is very noticeable in large view distance scenes. This is enabled by default on GPUs that supports it (most of the hardware we support already supports this). This makes rendering different on the GPUs that do not support that feature (`glClipControl`). While this give much better depth precision as before, we also have a lot of imprecision caused by our vertex transformations. This can be improved in another task. Pull Request: https://projects.blender.org/blender/blender/pulls/138898	2025-05-19 16:29:26 +02:00
Miguel Pozo	992e7c95a7	GPU: Converge ShaderCompiler implementations Part of #136993. Share as much of the ShaderCompiler implementations as possible. Remove the ShaderCompiler/ShaderCompilerGeneric split and make most of its functions non virtual. Move the `get_compiler` function from `Context` to `GPUBackend` and creation/deletion to `GPUBackend::init/delete_resources`. Add a `batch_cancel` function to `ShaderCompiler` (needed for the GPUPass refactor). As a nice extra, the multithreaded OpenGL compilation has become faster too. The barbershop materials + EEVEE static shaders have gone from 27s to 22s. I have not observed any performance difference on Vulkan or Metal. Pull Request: https://projects.blender.org/blender/blender/pulls/136676	2025-05-08 18:16:47 +02:00
Miguel Pozo	41ae990b29	Cleanup: Remove StencilViewWorkaround and GPU_texture_view_support() Texture views should be fully supported everywhere now. Pull Request: https://projects.blender.org/blender/blender/pulls/137863	2025-04-23 20:57:00 +02:00
Clément Foucault	04fcf2f907	Merge branch 'blender-v4.4-release'	2025-03-05 12:06:00 +01:00
Clément Foucault	326ce59961	Fix #134509 : GPU: Add workaround for Intel ARC nodelink driver bug This disables the instancing optimization for this specific hardware. Pull Request: https://projects.blender.org/blender/blender/pulls/135458	2025-03-05 12:05:34 +01:00
Clément Foucault	86b70143d5	Cleanup: GPU: Remove unused Transform Feedback implementation Most of the cleanup is inside the metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/134349	2025-02-10 17:30:42 +01:00
Aras Pranckevicius	074df4ceeb	GPU: ensure viewport does not use uninitialized images (#119685 and others) GPUViewport is creating a bunch of framebuffer textures for itself, but some space types never initialize/use them. E.g. Sequencer, Nodes etc. only ever use the "overlay" texture. Eventually when viewport is "drawn", it combines this uninitialized texture data and then only by luck it happens that most of the time it is black. But not always! The textures were only cleared (right now) on Metal backend, under GPU_clear_viewport_workaround as if it was some driver workaround. Stop doing that, and just clear them always. However, there was seemingly a performance issue on OpenGL, when this clear was being done. At least on my machine (Win10, Geforce RTX 3080Ti), the overhead of doing the clears is measurable, and is caused by usage of GL4.4 glClearTexImage instead of a framebuffer clear. As if glClearTexImage makes "pixel data to exist" on the CPU side and then later on binding this framebuffer sends off that data to the GPU, or somesuch. More details in the PR. Pull Request: https://projects.blender.org/blender/blender/pulls/131518	2024-12-09 13:23:18 +01:00
Clément Foucault	ec84fe5fdb	Fix #131091 : GL: Weird Lines appearing in Gizmo Overlays This happened because NVidia GPUs require higher alignment for SSBO binds than for vertex inputs. This is related to #131103 which fixed it for vulkan. Add a common capability option for that.	2024-11-28 17:22:12 +01:00
Jeroen Bakker	272403a6cb	Vulkan: Remove empty attachments Some Vulkan platforms don't support framebuffers with gaps between the color attachments. Workbench framebuffers can create gaps. (`in_front_fb`, `main_fb` when used for wire frame drawing). This PR implements a detection mechanism to detect gaps. It also disables features that are not able to comply to this requirement. Detected when working on #129062 Pull Request: https://projects.blender.org/blender/blender/pulls/130258	2024-11-14 13:55:33 +01:00
Clément Foucault	43b5e2d0ed	Fix: EEVEE: Add buffer workaround for stencil classification Adding a dummy storage buffer to the classification shader seems to fix the issue on Qualcomm drivers (WoA). The workaround is added to the force workaround option to allow other platforms to test the fix. Rel #122837 Pull Request: https://projects.blender.org/blender/blender/pulls/129857	2024-11-05 17:12:28 +01:00
Miguel Pozo	4c314f9a78	GPU: Add --gpu-compilation-subprocesses cmd setting Allow overriding the `max_parallel_compilations` from the command line. Disable compilation subprocesses on RenderDoc sessions. Pull Request: https://projects.blender.org/blender/blender/pulls/123995	2024-07-01 16:36:01 +02:00
Miguel Pozo	74224b25a5	GPU: Add GPU_shader_batch_create_from_infos This is the first commit of the several required to support subprocess-based parallel compilation on OpenGL. This provides the base API and implementation, and exposes the max subprocesses setting on the UI, but it's not used by any code yet. More information and the rest of the code can be found in #121925. This one includes: - A new `GPU_shader_batch` API that allows requesting the compilation of multiple shaders at once, allowing GPU backed to compile them in parallel and asynchronously without blocking the Blender UI. - A virtual `ShaderCompiler` class that backends can use to add their own implementation. - A `ShaderCompilerGeneric` class that implements synchronous/blocking compilation of batches for backends that don't have their own implementation yet. - A `GLShaderCompiler` that supports parallel compilation using subprocesses. - A new `BLI_subprocess` API, including IPC (required for the `GLShaderCompiler` implementation). - The implementation of the subprocess program in `GPU_compilation_subprocess`. - A new `Max Shader Compilation Subprocesses` option in `Preferences > System > Memory & Limits` to enable parallel shader compilation and the max number of subprocesses to allocate (each subprocess has a relatively high memory footprint). Implementation Overview: There's a single `GLShaderCompiler` shared by all OpenGL contexts. This class stores a pool of up to `GCaps.max_parallel_compilations` subprocesses that can be used for compilation. Each subprocess has a shared memory pool used for sending the shader source code from the main Blender process and for receiving the already compiled shader binary from the subprocess. This is synchronized using a series of shared semaphores. The subprocesses maintain a shader cache on disk inside a `BLENDER_SHADER_CACHE` folder at the OS temporary folder. Shaders that fail to compile are tried to be compiled again locally for proper error reports. Hanged subprocesses are currently detected using a timeout of 30s. Pull Request: https://projects.blender.org/blender/blender/pulls/122232	2024-06-05 18:45:57 +02:00
Jeroen Bakker	0c2085a316	GPU: Remove GPU_compute_shader_support Compute shaders are required since 4.0. There was one occasion where an older AMD driver failed and support was turned off. This driver is now marked unsupported. This PR includes: - removing the check in viewport compositing - remove properties from system info - always construct draw manager. - remove unused pass logic in draw hair/curves - add deprecation warning when accessed from python Pull Request: https://projects.blender.org/blender/blender/pulls/120909	2024-04-22 13:28:10 +02:00
Hans Goudey	8b514bccd1	Cleanup: Move remaining GPU headers to C++ Pull Request: https://projects.blender.org/blender/blender/pulls/119807	2024-03-23 01:24:18 +01:00
Omar Emara	eb91828aab	GPU: Add maximum image units to GPU capabilities This patch adds the maximum number of supported image units to the GPU capabilities module. Currently, the GPU module assume a maximum of 8 units, so the patch is not currently particularly useful, but we can consider committing it for the future anyways. Pull Request: https://projects.blender.org/blender/blender/pulls/119057	2024-03-05 07:25:20 +01:00
Clément Foucault	71904d7fb3	GL: Remove image load/store workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	98e465109b	EEVEE-Next: Replace lighting tiles by direct stencil setup This avoid the cost of creating the tiles themselves which uses a lot texture write. This was a bottleneck on Apple GPUs. Also the per pixel classification allows us to remove certain checks in the deferred lighting shader making it faster. ### TODO - [x] Add gl_FragStencilRefARB support on other backend - [x] Add workaround for when gl_FragStencilRefARB isnt supported Pull Request: https://projects.blender.org/blender/blender/pulls/116704	2024-01-08 07:35:05 +01:00
Campbell Barton	3b5031f1cb	Cleanup: use 'r_' prefix for output arguments, order last Also clarify some naming.	2023-11-30 10:42:18 +11:00
Miguel Pozo	3f72c62be6	GPU: Add GPU_max_storage_buffer_size Needed for #113509. Using a `size_t` since some GPUs should support >4GB buffers, but Vulkan exposes the value as a `uint` and Metal as an `int`. Pull Request: https://projects.blender.org/blender/blender/pulls/113516	2023-10-11 19:26:40 +02:00
Jeroen Bakker	6d91d36161	GPU: Remove GPU_shader_storage_buffer_objects_support Blender 4.0 requires OpenGL 4.3 which always support SSBO's. Platforms that don't support enough SSBO bind points will be marked as unsupported. Users who start Blender on those platforms will be informed via a dialog. This PR also updates the `--debug-gpu-force-workarounds` to match our minimum requirements. Note that some bugs are still there that should be solved in other PRs: * Workbench only renders the object using a unit matrix this is because there is a bug in the workaround for shader_draw_parameters * Navigating with middle mouse button is not working. Unsure what the cause is, but might be a missing feature check in the OpenGL backend. Related to #112224 Pull Request: https://projects.blender.org/blender/blender/pulls/112572	2023-09-21 12:55:51 +02:00
Miguel Pozo	f63a0e670a	Workbench-Next: Workaround lack of texture views support Extract a mask from the stencil buffer and use that texture instead when texture views are not available. Needed for supporting the Windows ARM software driver. The workaround is isolated on its own class so it's easy to remove once it's no longer needed. Note that while this adds a function to check if texture views are available (`GPU_texture_view_support`), at the moment this always returns true in practice, since OpenGL 4.3 is the minimum supported version. A separate patch will be needed to allow Blender to run using OpenGL 4.2 + extensions. Pull Request: https://projects.blender.org/blender/blender/pulls/111402	2023-08-24 15:10:33 +02:00
Campbell Barton	e955c94ed3	License Headers: Set copyright to "Blender Authors", add AUTHORS Listing the "Blender Foundation" as copyright holder implied the Blender Foundation holds copyright to files which may include work from many developers. While keeping copyright on headers makes sense for isolated libraries, Blender's own code may be refactored or moved between files in a way that makes the per file copyright holders less meaningful. Copyright references to the "Blender Foundation" have been replaced with "Blender Authors", with the exception of `./extern/` since these this contains libraries which are more isolated, any changed to license headers there can be handled on a case-by-case basis. Some directories in `./intern/` have also been excluded: - `./intern/cycles/` it's own `AUTHORS` file is planned. - `./intern/opensubdiv/`. An "AUTHORS" file has been added, using the chromium projects authors file as a template. Design task: #110784 Ref !110783.	2023-08-16 00:20:26 +10:00
Jason Fielder	2367ed2ef2	macOS: Enable support for EDR rendering Add a High Dynamic Range option in the Color Management > Display panel. This enables display of extended color ranges above 1.0 for the 3D viewport, image editor and render previews. This requires a monitor that can display HDR colors, and a view transform designed for HDR output. The Standard view transform works, but Filmic does not as it was designed to bring values into the 0..1 range for SDR displays. This patch is limited to allowing the display to visualize extended colors, but does not include future looking work to better integrate HDR into the full workflow. It is implemented by rendering to high bit-depth texture formats for the user interface, and uncapping the color range in color management. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/105662	2023-08-09 14:25:15 +02:00
Campbell Barton	69aee8ba6b	Cleanup: remove redundant (void) for functions with no args in C++	2023-07-02 19:54:27 +10:00
Sergey Sharybin	c1bc70b711	Cleanup: Add a copyright notice to files and use SPDX format A lot of files were missing copyright field in the header and the Blender Foundation contributed to them in a sense of bug fixing and general maintenance. This change makes it explicit that those files are at least partially copyrighted by the Blender Foundation. Note that this does not make it so the Blender Foundation is the only holder of the copyright in those files, and developers who do not have a signed contract with the foundation still hold the copyright as well. Another aspect of this change is using SPDX format for the header. We already used it for the license specification, and now we state it for the copyright as well, following the FAQ: https://reuse.software/faq/	2023-05-31 16:19:06 +02:00
Sergey Sharybin	a12a8a71bb	Remove "All Rights Reserved" from Blender Foundation copyright code The goal is to solve confusion of the "All rights reserved" for licensing code under an open-source license. The phrase "All rights reserved" comes from a historical convention that required this phrase for the copyright protection to apply. This convention is no longer relevant. However, even though the phrase has no meaning in establishing the copyright it has not lost meaning in terms of licensing. This change makes it so code under the Blender Foundation copyright does not use "all rights reserved". This is also how the GPL license itself states how to apply it to the source code: <one line to give the program's name and a brief idea of what it does.> Copyright (C) <year> <name of author> This program is free software ... This change does not change copyright notice in cases when the copyright is dual (BF and an author), or just an author of the code. It also does mot change copyright which is inherited from NaN Holding BV as it needs some further investigation about what is the proper way to handle it.	2023-03-30 10:51:59 +02:00
Thomas Dinges	6b8bb26c45	EEVEE: Port existing EEVEE shaders and generated materials to use GPUShaderCreateInfo. Required by Metal backend for efficient shader compilation. EEVEE material resource binding permutations now controlled via CreateInfo and selected based on material options. Other existing CreateInfo's also modified to ensure explicitness for depth-writing mode. Other missing bindings also addressed to ensure full compliance with the Metal backend. Authored by Apple: Michael Parkin-White Ref T96261 Reviewed By: fclem Differential Revision: https://developer.blender.org/D16243	2022-12-08 21:12:19 +01:00
Jason Fielder	ac07fb38a1	Metal: Minimum per-vertex stride, 3D texture size + Transform feedback GPUCapabilities expansion. - Adding in compatibility paths to support minimum per-vertex strides for vertex formats. OpenGL supports a minimum stride of 1 byte, in Metal, this minimum stride is 4 bytes. Meaing a vertex format must be atleast 4-bytes in size. - Replacing transform feedback compile-time check to conditional look-up, given TF is supported on macOS with Metal. - 3D texture size safety check added as a general capability, rather than being in the gl backend only. Also required for Metal. Authored by Apple: Michael Parkin-White Ref T96261 Reviewed By: fclem Maniphest Tasks: T96261 Differential Revision: https://developer.blender.org/D14510	2022-09-01 22:18:02 +02:00
Clément Foucault	5a60535a20	GPUCapabilities: Add GPU_shader_draw_parameters_support This checks for the availability of `gl_BaseInstanceARB` or equivalent. Disabling for any workaround that disables shader_image_load_store_support as a preventive measure.	2022-08-31 11:35:18 +02:00
Campbell Barton	75f0aaab3d	Cleanup: remove redundant GPU headers	2022-06-22 14:59:42 +10:00
Jeroen Bakker	f41c7723c9	GPU: Remove cached full/scaled image texture. full scaled image isn't used anymore. It was added to use a different scale when displaying an image in the image editor. This was replaced by the image engine redesign. This change will reduce complexity of {T98375}.	2022-05-27 10:52:49 +02:00
Campbell Barton	3b0a08b793	Cleanup: format	2022-05-12 14:07:15 +10:00
Jeroen Bakker	e354ba701a	Fix T97330: GPU Subdiv compiler error. GLSL has different max number of ssbo per glsl stage. This patch checks if the number of compute ssbo blocks matches our requirements for the GPU Subdiv, before enabling it. Some platforms allow more ssbo bindings then blocks per stage.	2022-05-11 15:29:52 +02:00
Jeroen Bakker	439f86ac89	Fix T97272: Lag when resizing viewports. Viewports where cleared explicitly due to compatibility reasons with Intel iGPUs. This slowed down other platforms as well, this wasn't noticeable on all platforms. This patch will be more selective when to enable the workaround. Currently only for iGPUs on Mac + Linux.	2022-05-10 08:48:51 +02:00
Kévin Dietrich	45b637e1e7	GPU capabilities: detect max SSBO bindings This adds detection of the maximum number of shader storage buffer bindings that is supported on the current platform. This can be useful to turn off features that require compute shaders but use more buffer bindings than available. Differential Revision: https://developer.blender.org/D14337	2022-03-15 16:04:41 +01:00
Campbell Barton	c434782e3a	File headers: SPDX License migration Use a shorter/simpler license convention, stops the header taking so much space. Follow the SPDX license specification: https://spdx.org/licenses - C/C++/objc/objc++ - Python - Shell Scripts - CMake, GNUmakefile While most of the source tree has been included - `./extern/` was left out. - `./intern/cycles` & `./intern/atomic` are also excluded because they use different header conventions. doc/license/SPDX-license-identifiers.txt has been added to list SPDX all used identifiers. See P2788 for the script that automated these edits. Reviewed By: brecht, mont29, sergey Ref D14069	2022-02-11 09:14:36 +11:00
Campbell Barton	7c76bdca1b	Cleanup: move public doc-strings into headers for 'gpu' Ref T92709	2021-12-09 20:01:47 +11:00
Aaron Carlisle	c1279768a7	Cleanup: Clang-Tidy modernize-redundant-void-arg	2021-12-08 00:31:20 -05:00
Jeroen Bakker	6b03621c01	DrawManager: Use Compute Shader to Update Hair. This patch will use compute shaders to create the VBO for hair. The previous implementation uses transform feedback. Timings before: between 0.000069s and 0.000362s. Timings after: between 0.000032s and 0.000092s. Speedup isn't noticeable by end-users. The patch is used to test the new compute shader pipeline and integrate it with the draw manager. Allowing EEVEE, Workbench and other draw engines to use compute shaders with the introduction of `DRW_shgroup_call_compute` and `DRW_shgroup_vertex_buffer`. Future improvements are possible by generating the index buffer of hair directly on the GPU. NOTE: that compute shaders aren't supported by Apple and still use the transform feedback workaround. Reviewed By: fclem Differential Revision: https://developer.blender.org/D11057	2021-05-28 08:16:26 +02:00
Jeroen Bakker	2c607ec2f6	Revert "DrawManager: Use Compute Shader to Update Hair." This reverts commit `8f9599d17e`. Mac seems to have an error with this change. ``` ERROR: /Users/blender/git/blender-vdev/blender.git/source/blender/draw/intern/draw_hair.c:115:44: error: use of undeclared identifier 'shader_src' ERROR: /Users/blender/git/blender-vdev/blender.git/source/blender/draw/intern/draw_hair.c:123:13: error: use of undeclared identifier 'shader_src' ERROR: make[2]: * [source/blender/draw/CMakeFiles/bf_draw.dir/intern/draw_hair.c.o] Error 1 ERROR: make[1]: * [source/blender/draw/CMakeFiles/bf_draw.dir/all] Error 2 ERROR: make: *** [all] Error 2 ```	2021-05-26 20:32:05 +02:00
Jeroen Bakker	8f9599d17e	DrawManager: Use Compute Shader to Update Hair. This patch will use compute shaders to create the VBO for hair. The previous implementation uses tranform feedback. Timings master (transform feedback with GPU_USAGE_STATIC between 0.000069s and 0.000362s Timings transform feedback with GPU_USAGE_DEVICE_ONLY. between 0.000057s and 0.000122s Timings compute shader between 0.000032 and 0.000092s Future improvements: * Generate hair Index buffer using compute shaders: currently done single threaded on CPU, easy to add as compute shader. Reviewed By: fclem Differential Revision: https://developer.blender.org/D11057	2021-05-26 17:03:37 +02:00
Jeroen Bakker	87055dc71b	GPU: Compute Pipeline. With the compute pipeline calculation can be offloaded to the GPU. This patch only adds the framework for compute. So no changes for users at this moment. NOTE: As this is an OpenGL4.3 feature it must always have a fallback. Use `GPU_compute_shader_support` to check if compute pipeline can be used. Check `gpu_shader_compute*` test cases for usage. This patch also adds support for shader storage buffer objects and device only vertex/index buffers. An alternative that had been discussed was adding this to the `GPUBatch`, this was eventually not chosen as it would lead to more code when used as part of a shading group. The idea is that we add an `eDRWCommandType` in the near future. Reviewed By: fclem Differential Revision: https://developer.blender.org/D10913	2021-05-26 16:49:30 +02:00
Germano Cavalcante	3f4f109646	Python GPU: New 'capabilities' module This module exposes the capabilities defined in the GPU module in C. This will be useful for porting existing code in `bgl` to `gpu`. Reviewed By: fclem, brecht, campbellbarton Maniphest Tasks: T80730 Part of D11147	2021-05-14 12:26:40 -03:00
Jeroen Bakker	549d9f8727	Fix T81206: Do not limit gl texture size in image editor This patch will show textures in the image editor with the maximum available resolution determined by the GPU Hardware/Driver. Currently the size is limited by the user preference texture size limit. An image user can set the `IMA_SHOW_MAX_RESOLUTION` flag to request gpu textures in the max supported resolution. When this flag isn't set the gpu texture is limited by the user preference setting. When the gl resolution limit is disabled the GPU texture is always created for the max supported resolution. Reviewed By: Clément Foucault Maniphest Tasks: T81206 Differential Revision: https://developer.blender.org/D9160	2021-02-09 08:12:29 +01:00
Jeroen Bakker	a3fcbb54f4	GPU: Add HQ normals workaround. This change makes it possible for platforms to only support high quality normal rendering. This is part of {T82856} where current AMD drivers running on the polaris architecture does not support the low quality setting due to a driver bug. In a next commit the work around will be enabled.	2021-01-04 12:19:51 +01:00
Clément Foucault	136bdb561b	GPU: Add Image Load Store extension support This wraps the functionality used to speedup EEVEE volumetrics. This touches the rendering code of EEVEE as it should fix a mis-usage of the GL barrier. The barrier changed type and location, removing an unused barrier.	2020-09-12 15:29:54 +02:00
Clément Foucault	48690d967a	GPUContext: Move GPUContext to gpu::Context for more consistency This makes the GPUContext follow the same naming convention as the rest of the module. Also add a static getter for extra bonus style (no need for casts): - Context::get() - GLContext::get()	2020-09-08 04:15:50 +02:00
Clément Foucault	58353834f4	GPUCapabilities: Isolate GL memory statistics This is part of the Vulkan task T68990 This is a simple cleanup.	2020-09-07 20:17:45 +02:00
Clément Foucault	c5c6b5ddb3	GPUCapabilities: Isolate GL_STEREO to GLContext This is part of the Vulkan task T68990 This is a simple cleanup.	2020-09-07 20:17:45 +02:00
Clément Foucault	6b436b80a4	GPU: Rename gpu_extensions to gpu_capabilities This makes more sense as this module has more to it than just GL extensions.	2020-09-07 19:37:05 +02:00

50 Commits