test2

Author	SHA1	Message	Date
Miguel Pozo	a249e93ad1	GPU: Add missing virtual destructor to ShaderCompiler	2024-06-06 15:50:09 +02:00
Miguel Pozo	74224b25a5	GPU: Add GPU_shader_batch_create_from_infos This is the first commit of the several required to support subprocess-based parallel compilation on OpenGL. This provides the base API and implementation, and exposes the max subprocesses setting on the UI, but it's not used by any code yet. More information and the rest of the code can be found in #121925. This one includes: - A new `GPU_shader_batch` API that allows requesting the compilation of multiple shaders at once, allowing GPU backends to compile them in parallel and asynchronously without blocking the Blender UI. - A virtual `ShaderCompiler` class that backends can use to add their own implementation. - A `ShaderCompilerGeneric` class that implements synchronous/blocking compilation of batches for backends that don't have their own implementation yet. - A `GLShaderCompiler` that supports parallel compilation using subprocesses. - A new `BLI_subprocess` API, including IPC (required for the `GLShaderCompiler` implementation). - The implementation of the subprocess program in `GPU_compilation_subprocess`. - A new `Max Shader Compilation Subprocesses` option in `Preferences > System > Memory & Limits` to enable parallel shader compilation and set the max number of subprocesses to allocate (each subprocess has a relatively high memory footprint). Implementation Overview: There's a single `GLShaderCompiler` shared by all OpenGL contexts. This class stores a pool of up to `GCaps.max_parallel_compilations` subprocesses that can be used for compilation. Each subprocess has a shared memory pool used for sending the shader source code from the main Blender process and for receiving the already compiled shader binary from the subprocess. This is synchronized using a series of shared semaphores. The subprocesses maintain a shader cache on disk inside a `BLENDER_SHADER_CACHE` folder at the OS temporary folder. Shaders that fail to compile are compiled again locally for proper error reports. Hung subprocesses are currently detected using a timeout of 30s. Pull Request: https://projects.blender.org/blender/blender/pulls/122232	2024-06-05 18:45:57 +02:00
Hans Goudey	da1ea4cdd1	Revert "Draw: Avoid temporary copy for mesh triangulation index buffer" This reverts commit `108ab1df2d`. This causes issues when duplicating objects that I don't have time to investigate right now.	2024-05-23 23:43:34 -04:00
Hans Goudey	108ab1df2d	Draw: Avoid temporary copy for mesh triangulation index buffer The mesh triangulation data is stored in CPU memory with the same format as the triangles GPU index buffer. Because of that we can skip creating a temporary copied owned by the GPU API. One way to do that is to just upload the data directly and avoid keeping a reference to it. However, we can only upload GPU data from the main thread with OpenGL, so instead reference the data and keep track of whether to free it. When drawing a mesh with a single material and 1.8 million faces, this change gives a 12-15% improvement in framerate, from about 32 to 37 FPS. Part of #116901. Pull Request: https://projects.blender.org/blender/blender/pulls/122175	2024-05-23 19:59:36 +02:00
Jeroen Bakker	d2d1311023	OpenGL: Disable multi texture binding on Windows with Intel GPUs Multiple generations of Intel GPU have the same issue where multi texture binding results in invalid operations where the driver reports that the internal texture format isn't supported. Previously this was only enabled for UHD devices, but this PR enables it for any Intel GPU. It was detected to be faulty on UHD600 and Iris. Pull Request: https://projects.blender.org/blender/blender/pulls/121479	2024-05-06 14:57:25 +02:00
Jeroen Bakker	3d3dfb6518	Fix #120273 : GPU: UHD630 on Windows reports buggy extension On Windows the OpenGL backend of the UHD630 driver (but could also be other GPUs that use the same driver) reports support for `GL_ARB_multi_bind`. But enabling it can result in incorrect bindings and reported errors about unsupported internal texture formats. These are internal driver issues. Might also fix #107642 as it shows the same error message. EEVEE-Next relies more on using the same binding slot for the same texture in order to reduce actual bindings, which makes this more prominent. Pull Request: https://projects.blender.org/blender/blender/pulls/121062	2024-04-30 10:38:07 +02:00
Jacques Lucke	ed111e1907	Fix: assert when opening debug build with OpenGL backend Can't use `GPU_type_matches` there, because it requires `GPG.init` to be called beforehand. Pull Request: https://projects.blender.org/blender/blender/pulls/120928	2024-04-22 13:57:18 +02:00
Jeroen Bakker	0c2085a316	GPU: Remove GPU_compute_shader_support Compute shaders are required since 4.0. There was one occasion where an older AMD driver failed and support was turned off. This driver is now marked unsupported. This PR includes: - removing the check in viewport compositing - remove properties from system info - always construct draw manager. - remove unused pass logic in draw hair/curves - add deprecation warning when accessed from python Pull Request: https://projects.blender.org/blender/blender/pulls/120909	2024-04-22 13:28:10 +02:00
Jeroen Bakker	463a4c6211	Cleanup: Move specialization constant default hash The specialization constant default hash was implemented in gl_shader.hh, but the same implementation is needed for Vulkan. This PR moves the default hash to a common place where both backends can use it. Pull Request: https://projects.blender.org/blender/blender/pulls/120889	2024-04-21 16:56:00 +02:00
Clément Foucault	23348d4a5c	GL: VertBuf/IndexBuf: Add missing SSBO bind tracking Fix false positive errors with `--debug-gpu`.	2024-04-18 12:33:42 +02:00
Clément Foucault	d31b459927	GL: UniformBuf: Fix wrong positive missing resource error This happened when the buffer was bound as a SSBO as the slot was not marked as used. We do not tag the unbind as we never call `unbind` for SSBOs. Also we don't track what target type the UBO is bound to. This could be improved later.	2024-04-18 12:21:16 +02:00
Clément Foucault	f2ae04db10	GPU: Implement missing UBO/SSBO bind tracking This PR adds a context function to consider all buffer bindings obsolete. This is in order to track missing binds and invalid lingering states across `draw::Pass`es. The functions `GPU_storagebuf_debug_unbind_all` and `GPU_uniformbuf_debug_unbind_all` do nothing more than resetting the internal debug slot bits to zero. This is what the OpenGL backend does as it doesn't track the bindings themselves. Other backends might have other ways to detect missing bindings. If not they should be implemented separately anyway. I renamed the function to `debug_unbind_all` to denote that it actually does something related to debugging. This also adds an SSBO binding check for OpenGL as it was also missing. #### Future This error checking logic is pretty much backend agnostic. While it would be nice to move it to the `gpu::Context` level, we don't have the resources for that now. Pull Request: https://projects.blender.org/blender/blender/pulls/120716	2024-04-17 11:06:39 +02:00
Jason Fielder	be32bc5b72	Metal: Add AMD support for subpass transition Adds support for subpass transition for AMD/Intel IMR GPUs. This enables correct functioning of EEVEE Next deferred lighting pass on AMD platforms. The emulation is consistent with the OpenGL approach of generating additional texture bindings in the shader for subpass inputs, and splitting render passes across sub-pass boundaries. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/119784	2024-04-11 15:23:53 +02:00
Campbell Barton	7e9f7320e4	Cleanup: spelling in comments & comment blocks	2024-04-04 11:26:28 +11:00
Campbell Barton	d5d1025e94	Cleanup: use const pointer arguments	2024-04-03 10:22:05 +11:00
Campbell Barton	362d381a5a	Cleanup: pass GPUStateMutable as a const reference	2024-03-28 18:10:49 +11:00
Hans Goudey	893130e6fe	Refactor: Remove unnecessary C wrapper for GPUBatch class Similar to `fe76d8c946` Pull Request: https://projects.blender.org/blender/blender/pulls/119898	2024-03-26 03:06:25 +01:00
Hans Goudey	fe76d8c946	Refactor: Remove unnecessary C wrappers for vertex and index buffers Now that all relevant code is C++, the indirection from the C struct `GPUVertBuf` to the C++ `blender::gpu::VertBuf` class just adds complexity and necessitates a wrapper API, making more cleanups like use of RAII or other C++ types more difficult. This commit replaces the C wrapper structs with direct use of the vertex and index buffer base classes. In C++ we can choose which parts of a class are private, so we don't risk exposing too many implementation details here. Pull Request: https://projects.blender.org/blender/blender/pulls/119825	2024-03-24 16:38:30 +01:00
Hans Goudey	8b514bccd1	Cleanup: Move remaining GPU headers to C++ Pull Request: https://projects.blender.org/blender/blender/pulls/119807	2024-03-23 01:24:18 +01:00
Campbell Barton	57dd9c21d3	Cleanup: spelling in comments	2024-03-21 10:02:53 +11:00
Anthony Roberts	445fd42c61	Windows: Add ARM64 support * Only works on machines with a Qualcomm Snapdragon 8cx Gen3 or above. Older generation devices are not and will not be supported due to some driver issues * Requires VS2022 for building. * Uses new MSVC preprocessor for sse2neon compatibility. * SIMD is not enabled, waiting on conversion of blenlib to C++. Ref #119126 Pull Request: https://projects.blender.org/blender/blender/pulls/117036	2024-03-06 16:14:34 +01:00
Omar Emara	eb91828aab	GPU: Add maximum image units to GPU capabilities This patch adds the maximum number of supported image units to the GPU capabilities module. Currently, the GPU module assumes a maximum of 8 units, so the patch is not currently particularly useful, but we can consider committing it for the future anyways. Pull Request: https://projects.blender.org/blender/blender/pulls/119057	2024-03-05 07:25:20 +01:00
Miguel Pozo	c713fbc2d3	GPU: Allow printing full shader source on compilation error Add a define (DEBUG_LOG_SHADER_SRC_ON_ERROR ) in gpu_shader_private.h to print the full source code of shaders that fail to compile. Pull Request: https://projects.blender.org/blender/blender/pulls/116470	2024-02-26 17:30:15 +01:00
Jeroen Bakker	5698fb2049	RenderDoc: Set Capture Title Adds an option to set the capture title when using RenderDoc. `GPU_debug_capture_begin` has an optional `title` parameter to set the title of the RenderDoc capture. Pull Request: https://projects.blender.org/blender/blender/pulls/118649	2024-02-23 10:57:37 +01:00
Hans Goudey	61e61ce0e1	Cleanup: Use Span instead of Vector const reference Span is preferable since it's agnostic of the source container, makes it clearer that there is no ownership, is 8 bytes smaller, and can be passed by value.	2024-02-14 17:23:01 -05:00
Bastien Montagne	54618dbae3	Cleanup: Make `BKE_global.h` a Cpp header.	2024-02-10 18:25:14 +01:00
Miguel Pozo	98231ea880	GPU: Optimize GLStorageBuf::read performance Add a separate persistent mapped buffer where the main SSBO can be copied, so its contents can be read from the CPU without stalling the GPU. Pull Request: https://projects.blender.org/blender/blender/pulls/117521	2024-02-09 16:11:33 +01:00
Campbell Barton	8b827a5bb5	Cleanup: spelling in comments	2024-02-02 10:48:22 +11:00
Miguel Pozo	5d132ac0c6	GPU: Optimize OpenGL indirect drawing overhead `GLBatch::draw_indirect` has additional overhead compared to `GLBatch::draw`, and can become a bottleneck in scenes that require many draw calls (i.e. with too many unique meshes). The performance difference is almost exclusively caused by the `GL_COMMAND_BARRIER_BIT` barrier that happens on every call. This PR adds a `GPU_storagebuf_sync_as_indirect_buffer` function that can be used to place the barrier only once after filling the indirect buffer content. This function is a no-op in Vulkan and Metal since they don't need the barrier. Pull Request: https://projects.blender.org/blender/blender/pulls/117561	2024-02-01 17:26:08 +01:00
Clément Foucault	c0c3565714	GL: Remove geometry shader invocations workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc # Conflicts: # source/blender/gpu/opengl/gl_shader.cc Pull Request: https://projects.blender.org/blender/blender/pulls/116600	2024-01-31 18:13:02 +01:00
Clément Foucault	1ddc35ac88	GL: Remove texture gather workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc # Conflicts: # source/blender/gpu/opengl/gl_shader.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	749a3880de	GL: Remove cube map array workaround	2024-01-31 18:12:59 +01:00
Clément Foucault	856daa13a5	GL: Remove texture storage workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	0673ad344c	GL: Remove vertex attrib binding workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	a1c5a6b077	GL: Remove copy image workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	71904d7fb3	GL: Remove image load/store workaround # Conflicts: # source/blender/gpu/opengl/gl_backend.cc	2024-01-31 18:12:59 +01:00
Clément Foucault	64d1f065e3	GL: Remove base instance workaround	2024-01-31 18:12:59 +01:00
Clément Foucault	6722d40fd5	GL: Remove fixed restart index workaround	2024-01-31 18:12:59 +01:00
Jeroen Bakker	cd756143cf	OpenGL: Fix Shader Linking Error When a shader performs a geometry shader injection to work around features that are not supported natively on the GPU (viewport, barycentric coordinates, layered rendering), linking would fail. The reason was that the geometry shader was stored in a slot that was patched by the specialization constants, resulting in an empty geometry shader. An empty shader can be compiled, but doesn't match the interface with other stages, so the linking would fail. This fixes the issue that EEVEE crashed on Intel iGPUs. These GPUs don't support viewports. Pull Request: https://projects.blender.org/blender/blender/pulls/117440	2024-01-23 11:12:30 +01:00
Jeroen Bakker	18b5b0812b	Cleanup: OpenGL program creation and linking This PR changes where shader stages are attached to glPrograms. Previously it was done when shader stages were created, in the function create_shader_stage. This PR attaches the shader stages inside link program instead, ensuring that create_shader_stage doesn't alter the program, a side effect its name does not suggest. Pull Request: https://projects.blender.org/blender/blender/pulls/117407	2024-01-22 14:50:42 +01:00
Hans Goudey	21407901f8	Cleanup: Various clang tidy changes	2024-01-19 12:08:48 -05:00
Miguel Pozo	b5743dcf0a	Fix #117159 : GPU: Specialization constants binding deleted programs The lack of a move constructor in GLProgram was causing Map reallocations to delete their owned shaders/programs.	2024-01-18 17:12:24 +01:00
Miguel Pozo	333a5b513b	GPU: Assert framebuffer operations match attachment layout Ensure attachment states and load/store configs don't get out of sync with the framebuffer layout. In theory, a Framebuffer could have empty attachments interleaved with valid ones so checking just the attachments "length" is not enough. What this does instead is to ensure that valid attachments have a valid config and that null attachments either don't have a matching config or have an IGNORE/DONT_CARE one. Pull Request: https://projects.blender.org/blender/blender/pulls/117073	2024-01-15 13:25:20 +01:00
Jeroen Bakker	f4632e1da0	Fix: EEVEE: Potential Read From Unallocated Memory Generated copies of GLSL sources are kept in a std::string and it was always accessed by a long living StringRefNull, which led to a potential read from unallocated memory as std::strings are not null terminated. Pull Request: https://projects.blender.org/blender/blender/pulls/117120	2024-01-15 08:27:17 +01:00
Jeroen Bakker	4ac0267567	OpenGL: Specialization Constants This PR adds support for specialization constants for the OpenGL backend. The minimum OpenGL version we are targeting doesn't have native support for specialization constants. We simulate this by keeping track of shader programs for each set of specialization constants that are being used. Specialization constants can be used to reduce shader complexity and improve performance as less register usage and/or spilling occurs. This requires the ability to recompile GLShaders. In order to do this we need to keep track of the sources that were used when the shader was compiled. For static sources we only store references (`GLSource::source_ref`), for dynamically generated sources we keep a copy of the source (`GLSource::source`). When recompiling the shader, GLSL source code is generated for the constants stored in `Shader::constants`. When compiling, the previous GLSource that contains specialization constants is then replaced by the new version. Pull Request: https://projects.blender.org/blender/blender/pulls/116926	2024-01-12 14:28:50 +01:00
Clément Foucault	98e465109b	EEVEE-Next: Replace lighting tiles by direct stencil setup This avoids the cost of creating the tiles themselves, which uses a lot of texture writes. This was a bottleneck on Apple GPUs. Also the per pixel classification allows us to remove certain checks in the deferred lighting shader, making it faster. ### TODO - [x] Add gl_FragStencilRefARB support on other backend - [x] Add workaround for when gl_FragStencilRefARB isn't supported Pull Request: https://projects.blender.org/blender/blender/pulls/116704	2024-01-08 07:35:05 +01:00
Campbell Barton	617f7b76df	Cleanup: comment block formatting	2024-01-08 11:31:43 +11:00
Brecht Van Lommel	d377ef2543	Clang Format: bump to version 17 Along with the 4.1 libraries upgrade, we are bumping the clang-format version from 8-12 to 17. This affects quite a few files. If not already the case, you may consider pointing your IDE to the clang-format binary bundled with the Blender precompiled libraries.	2024-01-03 13:38:14 +01:00
Clément Foucault	1c96d0d861	Metal: Improve shader logging This adds some `#line` directives between the source file injections so that the log parser knows which file the errors originated from. This is then followed by a scan over the combined source to find out the real row number. This needed some changes in `Shader::print_log` to skip lines to avoid outputting redundant information.	2024-01-01 00:43:09 +13:00
Clément Foucault	7d6b8737bf	Fix #116623 : GL/VK: Specialization constant error Using defines leads to syntax errors. Use global constants instead.	2023-12-30 11:09:15 +13:00

519 Commits