griefith/test

Author	SHA1	Message	Date
Campbell Barton	0ef033750f	Cleanup: pass arguments by const reference	2024-03-28 17:16:33 +11:00
Hans Goudey	8b514bccd1	Cleanup: Move remaining GPU headers to C++ Pull Request: https://projects.blender.org/blender/blender/pulls/119807	2024-03-23 01:24:18 +01:00
Miguel Pozo	5d132ac0c6	GPU: Optimize OpenGL indirect drawing overhead `GLBatch::draw_indirect` has additional overhead compared to `GLBatch::draw`, and can become a bottleneck in scenes that require many draw calls (ie. with too many unique meshes). The performance difference is almost exclusively caused by the `GL_COMMAND_BARRIER_BIT` barrier that happens on every call. This PR adds a `GPU_storagebuf_sync_as_indirect_buffer` function that can be used to place the barrier only once after filling the indirect buffer content. This function is a no-op in Vulkan and Metal since they don't need the barrier. Pull Request: https://projects.blender.org/blender/blender/pulls/117561	2024-02-01 17:26:08 +01:00
Hans Goudey	5b55c1dc10	Cleanup: Move five draw headers to C++	2024-01-05 13:26:22 -05:00
Jason Fielder	335d3a1b75	GPU: Add Shader specialization constant API Adds API to allow usage of specialization constants in shaders. Specialization constants are dynamic runtime constants which can be compiled into a shader pipeline state object (PSO) to improve runtime performance by reducing shader complexity through shader compiler constant-folding. This API allows specialization constant values to be specified along with a default value if no constant value has been declared. Each GPU backend is then responsible for caching PSO permutations against the current specialization configuration. This patch adds support for specialization constants in the Metal backend and provides a generalised high-level solution which can be adopted by other graphics APIs supporting this feature. Authored by Apple: Michael Parkin-White Authored by Blender: Clément Foucault (files in gpu/test folder) Pull Request: https://projects.blender.org/blender/blender/pulls/115193	2023-12-28 05:34:38 +01:00
Miguel Pozo	4dc1c23384	Fix #114742 : Draw: Buffers never shrink The buffers from the new Draw Manager increase their size as needed, but they never shrink. Add `StorageArrayBuffer::trim_to_next_power_of_2` function that can downsize the buffer following the same heuristic as `get_or_resize`. Add `StorageVectorBuffer::trim_and_clear`, which calls `trim_to_next_power_of_2` automatically. Pull Request: https://projects.blender.org/blender/blender/pulls/114857	2023-11-20 12:23:12 +01:00
Campbell Barton	e7e4e63313	Cleanup: spelling in comments, white-space in comments	2023-10-19 18:53:16 +11:00
Jeroen Bakker	62f721467b	Merge branch 'blender-v4.0-release'	2023-10-19 08:03:51 +02:00
Jason Fielder	62219f8da9	Metal: Re-enable workbench NEXT shadows With the shift to GPU-driven rendering pipeline, the SSBO vertex fetch paradigm used to implement workbench shadows on Metal instead of utilising the geometry shader path no longer worked correctly. This is because the draw submission required vertex amplification up-front, based on the expected output geometry amount for a given input geometry. This patch aims to resolve this issue through addition of API to enable the features within the GPU driven pipeline. Co-authored-by: Michael Parkin-White <mparkinwhite@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/113498	2023-10-19 08:01:17 +02:00
Jeroen Bakker	26adfcdb57	Revert "Metal: Re-enable workbench NEXT shadows" This reverts commit `95f01288b0`. It fails on non-apple platforms.	2023-10-13 11:05:02 +02:00
Jason Fielder	95f01288b0	Metal: Re-enable workbench NEXT shadows With the shift to GPU-driven rendering pipeline, the SSBO vertex fetch paradigm used to implement workbench shadows on Metal instead of utilising the geometry shader path no longer worked correctly. This is because the draw submission required vertex amplification up-front, based on the expected output geometry amount for a given input geometry. This WIP patch aims to resolve this issue through addition of API to enable the features within the GPU driven pipeline. Co-authored-by: Michael Parkin-White <mparkinwhite@apple.com> Pull Request: https://projects.blender.org/blender/blender/pulls/113498	2023-10-13 11:02:06 +02:00
Clément Foucault	eda7926834	DRW: Add SubPassTransition pass command	2023-10-01 18:01:15 +02:00
Campbell Barton	e955c94ed3	License Headers: Set copyright to "Blender Authors", add AUTHORS Listing the "Blender Foundation" as copyright holder implied the Blender Foundation holds copyright to files which may include work from many developers. While keeping copyright on headers makes sense for isolated libraries, Blender's own code may be refactored or moved between files in a way that makes the per file copyright holders less meaningful. Copyright references to the "Blender Foundation" have been replaced with "Blender Authors", with the exception of `./extern/` since these this contains libraries which are more isolated, any changed to license headers there can be handled on a case-by-case basis. Some directories in `./intern/` have also been excluded: - `./intern/cycles/` it's own `AUTHORS` file is planned. - `./intern/opensubdiv/`. An "AUTHORS" file has been added, using the chromium projects authors file as a template. Design task: #110784 Ref !110783.	2023-08-16 00:20:26 +10:00
Miguel Pozo	ff470f3f2e	EEVEE Next: Volumes Port of EEVEE unified volume rendering to EEVEE Next, using compute shaders. Improvements: - Skip empty volume outside object bounds. (Large performance increase) Currently missing: - Shadows and irradiance integration. - Grid-space TAA. Main Task: #105672 Pull Request: https://projects.blender.org/blender/blender/pulls/107176	2023-08-04 16:47:16 +02:00
Campbell Barton	65f99397ec	License headers: use SPDX-FileCopyrightText in all sources	2023-06-15 13:35:34 +10:00
Sergey Sharybin	c1bc70b711	Cleanup: Add a copyright notice to files and use SPDX format A lot of files were missing copyright field in the header and the Blender Foundation contributed to them in a sense of bug fixing and general maintenance. This change makes it explicit that those files are at least partially copyrighted by the Blender Foundation. Note that this does not make it so the Blender Foundation is the only holder of the copyright in those files, and developers who do not have a signed contract with the foundation still hold the copyright as well. Another aspect of this change is using SPDX format for the header. We already used it for the license specification, and now we state it for the copyright as well, following the FAQ: https://reuse.software/faq/	2023-05-31 16:19:06 +02:00
Omar Emara	ff3b2226fb	GPU: Refactor texture samplers This patch refactors the texture samples code by mainly splitting the eGPUSamplerState enum into multiple smaller enums and packing them inside a GPUSamplerState struct. This was done because many members of the enum were mutually exclusive, which was worked around during setting up the samplers in the various backends, and additionally made the API confusing, like the GPU_texture_wrap_mode function, which had two mutually exclusive parameters. The new structure also improved and clarified the backend sampler cache, reducing the cache size from 514 samplers to just 130 samplers, which also slightly improved the initialization time. Further, the GPU_SAMPLER_MAX signal value was naturally incorporated into the structure using the GPU_SAMPLER_STATE_TYPE_INTERNAL type. The only expected functional change is in the realtime compositor, which now supports per-axis repetition control, utilizing new API functions for that purpose. This patch is loosely based on an older patch D14366 by Ethan Hall. Pull Request: https://projects.blender.org/blender/blender/pulls/105642	2023-04-04 15:16:07 +02:00
Miguel Pozo	59b9bb0849	Draw: Custom IDs This pull request adds a new tipe of resource handles (thin handles). These are intended for cases where a resource buffer with more than one entry for each object is needed (for example, one entry per material slot). While it's already possible to have multiple regular handles for the same object, they have a non-trivial overhead in terms of uploaded data (matrix, bounds, object info) and computation (visibility culling). Thin handles store an indirection buffer pointing to their "parent" regular handle, therefore multiple thin handles can share the same per-object data and visibility culling computation. Thin handles can only be used in their own Pass type (PassMainThin), so passes that don't need them don't have to pay the overhead. This pull request also includes the update of the Workbench Next pre-pass to use PassMainThin, which is the main reason for the implementation of this feature. The main change from the previous PR is that the thin handles are now stored directly in the main resource_id_buf, to avoid wasting an extra bind slot. Pull Request #105261	2023-03-01 21:42:25 +01:00
Clément Foucault	75e3371aef	GPUTexture: Remove obsolete GPU_texture_bind_ex argument set_number This was previously used when the binding number wasn't always stored inside the texture itself.	2023-02-25 11:39:53 +01:00
Clément Foucault	dd171f7743	Cleanup: GPUShader: Rename `GPU_shader_uniform_vector` Rename to `GPU_shader_uniform_float/int_ex` to make more sense as a general purpose function.	2023-02-13 11:22:38 +01:00
Clément Foucault	164f591033	Cleanup: GPU: Rename some functions for consistency	2023-02-13 11:22:38 +01:00
Clément Foucault	b0b9e746fa	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the oportunity to remove some C vector functions (ex: copy_v2_v2). This makes some changes to DRWView to accomodate the alignement requirements of the float4x4 type.	2023-02-06 21:25:45 +01:00
Ray Molenkamp	b5e00a1482	Revert "BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh" This reverts commit `52de84b0db`. had some build issues on windows i can't quickly resolve, revert for now while we fix the problems	2023-02-02 11:46:23 -07:00
Clément Foucault	52de84b0db	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the oportunity to remove some C vector functions (ex: `copy_v2_v2`). This makes some changes to DRWView to accomodate the alignement requirements of the float4x4 type.	2023-02-02 18:11:35 +01:00
Clément Foucault	9c54f2655d	DRW: Add double buffering of objects matrices, bounds, and infos This allows easy delta calculation and access to last known position of deleted objects.	2023-01-18 15:36:46 +01:00
Clément Foucault	363e5e28ee	DRW: Fix issues with multiview Resource ids buf must be allocated for the worst case scenario. Also fix issue with non procedural data overriding procedural view.	2022-12-29 18:23:39 +01:00
Clément Foucault	7d2dbe7849	DRW: Pass: Add bind_ssbo for indexbuf This is a simple wrapper to GPU_indexbuf_bind_as_ssbo. Add simple recording test for and fix other test.	2022-12-22 14:19:58 +01:00
Hans Goudey	a5e7657cee	BLI: Remove clamping from span slicing Currently slicing a span clamped the final size so that it would be within bounds of the input. However, in the vast majority of cases that is already the case anyway, and we can use asserts to detect when that assumption fails. The clamping had a performance cost. On a test interpolating a boolean attribute from 1 million curves to 4 million points, removing the clamping saved about 10% of the time. That's an extreme case but this probably slightly improves performance in other cases too. Slicing is used a lot in the new curve code. This commit introduces `slice_safe` which still does the clamping, and uses it in the few places that needed it or where I wasn't sure.	2022-11-22 11:29:24 -06:00
Clément Foucault	d775995dc3	DRW: Manager: Add possibility to bind UBO and VBO as SSBO through commands This exposes `GPU_uniformbuf_bind_as_ssbo` and `GPU_vertbuf_bind_as_ssbo` through the `draw::Pass` API.	2022-11-15 20:16:25 +01:00
Clément Foucault	bfb6ea898b	DRW: View: Add base for multi-view support This implements the base needed for supporting multiple view concurently inside the same drawcall. The view used by common macros and view related functions is indexed using a global variable `drw_view_id` which can be set arbitrarly or read from the `drw_ResourceID`. This is needed for EEVEE-Next shadow but can be used for other purpose in the future. Note that a shader specialization is needed for it to work. `DRW_VIEW_LEN` needs to be defined to the amount of view the shader will access. The number of views contained in a `draw::View` is set at construction time. Note that the maximum number of object correctly drawn by the shaders using multiple views will be lower than thoses who don't.	2022-11-14 11:17:38 +01:00
Clément Foucault	ce9fcb15a3	DRW: Manager: Fix `ClearMulti` breaking compilation on Mac The error was: `draw_pass.hh:1055:16: error: call to implicitly-deleted default constructor of 'blender::draw::command::Undetermined [3]'	2022-11-13 18:02:17 +01:00
Clément Foucault	c255be2d02	DRW: Manager: Add `bind_texture` command for vertex buffer This allows the same behavior as with `DRW_shgroup_buffer_texture`.	2022-11-13 16:47:43 +01:00
Clément Foucault	cd64615425	DRW: Manager: Add `ClearMulti` command Allows to record `GPU_framebuffer_multi_clear` inside `draw::Pass`.	2022-11-13 16:23:22 +01:00
Clément Foucault	930d14cc62	DRW: Manager: Finish / change implementation of `framebuffer_set` command Use reference instead of direct pointer. This is because framebuffers often use temp textures and are configured later just before submission.	2022-11-13 16:16:26 +01:00
Clément Foucault	9dfc134c9d	DRW: Fix incorrect logic in state redundancy check Error introduced by rB3c39a3affee7.	2022-11-03 19:41:36 +01:00
Clément Foucault	3c39a3affe	DRW: Add support for clip plane count as part of the draw state. This moves the implementation from the View to the draw manager itself. However, this is not its final place and should be moved to the shader create info at some point in the future. For now it is not possible because of possible interaction with the old draw manager codebase.	2022-11-03 17:03:22 +01:00
Clément Foucault	77749eff87	DRW: Manager: Add possibility to record a framebuffer change inside a pass This is a convenience when one needs to often change the current framebuffer and avoid the overhead of creating many Main/Simple passes.	2022-10-30 15:00:28 +01:00
Campbell Barton	f68cfd6bb0	Cleanup: replace C-style casts with functional casts for numeric types	2022-09-25 20:17:08 +10:00
Campbell Barton	6c6a53fad3	Cleanup: spelling in comments, formatting, move comments into headers	2022-09-06 16:25:20 +10:00
Clément Foucault	65ad36f5fd	DRWManager: New implementation. This is a new implementation of the draw manager using modern rendering practices and GPU driven culling. This only ports features that are not considered deprecated or to be removed. The old DRW API is kept working along side this new one, and does not interfeer with it. However this needed some more hacking inside the draw_view_lib.glsl. At least the create info are well separated. The reviewer might start by looking at `draw_pass_test.cc` to see the API in usage. Important files are `draw_pass.hh`, `draw_command.hh`, `draw_command_shared.hh`. In a nutshell (for a developper used to old DRW API): - `DRWShadingGroups` are replaced by `Pass<T>::Sub`. - Contrary to DRWShadingGroups, all commands recorded inside a pass or sub-pass (even binds / push_constant / uniforms) will be executed in order. - All memory is managed per object (except for Sub-Pass which are managed by their parent pass) and not from draw manager pools. So passes "can" potentially be recorded once and submitted multiple time (but this is not really encouraged for now). The only implicit link is between resource lifetime and `ResourceHandles` - Sub passes can be any level deep. - IMPORTANT: All state propagate from sub pass to subpass. There is no state stack concept anymore. Ensure the correct render state is set before drawing anything using `Pass::state_set()`. - The drawcalls now needs a `ResourceHandle` instead of an `Object *`. This is to remove any implicit dependency between `Pass` and `Manager`. This was a huge problem in old implementation since the manager did not know what to pull from the object. Now it is explicitly requested by the engine. - The pases need to be submitted to a `draw::Manager` instance which can be retrieved using `DRW_manager_get()` (for now). Internally: - All object data are stored in contiguous storage buffers. Removing a lot of complexity in the pass submission. - Draw calls are sorted and visibility tested on GPU. Making more modern culling and better instancing usage possible in the future. - Unit Tests have been added for regression testing and avoid most API breakage. - `draw::View` now contains culling data for all objects in the scene allowing caching for multiple views. - Bounding box and sphere final setup is moved to GPU. - Some global resources locations have been hardcoded to reduce complexity. What is missing: - ~~Workaround for lack of gl_BaseInstanceARB.~~ Done - ~~Object Uniform Attributes.~~ Done (Not in this patch) - Workaround for hardware supporting a maximum of 8 SSBO. Reviewed By: jbakker Differential Revision: https://developer.blender.org/D15817	2022-09-02 18:45:14 +02:00

40 Commits