test2

Author	SHA1	Message	Date
Miguel Pozo	f8c964de87	GPU: Improve the shader cache GC behavior Improves the shader cache GC behavior, so the passes are only deleted after being orphaned for over 1 minute. Pull Request: https://projects.blender.org/blender/blender/pulls/106386	2023-04-10 11:39:00 +02:00
Omar Emara	ff3b2226fb	GPU: Refactor texture samplers This patch refactors the texture samples code by mainly splitting the eGPUSamplerState enum into multiple smaller enums and packing them inside a GPUSamplerState struct. This was done because many members of the enum were mutually exclusive, which was worked around during setting up the samplers in the various backends, and additionally made the API confusing, like the GPU_texture_wrap_mode function, which had two mutually exclusive parameters. The new structure also improved and clarified the backend sampler cache, reducing the cache size from 514 samplers to just 130 samplers, which also slightly improved the initialization time. Further, the GPU_SAMPLER_MAX signal value was naturally incorporated into the structure using the GPU_SAMPLER_STATE_TYPE_INTERNAL type. The only expected functional change is in the realtime compositor, which now supports per-axis repetition control, utilizing new API functions for that purpose. This patch is loosely based on an older patch D14366 by Ethan Hall. Pull Request: https://projects.blender.org/blender/blender/pulls/105642	2023-04-04 15:16:07 +02:00
Clément Foucault	7592ec35d3	GPU: Fix compilation with option WITH_GPU_BUILDTIME_SHADER_BUILDER Breakage caused by `84c93f3a06`	2023-04-01 17:16:54 +02:00
Clément Foucault	e652d0002b	GPU: FrameBuffer: Fix empty framebuffer update The framebuffer default size was only set during the first bind. This is because the `dirty_attachments_ tag` wasn't set and thus the framebuffer size was never passed down to the GL. Split to `default_size_set()` to not affect other code paths that use `size_set()`.	2023-04-01 13:24:48 +02:00
Sergey Sharybin	a12a8a71bb	Remove "All Rights Reserved" from Blender Foundation copyright code The goal is to solve confusion of the "All rights reserved" for licensing code under an open-source license. The phrase "All rights reserved" comes from a historical convention that required this phrase for the copyright protection to apply. This convention is no longer relevant. However, even though the phrase has no meaning in establishing the copyright it has not lost meaning in terms of licensing. This change makes it so code under the Blender Foundation copyright does not use "all rights reserved". This is also how the GPL license itself states how to apply it to the source code: <one line to give the program's name and a brief idea of what it does.> Copyright (C) <year> <name of author> This program is free software ... This change does not change copyright notice in cases when the copyright is dual (BF and an author), or just an author of the code. It also does mot change copyright which is inherited from NaN Holding BV as it needs some further investigation about what is the proper way to handle it.	2023-03-30 10:51:59 +02:00
Sergey Sharybin	d32d787f5f	Clang-Format: Allow empty functions to be single-line For example ``` OIIOOutputDriver::~OIIOOutputDriver() { } ``` becomes ``` OIIOOutputDriver::~OIIOOutputDriver() {} ``` Saves quite some vertical space, which is especially handy for constructors. Pull Request: https://projects.blender.org/blender/blender/pulls/105594	2023-03-29 16:50:54 +02:00
Campbell Barton	1ddbe7cadd	Cleanup: move doc-strings into headers, remove duplicates In some cases move implementation details into the function body.	2023-03-29 14:37:34 +11:00
Jeroen Bakker	08a6361b3f	Vulkan: Texture Data Conversions This PR adds basic support for texture update, read back and clearing for Vulkan. In Vulkan we need to convert each data type ourselves as vulkan buffers are untyped. Therefore this change mostly is about data conversions. Considerations: - Use a compute shader to do the conversions: - Leads to performance regression as compute pipeline can stall graphics pipeline - Lead to additional memory usage as two staging buffers are needed one to hold the CPU data, and one to hold the converted data. - Do inline conversion when sending the data to Vulkan using `eGPUDataFormat` - Additional CPU cycles required and not easy to optimize as it the implementation requires many branches. - Do inline conversion when sending the data to Vulkan (optimized for CPU) For this solution it was chosen to implement the 3rd option as it is fast and doesn't require additional memory what the other options do. Use Imath/half.h This patch uses `Imath/half.h` (dependency of OpenEXR) similar to alembic. But this makes vulkan dependent of the availability of OpenEXR. For now this isn't checked, but when we are closer to a working Vulkan backend we have to make a decision how to cope with this dependency. Missing Features Framebuffer textures This doesn't include all possible data transformations. Some of those transformation can only be tested after the VKFramebuffer has been implemented. Some texture types are only available when created for a framebuffer. These include the depth and stencil variations. Component format Is more relevant when implementing VKVertexBuffer. SRGB textures SRGB encoded textures aren't natively supported on all platforms, in all usages and might require workarounds. This should be done in a separate PR in a later stage when we are required to use SRGB textures. Test cases The added test cases gives an overview of the missing bits and pieces of the patch. When the implementation/direction is accepted more test cases can be enabled/implemented. Some of these test cases will skip depending on the actual support of platform the tests are running on. For example OpenGL/NVidia will skip the next test as it doesn't support the texture format on OpenGL, although it does support it on Vulkan. ``` [ RUN ] GPUOpenGLTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ SKIPPED ] GPUOpenGLTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ RUN ] GPUVulkanTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ OK ] GPUVulkanTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI ``` Pull Request: https://projects.blender.org/blender/blender/pulls/105762	2023-03-24 08:09:19 +01:00
Campbell Barton	2ba1556e69	Cleanup: spelling in comments, use doxygen syntax	2023-03-22 12:22:55 +11:00
Campbell Barton	db762d5508	Cleanup: various C/C++ code cleanups, use utility macros	2023-03-21 19:47:21 +11:00
Hans Goudey	16fbadde36	Mesh: Replace MLoop struct with generic attributes Implements #102359. Split the `MLoop` struct into two separate integer arrays called `corner_verts` and `corner_edges`, referring to the vertex each corner is attached to and the next edge around the face at each corner. These arrays can be sliced to give access to the edges or vertices in a face. Then they are often referred to as "poly_verts" or "poly_edges". The main benefits are halving the necessary memory bandwidth when only one array is used and simplifications from using regular integer indices instead of a special-purpose struct. The commit also starts a renaming from "loop" to "corner" in mesh code. Like the other mesh struct of array refactors, forward compatibility is kept by writing files with the older format. This will be done until 4.0 to ease the transition process. Looking at a small portion of the patch should give a good impression for the rest of the changes. I tried to make the changes as small as possible so it's easy to tell the correctness from the diff. Though I found Blender developers have been very inventive over the last decade when finding different ways to loop over the corners in a face. For performance, nearly every piece of code that deals with `Mesh` is slightly impacted. Any algorithm that is memory bottle-necked should see an improvement. For example, here is a comparison of interpolating a vertex float attribute to face corners (Ryzen 3700x): Before (Average: 3.7 ms, Min: 3.4 ms) ``` threading::parallel_for(loops.index_range(), 4096, [&](IndexRange range) { for (const int64_t i : range) { dst[i] = src[loops[i].v]; } }); ``` After (Average: 2.9 ms, Min: 2.6 ms) ``` array_utils::gather(src, corner_verts, dst); ``` That's an improvement of 28% to the average timings, and it's also a simplification, since an index-based routine can be used instead. For more examples using the new arrays, see the design task. Pull Request: https://projects.blender.org/blender/blender/pulls/104424	2023-03-20 15:55:13 +01:00
Campbell Barton	3b8a050fc1	Cleanup: use function-style casts, ELEM & STREQ macros	2023-03-17 16:45:42 +11:00
Jason Fielder	3d9d67594c	GPU: Add GPU frame capture support. Adds two modes of GPU frame capture support for enhanced debugging. GPU frame capture begin/end allow instantaneous frame capture of all GPU commands within the capture boundary. GPU frame capture scopes allow several user-defined capture regions which can wrap key parts of code. These scopes are exposed to connected GPU tools allowing the user to manually trigger a capture of a known scope at the desired time. This is currently integrated with the Metal backend for support with Xcode. Related to #105591 Pull Request: https://projects.blender.org/blender/blender/pulls/105717	2023-03-16 08:54:05 +01:00
Jeroen Bakker	397f6250a5	Merge branch 'blender-v3.5-release'	2023-03-16 08:28:21 +01:00
Jason Fielder	d3409f2159	Fix: Uncached Metal Materials not Being Released Optimized node graphs do not get cached and were not correctly freed once their reference count reached zero, due to being excluded from the GPUPass garbage collection. Also suppress Metal shader warnings, which are prevalent during material optimization. Authored by Apple: Michael Parkin-White Pull Request: https://projects.blender.org/blender/blender/pulls/105795	2023-03-16 08:19:32 +01:00
Miguel Pozo	1fc892744c	GPU: Fix: Use 2 slots for each UDIMs texture instead of 4 Create GPUNodeLinks for tiled_image and tiled_image_mapping together, to ensure they are the same texture. See https://projects.blender.org/blender/blender/issues/105661#issuecomment-900351 for context and a more in-depth explanation. Pull Request: https://projects.blender.org/blender/blender/pulls/105772	2023-03-15 17:58:25 +01:00
Hans Goudey	1dc57a89e9	Mesh: Move functions to C++ header Refactoring mesh code, it has become clear that local cleanups and simplifications are limited by the need to keep a C public API for mesh functions. This change makes code more obvious and makes further refactoring much easier. - Add a new `BKE_mesh.hh` header for a C++ only mesh API - Introduce a new `blender::bke::mesh` namespace, documented here: https://wiki.blender.org/wiki/Source/Objects/Mesh#Namespaces - Move some functions to the new namespace, cleaning up their arguments - Move code to `Array` and `float3` where necessary to use the new API - Define existing inline mesh data access functions to the new header - Keep some C API functions where necessary because of RNA - Move all C++ files to use the new header, which includes the old one In the future it may make sense to split up `BKE_mesh.hh` more, but for now keeping the same name as the existing header keeps things simple. Pull Request: https://projects.blender.org/blender/blender/pulls/105416	2023-03-12 22:29:15 +01:00
Jeroen Bakker	af5a115f65	GPU: Refactor API for Clearing Storage Buffers The previous API for clearing storage buffers was following the OpenGL api. OpenGL has many options to support for data conversions, striding and sizzling. Metal and Vulkan don't have these features and we have to deal it ourselves. Blender internally only uses a tiny subset for what is possible in OpenGL. Making the current API to difficult to implement on our future platforms as we had to implement all cases, most even not used at all. By changing the API we make future development easier as we only need to implement what we are actually using. New API `GPU_storagebuf_clear(GPUStorageBuf* ssbo, uint32_t clear_value)` Related issue: #105492 Pull Request: https://projects.blender.org/blender/blender/pulls/105521	2023-03-09 18:46:28 +01:00
Campbell Barton	b3625e6bfd	Cleanup: comment blocks	2023-03-09 10:39:49 +11:00
Campbell Barton	5a004ccc6a	Cleanup: use function style casts, nullptr	2023-03-07 15:59:14 +11:00
Clément Foucault	4862d56a0e	GPUFrameBuffer: Document and cleanup header No fonctional changes.	2023-03-05 17:57:51 +01:00
Campbell Barton	e22eb23bc2	Cleanup: match header/source arguments & quiet cppheck warnings	2023-03-04 15:19:00 +11:00
Hans Goudey	3022a805ca	Cleanup: Standardize mesh edge and poly naming With the goal of clearly differentiating between arrays and single elements, improving consistency across Blender, and using wording that's easier to read and say, change variable names for Mesh edges and polygons/faces. Common renames are the following, with some extra prefixes, etc. - `mpoly` -> `polys` - `mpoly`/`mp`/`p` -> `poly` - `medge` -> `edges` - `med`/`ed`/`e` -> `edge` `MLoop` variables aren't affected because they will be replaced when they're split up into to arrays in #104424.	2023-03-01 15:58:01 -05:00
Miguel Pozo	59b9bb0849	Draw: Custom IDs This pull request adds a new tipe of resource handles (thin handles). These are intended for cases where a resource buffer with more than one entry for each object is needed (for example, one entry per material slot). While it's already possible to have multiple regular handles for the same object, they have a non-trivial overhead in terms of uploaded data (matrix, bounds, object info) and computation (visibility culling). Thin handles store an indirection buffer pointing to their "parent" regular handle, therefore multiple thin handles can share the same per-object data and visibility culling computation. Thin handles can only be used in their own Pass type (PassMainThin), so passes that don't need them don't have to pay the overhead. This pull request also includes the update of the Workbench Next pre-pass to use PassMainThin, which is the main reason for the implementation of this feature. The main change from the previous PR is that the thin handles are now stored directly in the main resource_id_buf, to avoid wasting an extra bind slot. Pull Request #105261	2023-03-01 21:42:25 +01:00
Campbell Barton	efb86b75ee	Cleanup: comment block formatting	2023-02-27 21:51:57 +11:00
Clément Foucault	8b543bfa3a	Merge branch 'blender-v3.5-release' # Conflicts: # source/blender/gpu/metal/mtl_immediate.mm	2023-02-26 17:44:31 +01:00
Jason Fielder	2b7707b0d0	Fix #105059 : Fix Grease pencil fill tool with Metal. GPencil 3D stroke rendering uses a geometry shader. This is unsupported by the Metal backend, so implement fix for this failing compilation by shifting geometry shader logic into the Vertex shader for Metal backend. Authored by Apple: Michael Parkin-White Ref #96261 Pull Request #105143	2023-02-26 15:22:13 +01:00
Jason Fielder	fb63e484b9	Fix #103398 : Fix Icon sampler initialization in Metal backend. Resolves issue with nearest filtering on UI Icons. Note that as Metal does not support LOD bias as a parameter on a sampler object, the original code has been modified to perform LOD biasing at the shader level. As GPU_SAMPLER_ICON is not widely used, it is more efficient to apply directly to the affected shaders, rather than workaround passing in the sampler LOD bias as a separate value e.g. uniform or push constant. Original PR feedback addressed to also refactor ICON shaders to use consistent style for single and multi Icon rendering. Authored by Apple: Michael Parkin-White Ref #96261 Pull Request #105145	2023-02-26 13:23:40 +01:00
Clément Foucault	52ded6ab56	GPUTexture: Make sure all available texture format are supported This remove default casses from the `switch` statements to catch where the missing cases are. Uncomment unimplemented cases for the sake of completeness. Improving the overall API. This make the format conversion lists exhaustive and documented. This replace `validate_data_format_mtl` by the common version as they don't differ at all now.	2023-02-25 21:47:00 +01:00
Clément Foucault	9fb1f32f06	Cleanup: GPUTexture: Remove _ex suffix from texture creation It isn't relevant anymore now that usage flags are mandatory. Pull Request #105197	2023-02-25 11:39:54 +01:00
Clément Foucault	3d6578b33e	GPUTexture: Add texture usage flag to all texture creation All usages are currently placeholder GPU_TEXTURE_USAGE_GENERAL.	2023-02-25 11:39:53 +01:00
Clément Foucault	e01b140fb2	GPUTexture: Remove data_format from 3D texture creation function For every other texture types this is expected to be implicitly `GPU_DATA_FLOAT`. There is only one case where this is not the case. I believe this was previously needed because the data type was conditionning the texture creation. This is not the case anymore.	2023-02-25 11:39:53 +01:00
Clément Foucault	75e3371aef	GPUTexture: Remove obsolete GPU_texture_bind_ex argument set_number This was previously used when the binding number wasn't always stored inside the texture itself.	2023-02-25 11:39:53 +01:00
Clément Foucault	73da5ee90d	Cleanup: GPUTexture: Rename some functions with more descriptive names List of renames: GPU_texture_generate_mipmap > GPU_texture_update_mipmap_chain GPU_texture_orig_width > GPU_texture_original_width GPU_texture_orig_height > GPU_texture_original_height GPU_texture_orig_size_set > GPU_texture_original_size_set GPU_texture_format_description > GPU_texture_format_name GPU_texture_array > GPU_texture_is_array GPU_texture_cube > GPU_texture_is_cube GPU_texture_depth > GPU_texture_has_depth_format GPU_texture_stencil > GPU_texture_has_stencil_format GPU_texture_integer > GPU_texture_has_integer_format	2023-02-25 11:39:53 +01:00
Clément Foucault	7974557f0b	GPUTexture: Add header documentation and reorganize sections	2023-02-25 11:39:53 +01:00
Bastien Montagne	4cd00b1a71	Merge branch 'blender-v3.5-release'	2023-02-23 10:44:54 +01:00
Jeroen Bakker	2c391f8877	GPU: Patch GPencil shader for metal support. The stoke shader of grease pencil uses a geometry shader stage. Apple devices don't support shaders with geometry shader stage. In the OpenGL driver there was a pass-through implemented so it didn't fail. When using the metal backend this needs to be solved more explicitly. This change patches the grease pencil shader to support both the backends supporting a geometry stage and those without. Fixes #105059 Pull Request #105116	2023-02-23 08:26:01 +01:00
Jacques Lucke	775b51a847	Merge branch 'blender-v3.5-release'	2023-02-22 17:38:59 +01:00
Jason Fielder	629fe69c4c	Fix #104918 : EEVEE: Resolve cache warming assertion. Erroneous cache warming case where the generated material is identical to default material and cached shader is re-used, resulting in case where the parent shader is identical to the source. Authored by Apple: Michael Parkin-White Ref #96261	2023-02-22 16:37:00 +01:00
Jeroen Bakker	7fb1f060ff	Vulkan: Initial Compute Shaders support This patch adds initial support for compute shaders to the vulkan backend. As the development is oriented to the test- cases we have the implementation is limited to what is used there. It has been validated that with this patch that the following test cases are running as expected - `GPUVulkanTest.gpu_shader_compute_vbo` - `GPUVulkanTest.gpu_shader_compute_ibo` - `GPUVulkanTest.gpu_shader_compute_ssbo` - `GPUVulkanTest.gpu_storage_buffer_create_update_read` - `GPUVulkanTest.gpu_shader_compute_2d` This patch includes: - Allocating VkBuffer on device. - Uploading data from CPU to VkBuffer. - Binding VkBuffer as SSBO to a compute shader. - Execute compute shader and altering VkBuffer. - Download the VkBuffer to CPU ram. - Validate that it worked. - Use device only vertex buffer as SSBO - Use device only index buffer as SSBO - Use device only image buffers GHOST API has been changed as the original design was created before we even had support for compute shaders in blender. The function `GHOST_getVulkanBackbuffer` has been separated to retrieve the command buffer without a backbuffer (`GHOST_getVulkanCommandBuffer`). In order to do correct command buffer processing we needed access to the queue owned by GHOST. This is returned as part of the `GHOST_getVulkanHandles` function. Open topics (not considered part of this patch) - Memory barriers & command buffer encoding - Indirect compute dispatching - Rest of the test cases - Data conversions when requested data format is different than on device. - GPUVulkanTest.gpu_shader_compute_1d is supported on AMD devices. NVIDIA doesn't seem to support 1d textures. Pull-request: #104518	2023-02-21 15:04:52 +01:00
Julian Eisel	c437a8aea8	Revert release branch only commit after merge This is a revert of a revert, because the initial revert is only supposed to be in the release branch. This reverts commit `3eed00dc54`.	2023-02-20 11:51:16 +01:00
Julian Eisel	3eed00dc54	Revert "GPencil: Include UV information in simplify->sample modifier." This reverts commit `19222627c6`. Something went wrong here, seems like this commit merged the main branch into the release branch, which should never be done.	2023-02-20 11:20:07 +01:00
YimingWu	19222627c6	GPencil: Include UV information in simplify->sample modifier. Simplify modifier sample mode didn't transfer UV parameters, now fixed. Pull Request #104942	2023-02-19 11:45:22 +01:00
Jason Fielder	7481a36d51	EEVEE: Remove unnecessary material optimization assertion. Fix unreported assert in basic scenes. Authored by Apple: Michael Parkin-White Pull Request #104775	2023-02-15 11:27:25 +01:00
Campbell Barton	8d35b28f2a	Cleanup: spelling in comments	2023-02-15 13:11:14 +11:00
Jason Fielder	7b9d1cb51f	Eevee: GPU Material node graph optimization. Certain material node graphs can be very expensive to run. This feature aims to produce secondary GPUPass shaders within a GPUMaterial which provide optimal runtime performance. Such optimizations include baking constant data into the shader source directly, allowing the compiler to propogate constants and perform aggressive optimization upfront. As optimizations can result in reduction of shader editor and animation interactivity, optimized pass generation and compilation is deferred until all outstanding compilations have completed. Optimization is also delayed util a material has remained unmodified for a set period of time, to reduce excessive compilation. The original variant of the material shader is kept to maintain interactivity. Also adding a new concept to gpu::Shader allowing assignment of a parent shader from which a shader can pull PSO descriptors and any required metadata for asynchronous shader cache warming. This enables fully asynchronous shader optimization, without runtime hitching, while also reducing runtime hitching for standard materials, by using PSO descriptors from default materials, ahead of rendering. Further shader graph optimizations are likely also possible with this architecture. Certain scenes, such as Wanderer benefit significantly. Viewport performance for this scene is 2-3x faster on Apple-silicon based GPUs. Authored by Apple: Michael Parkin-White Ref T96261 Pull Request #104536	2023-02-14 21:51:03 +01:00
Campbell Barton	1ac80e8338	Cleanup: quiet unreachable-code warning, use ARRAY_SIZE macro	2023-02-14 11:50:00 +11:00
Clément Foucault	0d9fbfe7fe	GPUShader: Fix compilation caused by designated initializers in C++	2023-02-13 12:49:22 +01:00
Clément Foucault	dd171f7743	Cleanup: GPUShader: Rename `GPU_shader_uniform_vector` Rename to `GPU_shader_uniform_float/int_ex` to make more sense as a general purpose function.	2023-02-13 11:22:38 +01:00
Clément Foucault	b68bac7ced	Cleanup: GPUShader: Remove `GPU_shader_uniform_int/float` Simplify the API, leaving only one function to set uniform without the uniform name.	2023-02-13 11:22:38 +01:00

1 2 3 4 5 ...

2998 Commits