griefith/test

Author	SHA1	Message	Date
Omar Emara	a4502f82c1	Compositor: Add Image Coordinates node This patch adds a new Image Coordinates node for the compositor. The Texture Coordinates and Pixel Coordinates outputs were removed from the Image Info node and were added to the new node instead. Additionally, a new Normalized Coordinates output was added. The Pixel Coordinates output now no longer includes half pixel offsets. Pull Request: https://projects.blender.org/blender/blender/pulls/138935	2025-05-26 08:25:06 +02:00
Miguel Pozo	e6638d6e5e	Refactor: GPU: GPUMaterial & GPUPass compilation Cleanup and simplification of GPUMaterial and GPUPass compilation. See #133674 for details/goals. - Remove the `draw_manage_shader` thread. Deferred compilation is now handled by the gpu::ShaderCompiler through the batch compilation API. Batch management is handled by the `GPUPassCache`. - Simplify `GPUMaterial` status tracking so it just queries the `GPUPass` status. - Split the `GPUPass` and the `GPUCodegen` code. - Replaced the (broken) `GPU_material_recalc_flag_get` with the new `GPU_pass_compilation_timestamp`. - Add the `GPU_pass_cache_wait_for_all` and `GPU_shader_batch_wait_for_all`, and remove the busy waits from EEVEE. - Remove many unused functions, properties, includes... Pull Request: https://projects.blender.org/blender/blender/pulls/135637	2025-05-22 17:53:22 +02:00
Clément Foucault	57d9c2c098	GPU: Python: Temporarily add back non-4bytes aligned formats Add back the deprecated format to avoid asserts. GPU backends should still have the code to support them. Also add deprecation warnings as these types will be removed for 5.0. Pull Request: https://projects.blender.org/blender/blender/pulls/139213	2025-05-22 12:14:52 +02:00
Clément Foucault	8768e8553d	Fix: GPU: Wrong colorspace on context swap It can happen that the previous context drew with a different colorspace. In the case where the new context is drawing with the same shader that was previously bound (shader binding optimization), the uniform would not be set again because the dirty flag would not have been set (since the color space of this new context never changed). The shader would reuse the same colorspace as the previous context framebuffer (see #137855). Fix #137855 Pull Request: https://projects.blender.org/blender/blender/pulls/139226	2025-05-22 11:54:29 +02:00
Clément Foucault	decf99106a	GPU: Add shader and pass name inside debug groups This allows to know which shader specialization and compilation are affecting performance.	2025-05-20 14:44:23 +02:00
Clément Foucault	d3053fead0	GPU: Assert that a shader is not bound when deleting it This caused UB in the tests now that tests are all ran inside the same context. A shader could be free but its pointer would be dangling inside the `Context`. A new shader could have the same address and generate UB after binding. This is not the best way to solve this issue but at least we prevent the use of the UB. Pull Request: https://projects.blender.org/blender/blender/pulls/139109	2025-05-20 12:43:58 +02:00
Clément Foucault	de8f13d1d6	Fix: GPU: Assert caused by specialization constant with primitive expansion This path was not passing the the specialization constant state to `GPU_shader_bind`. Fix #139132	2025-05-20 11:23:42 +02:00
Clément Foucault	caac241c84	GPU: Make Shader Specialization Constant API Thread Safe This allows multiple threads to request different specializations without locking usage of all specialized shaders program when a new specialization is being compiled. The specialization constants are bundled in a structure that is being passed to the `Shader::bind()` method. The structure is owned by the calling thread and only used by the `Shader::bind()`. Only querying for the specialized shader (Map lookup) is locking the shader usage. The variant compilation is now also locking and ensured that multiple thread trying to compile the same variant will never result in race condition. Note that this removes the `is_dirty` optimization. This can be added back if this becomes a bottleneck in the future. Otherwise, the performance impact is not noticeable. Pull Request: https://projects.blender.org/blender/blender/pulls/136991	2025-05-19 17:42:55 +02:00
Clément Foucault	ca88983af2	EEVEE: Reverse-Z implementation This feature greatly increase depth buffer precision. This is very noticeable in large view distance scenes. This is enabled by default on GPUs that supports it (most of the hardware we support already supports this). This makes rendering different on the GPUs that do not support that feature (`glClipControl`). While this give much better depth precision as before, we also have a lot of imprecision caused by our vertex transformations. This can be improved in another task. Pull Request: https://projects.blender.org/blender/blender/pulls/138898	2025-05-19 16:29:26 +02:00
Campbell Barton	b3dfde88f3	Cleanup: spelling in comments (check_spelling_* target) Also uppercase acronyms: API, UTF & ASCII.	2025-05-17 10:17:37 +10:00
Campbell Barton	ec7691e35c	Cleanup: warnings with GCC mixing enums/ints & missing break An unexpected GPUVertAttr::fetch_mode could have resulted in an GPU_COMP_U16 being treated as a I32. Break & assert in this case.	2025-05-17 08:58:47 +10:00
Clément Foucault	8ac5940e33	GPU: Add GL_ARB_clip_control support to the GL backend This adds support for the extension and always set the clip state value to 0..1 to align with vulkan and metal. Moreover this is needed for the Reverse Z implementation. Note that this is a OpenGL 4.5 feature and is not required to start Blender. So there must still be a fallback path for now. Rel #138898 Pull Request: https://projects.blender.org/blender/blender/pulls/138941	2025-05-16 13:53:36 +02:00
Hans Goudey	91627b3d47	GPU: Remove int float fetch mode combination This commit finishes removing the uses of the integer to float vertex buffer fetch mode. Previous commits noted below already started that process. The last usage was geometry attributes. Now integers are converted to floats as part of the existing upload process. The change makes the Vulkan vertex buffer type conversion unused, so it's removed. That's nice because Vulkan vertex buffers go from 1040 to 568 bytes in size and have significantly less overhead on creation. Related: - `153abc372e` - `1e1ac2bb9b` - `617858e453` Pull Request: https://projects.blender.org/blender/blender/pulls/138873	2025-05-15 15:29:12 +02:00
Hans Goudey	153abc372e	GPU: Replace U32/I32 combination with GPU_FETCH_INT_TO_FLOAT in UI code The conversion from int to float is not supported natively so it ends up happening beforehand on the CPU or as a step before the vertex buffer can be used. It's better to just upload floats in the first place. Related to: - `1e1ac2bb9b` - `617858e453` Pull Request: https://projects.blender.org/blender/blender/pulls/138855	2025-05-14 15:15:21 +02:00
Clément Foucault	617858e453	GPU: Unified DataFormat enum This unifies vertex and texture data formats into a single base enum class. `TextureFormat` and `VertexFormat` then mask the invalid format for their respective usage. Having a base enum allows casting between `TextureFormat` and `VertexFormat` possible (needed for Buffer Textures). It also makes it easier to write and read data to buffers/textures as each format will have an associated host type. These enum is generated from MACRO expansion. This allow to centralize all information about the formats in one place. This avoid duplicating the list of enums for each backend. This only creates the new enum. Porting older enums will be done in other PRs. Normalized integer CPU format are missing and waiting for #130640 Rel #130632 Pull Request: https://projects.blender.org/blender/blender/pulls/138069	2025-05-13 17:08:32 +02:00
Miguel Pozo	c16aba915f	GPU: Add GPU_shader_batch_cancel Fix the recently implemented ShaderCompiler::batch_cancel. Expose it with GPU_shader_batch_cancel and GPU_shader_specialization_batch_cancel. Use them in the EEVEE ShaderModule destructor, to prevent blocking on destruction when there are in-flight compilations. Pull Request: https://projects.blender.org/blender/blender/pulls/138774	2025-05-12 19:54:03 +02:00
Sergey Sharybin	7ceb4495c5	Refactor: OpenColorIO integration Briefly about this change: - OpenColorIO C-API is removed. - The information about color spaces in ImBuf module is removed. It was stored in global ListBase in colormanagement.cc. - Both OpenColorIO and fallback implementation supports GPU drawing. - Fallback implementation supports white point, RGB curves, etc. - Removed check for support of GPU drawing in IMB. Historically it was implemented in a separate library with C-API, this is because way back C++ code needed to stay in intern. This causes all sort of overheads, and even calls that are strictly considered bad level. This change moves OpenColorIO integration into a module within imbuf, next to movie, and next to IMB_colormanagement which is the main user of it. This allows to avoid copy of color spaces, displays, views etc in the ImBuf: they were used to help quickly querying information to be shown on the interface. With this change it can be stored in the same data structures as what is used by the OpenColorIO integration. While it might not be fully avoiding duplication it is now less, and there is no need in the user code to maintain the copies. In a lot of cases this change also avoids allocations done per access to the OpenColorIO. For example, it is not needed anymore to allocate image descriptor in a heap. The bigger user-visible change is that the fallback implementation now supports GLSL drawing, with the whole list of supported features, such as curve mapping and white point. This should help simplifying code which relies on color space conversion on GPU: there is no need to figure out fallback solution in such cases. The only case when drawing will not work is when there is some actual bug, or driver issue, and shader has failed to compile. The change avoids having an opaque type for color space, and instead uses forward declaration. It is a bit verbose on declaration, but helps avoiding unsafe type-casts. There are ways to solve this in the future, like having a header for forward declaration, or to flatten the name space a bit. There should be no user-level changes under normal operation. When building without OpenColorIO or the configuration has a typo or is missing a fuller set of color management tools is applies (such as the white point correction). Pull Request: https://projects.blender.org/blender/blender/pulls/138433	2025-05-09 14:01:43 +02:00
Miguel Pozo	992e7c95a7	GPU: Converge ShaderCompiler implementations Part of #136993. Share as much of the ShaderCompiler implementations as possible. Remove the ShaderCompiler/ShaderCompilerGeneric split and make most of its functions non virtual. Move the `get_compiler` function from `Context` to `GPUBackend` and creation/deletion to `GPUBackend::init/delete_resources`. Add a `batch_cancel` function to `ShaderCompiler` (needed for the GPUPass refactor). As a nice extra, the multithreaded OpenGL compilation has become faster too. The barbershop materials + EEVEE static shaders have gone from 27s to 22s. I have not observed any performance difference on Vulkan or Metal. Pull Request: https://projects.blender.org/blender/blender/pulls/136676	2025-05-08 18:16:47 +02:00
Clément Foucault	58d32b482b	Fix #138354 : GPU: Race condition with SRGB builtin uniform Multiple threads would be setting the globals `g_shader_builtin_srgb_transform` and `g_shader_builtin_srgb_is_dirty`. These are use for color management inside the builtin shaders. But the render thread could modify these values even if its shader have no use of these. The fix is to move these globals to the `gpu::Context` class. This way we remove the race condition.	2025-05-07 16:43:42 +02:00
Jacques Lucke	b7a1325c3c	BLI: use blender::Mutex by default which wraps tbb::mutex This patch adds a new `BLI_mutex.hh` header which adds `blender::Mutex` as alias for either `tbb::mutex` or `std::mutex` depending on whether TBB is enabled. Description copied from the patch: ``` /** * blender::Mutex should be used as the default mutex in Blender. It implements a subset of the API * of std::mutex but has overall better guaranteed properties. It can be used with RAII helpers * like std::lock_guard. However, it is not compatible with e.g. std::condition_variable. So one * still has to use std::mutex for that case. * * The mutex provided by TBB has these properties: * - It's as fast as a spin-lock in the non-contended case, i.e. when no other thread is trying to * lock the mutex at the same time. * - In the contended case, it spins a couple of times but then blocks to avoid draining system * resources by spinning for a long time. * - It's only 1 byte large, compared to e.g. 40 bytes when using the std::mutex of GCC. This makes * it more feasible to have many smaller mutexes which can improve scalability of algorithms * compared to using fewer larger mutexes. Also it just reduces "memory slop" across Blender. * - It is not a fair mutex, i.e. it's not guaranteed that a thread will ever be able to lock the * mutex when there are always more than one threads that try to lock it. In the majority of * cases, using a fair mutex just causes extra overhead without any benefit. std::mutex is not * guaranteed to be fair either. / ``` The performance benchmark suggests that the impact is negilible in almost all cases. The only benchmarks that show interesting behavior are the once testing foreach zones in Geometry Nodes. These tests are explicitly testing overhead, which I still have to reduce over time. So it's not unexpected that changing the mutex has an impact there. What's interesting is that on macos the performance improves a lot while on linux it gets worse. Since that overhead should eventually be removed almost entirely, I don't really consider that blocking. Links: Documentation of different mutex flavors in TBB: https://www.intel.com/content/www/us/en/docs/onetbb/developer-guide-api-reference/2021-12/mutex-flavors.html * Older implementation of a similar mutex by me: https://archive.blender.org/developer/differential/0016/0016711/index.html * Interesting read regarding how a mutex can be this small: https://webkit.org/blog/6161/locking-in-webkit/ Pull Request: https://projects.blender.org/blender/blender/pulls/138370	2025-05-07 04:53:16 +02:00
Sergey Sharybin	a345c3db64	GPU: Add API to access render frame buffer Currently unused, but allows areas outside of DRW to render to the color render and depth texture. The primary user of this new API will be Sequencer preview to draw HDR images. Ref #138094 Pull Request: https://projects.blender.org/blender/blender/pulls/138306	2025-05-02 10:38:28 +02:00
Sergey Sharybin	180770fe3f	Draw: Scale checkerboard with pixel size Solves the problem of checkerboard in the sequencer have much smaller grid than in the 3D viewport. Ref #138094 Pull Request: https://projects.blender.org/blender/blender/pulls/138305	2025-05-02 10:36:59 +02:00
Sergey Sharybin	d568b78717	GPU: Add immediate drawing of textured rectangle New function called immRectf_with_texco(), which resides next to the other immRectf utilities. Currently used in a single place in the sequencer, but it will be used in a few other places in the future. Pull Request: https://projects.blender.org/blender/blender/pulls/138222	2025-05-01 08:40:18 +02:00
Campbell Barton	43af16a4c1	Cleanup: spelling in comments, correct comment block formatting Also use doxygen comments more consistently.	2025-05-01 11:44:33 +10:00
Hans Goudey	ec33994c62	Cleanup: Remove unused VertBuf::duplicate() function This is completely unused, not implemented for the Vulkan backend, and seems to add quite a bit of complexity to the Metal and OpenGL backends. It was added for EEVEE legacy motion blur, and the last use was removed along with EEVEE legacy. We're probably better off not maintaining it since we should avoid duplicating vertex buffer data anyway. Pull Request: https://projects.blender.org/blender/blender/pulls/138226	2025-04-30 22:35:47 +02:00
Sergey Sharybin	6ff0726ef2	Refactor: const-correctness in GPU viewport Make GPU_viewport_colorspace_set() const-crrect w.r.t view_settings. Instead of doing in-place modifications of the view_settings argument with restoring them later introduce new function for copying view settings which keeps curve mapping unchanged in the destination. Should be no functional changes. Pull Request: https://projects.blender.org/blender/blender/pulls/138189	2025-04-30 14:03:26 +02:00
Clément Foucault	d24b0e4a4e	GPU: Remove runtime parsing of GLSL source See #129009 for context. The preprocessor parses metadata and writes a header file containing an inline function that inits the `GPUSource` with the metadata. These header files are then included inside `gpu_shader_dependency.cc`. This still keep the usage of the `metadata` enums and classes to avoid pulling the whole blender module inside the preprocessor executable. This speeds-up startup time in Debug build: `gpu_shader_dependency_init` - Before : 37ms - After : 4ms I didn't measure release, but it is unlikely to be noticeable (in the order of 4ms > 1ms). Pull Request: https://projects.blender.org/blender/blender/pulls/138070	2025-04-29 15:32:36 +02:00
Jeroen Bakker	d4986be082	Fix #138088 : Anti-aliasing issues in Metal and Vulkan Metal and Vulkan don't support line smoothing. There is a workaround implemented. This workaround is only enabled when linesmooth value is larger than 1. However When using smooth lines it should also be used. This is fixed by adding a `GPU_line_smooth_get` function for getting the current line smooth state. Pull Request: https://projects.blender.org/blender/blender/pulls/138123	2025-04-29 12:32:31 +02:00
Omar Emara	ddc0f9460d	Compositor: Turn Lens Distortion options to inputs This patch turns the options of the Lens Distortion node into inputs. The Projector option was turned into an enum between two options. Reference #137223. Pull Request: https://projects.blender.org/blender/blender/pulls/138113	2025-04-29 09:54:12 +02:00
Brecht Van Lommel	b8b7f71520	Vulkan: Implement native handles for pixel buffers * Pixel buffer is always allocated with export and dedicated memory flags. * Returns an opaque file descriptor (Unix) or handle (Windows). * Native handle now includes memory size as it may be slightly bigger than the requested size. Pull Request: https://projects.blender.org/blender/blender/pulls/137363	2025-04-28 11:38:56 +02:00
Clément Foucault	59df50c326	GPU: Refactor Qualifier and ImageType This allow to use types closer to GLSL in resource declaration. These are aliased for clarity in the GPU module (i.e. `isampler2D` is shortened to `Int2D`). Rel #137446 Pull Request: https://projects.blender.org/blender/blender/pulls/137954	2025-04-24 14:38:13 +02:00
Miguel Pozo	41ae990b29	Cleanup: Remove StencilViewWorkaround and GPU_texture_view_support() Texture views should be fully supported everywhere now. Pull Request: https://projects.blender.org/blender/blender/pulls/137863	2025-04-23 20:57:00 +02:00
Brecht Van Lommel	fb2ba20b67	Refactor: Use more typed MEM_calloc<> and MEM_malloc<> Pull Request: https://projects.blender.org/blender/blender/pulls/137822	2025-04-22 11:22:18 +02:00
Brecht Van Lommel	388a21e260	Refactor: Eliminate various void pointers passed to MEM_freeN It's safer to pass a type so that it can be checked if delete should be used instead. Also changes a few void pointer casts to const_cast so that if the data becomes typed it's an error. Pull Request: https://projects.blender.org/blender/blender/pulls/137404	2025-04-21 17:59:41 +02:00
Brecht Van Lommel	637c6497e9	Refactor: Use more typed MEM_calloc<>, avoid unnecessary size_t cast Handle some cases that were missed in previous refactor. And eliminate unnecessary size_t casts as these could hide issues. Pull Request: https://projects.blender.org/blender/blender/pulls/137404	2025-04-21 17:59:41 +02:00
Campbell Barton	64f5dee6d7	Cleanup: spelling in comments (make check_spelling_*)	2025-04-17 12:06:12 +10:00
Clément Foucault	a6fa1b5969	DRW: Rename `common_*_lib.glsl` to follow module prefix	2025-04-16 20:19:09 +02:00
Clément Foucault	f2025f28e7	DRW: Move fullscreen vertex shader to gpu common This makes no sense to have in the draw namespace. Also take the opportunity for making the coordinates a float2 and rename them to something more descriptive.	2025-04-16 20:09:28 +02:00
Clément Foucault	ac7e361f3d	GPU: Expose preprocessor through API This is needed to preprocess some source inside the `intern` directory.	2025-04-16 18:50:10 +02:00
Clément Foucault	3b3a5731df	GPU: Shader: Change vector and matrix type to use blender convention This unify the C++ and GLSL codebase style. The GLSL types are still in the backend compatibility layers to support python shaders. However, the C++ shader compilation layer doesn't have them to enforce correct type usage. Note that this is going to break pretty much all PRs in flight that targets shader code. Rel #137261 Pull Request: https://projects.blender.org/blender/blender/pulls/137369	2025-04-14 13:46:41 +02:00
Clément Foucault	9990273d04	GPU: Change `Type` enum to use lower case values This is to help for future resource declaration using macros. Rel #137261 Pull Request: https://projects.blender.org/blender/blender/pulls/137367	2025-04-11 22:39:01 +02:00
Clément Foucault	890e00cecb	DRW: DebugDraw: Revamp the module and make it global - Make the module global and allow usage from anywhere. - Remove the matrix API for thread safety reason - Add lifetime management - Make display linked to the overlays for easy toggling ## Notes - Lifetime is in redraw. If there is 4 viewport redrawing, the lifetime decrement by 4 for each window redraw. This allows one viewport to be producer and another one being the consumer. - Display happens at the start of overlays. Any added visuals inside of the overlays drawing functions will be displayed the next redraw. - Redraw is not given to happen. It is only given if there is some scene / render update which tags the viewport to redraw. - Persistent lines are not reset on file load. Rel #137006 Pull Request: https://projects.blender.org/blender/blender/pulls/137106	2025-04-09 21:37:23 +02:00
Jeroen Bakker	22ae59d28d	Vulkan: Include Win32 extensions definitions Includes win32 specific extensions definitions when including `vk_common.hh`. Inside `gpu_context.cc` vulkan needs to be included before opengl, otherwise windows 10 builders will report a warning. ``` [6421/7520] Building CXX object source\blender\gpu\CMakeFiles\bf_gpu.dir\intern\gpu_context.cc.obj C:\Program Files (x86)\Windows Kits\10\include\10.0.19041.0\shared\minwindef.h(130): warning C4005: 'APIENTRY': macro redefinition C:\Users\blender\git\blender-vexp\blender.git\lib\windows_x64\epoxy\include\epoxy/gl.h(59): note: see previous definition of 'APIENTRY' ``` Pull Request: https://projects.blender.org/blender/blender/pulls/137134	2025-04-08 14:10:01 +02:00
Campbell Barton	d19362df66	Cleanup: quiet unused variable warning	2025-04-08 15:18:53 +10:00
Miguel Pozo	a5ed5dc4bf	GPU: Support deferred compilation in ShaderCompilerGeneric Update the `ShaderCompilerGeneric` to support deferred compilation using the batch compilation API, so we can get rid of `drw_manager_shader`. This approach also allows supporting non-blocking compilation for static shaders. This shouldn't cause any behavior changes at the moment, since batch compilation is not yet used when parallel compilation is disabled. This adds a `GPUWorker` and a `GPUSecondaryContext` as an easy to use wrapper for managing secondary GPU contexts. (Part of #133674) Pull Request: https://projects.blender.org/blender/blender/pulls/136518	2025-04-07 15:26:25 +02:00
Harley Acheson	68198e09f2	UI: Use Icon for Panel Header Grip On panel headers, use SVG Icon for the drag gripper. Pull Request: https://projects.blender.org/blender/blender/pulls/134471	2025-04-04 22:34:42 +02:00
Clément Foucault	3179cb0069	Cleanup: GPU: Remove unused legacy_resource_location	2025-04-04 18:21:52 +02:00
Clément Foucault	3562433ae7	pyGPU: Deprecate `Shader.program` getter This is getting in the way of making the GPUShader API more threadsafe. This getter already doesn't work for vulkan and Metal, and has very limited usage. Keeping the python function to avoid errors and display a deprecation warning. Pull Request: https://projects.blender.org/blender/blender/pulls/136983	2025-04-04 14:23:09 +02:00
Omar Emara	56b0b709ea	Compositor: Support GPU OIDN denoising This patch supports GPU OIDN denoising in the compositor. A new compositor performance option was added to allow choosing between CPU, GPU, and Auto device selection. Auto will use whatever the compositor is using for execution. The code is two folds, first, denoising code was adapted to use buffers as opposed to passing in pointers to filters directly, this is needed to support GPU devices. Second, device creation is now a bit more involved, it tries to choose the device is being used by the compositor for execution. Matching GPU devices is done by choosing the OIDN device that matches the UUID or LUID of the active GPU platform. We need both UUID and LUID because not all platforms support both. UUID is supported on all platforms except MacOS Metal, while LUID is only supported on Window and MacOS metal. If there is no active GPU device or matching is unsuccessful, we let OIDN choose the best device, which is typically the fastest. To support this case, UUID and LUID identifiers were added to the GPUPlatformGlobal and are initialized by the GPU backend if supported. OpenGL now requires GL_EXT_memory_object and GL_EXT_memory_object_win32 to support this use case, but it should function without it. Pull Request: https://projects.blender.org/blender/blender/pulls/136660	2025-04-04 11:17:08 +02:00
Clément Foucault	d767317bd0	Cleanup: GPU: Remove unused `geometry_source` in create infos	2025-04-03 16:38:21 +02:00

1 2 3 4 5 ...

3494 Commits