test2

Author	SHA1	Message	Date
Jeroen Bakker	3885a37541	Vulkan: Initial OpenXR support The Blender's VkInstance cannot be shared with OpenXR VkInstance. The reason is a chicken and egg problem where OpenXR needs to be started before Vulkan. OpenXR can add special vulkan specific requirements (instance&device) that are only available when the user starts an OpenXR session. The goal implementation is to share memory between both instances using [VK_KHR_external_memory](https://registry.khronos.org/vulkan/specs/latest/man/html/VK_KHR_external_memory.html) and related extensions. However this seems to be a bridge to far as a initial step. Reason: There are not that many samples/ guides and documentation to be found to handle the workflow that we require. We want to do a smaller step by step approach to gain the needed knowledge. For that reason this PR does the most stupidest thing that can be done to share memory between instances. Download the render result to CPU RAM share the host pointer with the OpenXR instance which copies it to the swap chain. Also the synchronization is done using wait idle commands. <video src="attachments/32a0d69b-c3fa-4272-aea0-d207609afaaf" title="Screencast From 2025-03-18 11-16-17.webm" controls></video> Gaining knowledge - Experiment with `VK_KHR_external_memory_host` extension for uploading vertex buffers (not related to OpenXR). - Import host pointer with `VK_KHR_external_memory_host`. This reduces the additional memcpy on OpenXR side. - Export host pointer from Blender side from a mappable buffer. - Replace host pointers with fd/dmabuf/winhandle - Remove mappable buffer. Ref #133718 Pull Request: https://projects.blender.org/blender/blender/pulls/133824	2025-03-27 16:57:51 +01:00
Jeroen Bakker	d5bef6cb01	Cleanup: Remove unused code	2025-03-27 14:09:15 +01:00
Campbell Barton	42ad772a1f	Cleanup: spelling & repeated terms (make check_spelling_*) Also use comment blocks for English text.	2025-03-27 01:13:34 +00:00
Jeroen Bakker	3c13d14e83	Cleanup: Remove incorrect CPP attribute Parameter was tagged to be deprecated, but in fact it is not.	2025-03-25 12:47:28 +01:00
Jeroen Bakker	409ce2b976	Vulkan: Swapchain synchronization This PR adds swapchain synchronization. When the swapchain swaps the buffers it can add a wait semaphore/signal semaphore to support GPU based synchronization 10 times playback of `rain_restaurant.blend` on AMD RX 7700 Before: 10 × Animation playback: 72347.5540 ms, average: 7234.75539684 ms After: 10 × Animation playback: 41523.2441 ms, average: 4152.32441425 ms Getting around the OpenGL performance target. Pull Request: https://projects.blender.org/blender/blender/pulls/136259	2025-03-24 10:28:52 +01:00
Jeroen Bakker	a92981e77b	Refactor: Vulkan: Move render graph submission into device_submission.cc Pull Request: https://projects.blender.org/blender/blender/pulls/136257	2025-03-20 15:55:30 +01:00
Jeroen Bakker	4429cc7e84	Fix: Vulkan: Incorrect framebuffer selection When swap chain is updated the logic could select an incorrect framebuffer. This isn't actually the case during normal usage, but has been detected during the development of OpenXR support. Here it did matter. Pull Request: https://projects.blender.org/blender/blender/pulls/136115	2025-03-18 11:49:52 +01:00
Jeroen Bakker	5a3fd4522c	Fix #135929 : Vulkan: Add support for line loops in immediate rendering Currently only implemented for immediate mode. When used it copies the first vertex to the last vertex to complete the loop. Pull Request: https://projects.blender.org/blender/blender/pulls/136083	2025-03-17 15:32:49 +01:00
Jeroen Bakker	c4feddefd7	Refactor: Vulkan: Split VKWorkarounds VKWorkarounds adds double negation. This PR splits the struct into workarounds and extensions to reduce confusing code. Pull Request: https://projects.blender.org/blender/blender/pulls/136064	2025-03-17 09:06:47 +01:00
Jeroen Bakker	7857d9e3bf	Fix: Vulkan: Std430 push constant packing When using vec3[] as push constants it selected the incorrect branch resulting in uploading incorrect data to the shader. This resulted in not seeing the clipping bounds in vulkan. Ref: #131111	2025-03-13 16:21:28 +01:00
Jeroen Bakker	330583961a	Fix: Vulkan: Incorrect background blending `GPU_BLEND_BACKGROUND` set incorrect blend mode, resulting in incorrect rendering when activating bordered rendering. Ref: #131111	2025-03-13 16:21:28 +01:00
Jeroen Bakker	15d88e544a	GPU: Storage buffer allocation alignment Since the introduction of storage buffers in Blender, the calling code has been responsible for ensuring the buffer meets allocation requirements. All backends require the allocation size to be divisible by 16 bytes. Until now, this was sufficient, but with GPU subdivision changes, an external library must also adhere to these requirements. For OpenSubdiv (OSD), some buffers are not 16-byte aligned, leading to potential misallocation. Currently, this is mitigated by allocating a few extra bytes, but this approach has the drawback of potentially reading unintended bytes beyond the source buffer. This PR adopts a similar approach to vertex buffers: the backend handles extra byte allocation while ensuring data uploads and downloads function correctly without requiring those additional bytes. No changes were needed for Metal, as its allocation size is already aligned to 256 bytes. Alternative solutions considered: - Copying the CPU buffer to a larger buffer when needed (performance impact). - Modifying OSD buffers to allocate extra space (requires changes to an external library). - Implementing GPU_storagebuf_update_sub. Ref #135873 Pull Request: https://projects.blender.org/blender/blender/pulls/135716	2025-03-13 15:05:16 +01:00
Jeroen Bakker	e1d2eee02b	Cleanup: Vulkan: Remove unused variable	2025-03-13 13:31:13 +01:00
Jeroen Bakker	1ea1f4c92c	Refactor: GHOST/Vulkan: Wrap handles in a struct Vulkan handles are currently only requested once. In the future OpenXR also needs acces to these handles and additional handles will be needed when introducing copy queues and async compute. This PR will collect the handles in a struct to ensure we don't need to alter the GHOST interface for every change. Pull Request: https://projects.blender.org/blender/blender/pulls/135905	2025-03-13 11:06:20 +01:00
Jeroen Bakker	cdc37b2235	GPU: Add support for GPU_vertbuf_update_sub `GPU_vertbuf_update_sub` is used by GPU based subdivision to integrate quads, triangles and edges. This is just an implementation to make it work as we are planning bigger changes to improve performance of uploading data to the GPU. Pull Request: https://projects.blender.org/blender/blender/pulls/135774	2025-03-11 10:14:00 +01:00
Jeroen Bakker	ba22e5e6be	Merge branch 'blender-v4.4-release'	2025-03-10 08:49:37 +01:00
Jeroen Bakker	eceb81b21f	GPU: Remove RDNA2 shader viewport workaround It has been confirmed that the latest release of AMD drivers has fixed issues for both OpenGL and Vulkan. Users should use AMD driver 25.3.1 or later. Removing the workaround as it has performance penalties on RDNA2 based GPUs. Reference: #135516 Pull Request: https://projects.blender.org/blender/blender/pulls/135630	2025-03-10 07:22:02 +01:00
Jeroen Bakker	be4f9c0ac8	Merge branch 'blender-v4.4-release'	2025-03-06 16:30:16 +01:00
Jeroen Bakker	37d781aa2a	Fix #135516 : Vulkan: Shader output viewport broken on RDNA2 When using the official RDNA2 driver +vulkan we see the same issue we as #123787. Adding the same workaround to vulkan as well. Pull Request: https://projects.blender.org/blender/blender/pulls/135565	2025-03-06 16:28:47 +01:00
Brecht Van Lommel	3dab100860	Fix: ASAN errors after addition of texture pool Same fix as #132504. Free the texture pool before the derived GPU context class, as that one is used as part of freeing the texture pool. Pull Request: https://projects.blender.org/blender/blender/pulls/135444	2025-03-04 16:54:05 +01:00
Bastien Montagne	318ae49f1e	Cleanup: Remove `void *` handling from `MEM_freen<T>`. Followup to `48e26c3afe`, and discussions in !134771 about keeping 'C-style' and 'C++ template type-safe style' implementations of our guardedalloc separated. And it makes `MEM_freeN<T>` code simpler. Also skip type-checking in `MEM_freeN<T>` only with MSVC, as clang-cl on windows-arm64 does work fine with DNA structs using `DNA_DEFINE_CXX_METHODS`. Pull Request: https://projects.blender.org/blender/blender/pulls/134861	2025-02-20 16:42:22 +01:00
Jeroen Bakker	f89a075015	Merge branch 'blender-v4.4-release'	2025-02-17 08:58:44 +01:00
Jeroen Bakker	0faba244a5	Fix: Vulkan: Async readback of storage buffers The vulkan backend was implemented with async in mind, however the one place where Blender uses for async was implemented blocking. This PR splits the readback into flushing the command and waiting for readback. Performance Improvement of animation playback performance of shader balls.blend is around 10%. Shader balls.blend frame: 1-100, 10 x animation playback \| Branch \| Total time \| Average time \| \| -------------------- \| ---------- \| ------------ \| \| blender-v4.4-release \| 26851 ms \| 2685 ms \| \| This PR \| 23675 ms \| 2367 ms \| Pull Request: https://projects.blender.org/blender/blender/pulls/134227	2025-02-17 08:58:06 +01:00
Clément Foucault	86b70143d5	Cleanup: GPU: Remove unused Transform Feedback implementation Most of the cleanup is inside the metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/134349	2025-02-10 17:30:42 +01:00
Campbell Barton	9154b5d14a	Cleanup: correct misleading variable name Don't mix up the "patch" version and the "subversion".	2025-02-06 10:12:39 +11:00
Jeroen Bakker	3d20d39115	Cleanup: Vulkan: Use `is_link_to_buffer` Previous implementation used the resource state tracker which is a hash table lookup. `is_link_to_buffer` is a bit cheaper as it is compares already loaded data.	2025-02-04 16:28:46 +01:00
Jeroen Bakker	aa535f1a5f	Cleanup: Vulkan: Remove resource locking when reordering nodes This PR changes the resource locking when reordering render graph nodes. Reordering could be done without locking resources. No measurable speedup detected. Pull Request: https://projects.blender.org/blender/blender/pulls/134032	2025-02-04 13:24:01 +01:00
Jeroen Bakker	7f04a4fef3	Fix: Vulkan: Stalling shader compilation This PR fixes an issue that shaders compilation could stall. This could be seen in the viewport (sometime not showing first EEVEE render) but was more prominent when running test cases. Pull Request: https://projects.blender.org/blender/blender/pulls/134020	2025-02-04 09:49:28 +01:00
Jeroen Bakker	27b9173081	Cleanup: Code-style Remove commented out parameter-name from header.	2025-02-03 08:06:10 +01:00
Brecht Van Lommel	c7502b092d	Cleanup: Various clang-tidy warnings in gpu Pull Request: https://projects.blender.org/blender/blender/pulls/133734	2025-01-31 17:03:18 +01:00
Jeroen Bakker	96c9153c5e	Fix: Vulkan: Use after free of VkBufferView VkBufferViews could be used after they were freed. The reason is that they were not managed by the discard pool. Detected when looking in failing render tests (pointcloud_motion.blend). This part of the API is used by motion blur in EEVEE. Fixes the next render tests - `eevee_next_motion_blur_vulkan` - `eevee_next_pointcloud_vulkan` - `eevee_next_hair_vulkan` Related: #133546 Pull Request: https://projects.blender.org/blender/blender/pulls/133856	2025-01-31 11:41:39 +01:00
Jeroen Bakker	4dbb9b34c6	Fix: Vulkan: Compositor cryptomatte When using cryptomatte the last identifier was never used due to a memory alignment issue. Scalar types should not be aligned, but they were. Pull Request: https://projects.blender.org/blender/blender/pulls/133815	2025-01-30 15:51:09 +01:00
Jeroen Bakker	4b99bc8515	Fix: Renderdoc: Corruption in debug stack In renderdoc the debug stack got corrupted when render graphs where reused. The previous usage didn't clear the stack. This PR clears the debug stack when render graphs are reset.	2025-01-30 13:46:21 +01:00
Campbell Barton	bd1ded952b	Cleanup: spelling in comments	2025-01-29 12:31:19 +11:00
Jeroen Bakker	3eaa70c251	Fix #133690 : Vulkan: Faster downloading of textures. Somehow incorrect memory is selected when not setting the host write random on a buffer that is only read. Pull Request: https://projects.blender.org/blender/blender/pulls/133721	2025-01-28 16:41:43 +01:00
Jeroen Bakker	2d3d1d249b	Cleanup: Remove compilation warnings GCC13 Pull Request: https://projects.blender.org/blender/blender/pulls/133702	2025-01-28 11:50:59 +01:00
Jeroen Bakker	e6b3cc8983	Vulkan: Device command builder This PR implements a new the threading model for building render graphs based on tests performed last month. For out workload multithreaded command building will block in the driver or device. So better to use a single thread for command building. Details of the internal working is documented at https://developer.blender.org/docs/features/gpu/vulkan/render_graph/ - When a context is activated on a thread the context asks for a render graph it can use by calling `VKDevice::render_graph_new`. - Parts of the GPU backend that requires GPU commands will add a specific render graph node to the render graph. The nodes also contains a reference to all resources it needs including the access it needs and the image layout. - When the context is flushed the render graph is submitted to the device by calling `VKDevice::render_graph_submit`. - The device puts the render graph in `VKDevice::submission_pool`. - There is a single background thread that gets the next render graph to send to the GPU (`VKDevice::submission_runner`). - Reorder the commands of the render graph to comply with Vulkan specific command order rules and reducing possible bottlenecks. (`VKScheduler`) - Generate the required barriers `VKCommandBuilder::groups_extract_barriers`. This is a separate step to reduce resource locking giving other threads access to the resource states when they are building the render graph nodes. - GPU commands and pipeline barriers are recorded to a VkCommandBuffer. (`VKCommandBuilder::record_commands`) - When completed the command buffer can be submitted to the device queue. `vkQueueSubmit` - Render graphs that have been submitted can be reused by a next thread. This is done by pushing the render graph to the `VKDevice::unused_render_graphs` queue. Pull Request: https://projects.blender.org/blender/blender/pulls/132681	2025-01-27 08:55:23 +01:00
Jeroen Bakker	e2dddea124	Fix: Vulkan: Thread safe cache folder Vulkan shader compiler accesses the cache folder via multiple threads. GHOST part isn't thread safe and can return and overwrite the returned cache path. This resulted into crashes when performing background rendering and failing test cases, loading of incorrect shaders etc. This PR fixes this to cache the cache folder location in the VKShaderCompiler, which is loaded via the main thread when the vulkan backend is initialized. Pull Request: https://projects.blender.org/blender/blender/pulls/133535	2025-01-24 12:20:43 +01:00
Jeroen Bakker	2feb435780	Fix: Vulkan: Memory allocation on no-rebar capable platforms Memory areas was requested to be preferable host visible. On some platforms this would fail to allocate. Best is to not add preferable host visible for typically large allocations. This PR also gives the caller the responsibility to set the allocation flags. Pull Request: https://projects.blender.org/blender/blender/pulls/133528	2025-01-24 11:54:59 +01:00
Miguel Pozo	8d392d41c2	Fix: GPU: GPU_indexbuf_bind_as_ssbo Make the behavior consistent across all backends. Clean up the GL function. Follow-up from #132712. Pull Request: https://projects.blender.org/blender/blender/pulls/133383	2025-01-23 15:40:45 +01:00
Jeroen Bakker	2bd4e101a0	Fix #130106 : Vulkan: Pixelbuffer performance Cycles uses pixel buffers to update the display. Due to making things work the vulkan backend downloaded the GPU allocated pixel buffer to the CPU, Copied it to a GPU allocated staging buffer and update the display texture using the staging buffer. Needless to say that a (CPU->)GPU->CPU->GPU roundtrip is a bottleneck. This PR fixes this by allowing the pixel buffer to act as a staging buffer as well. Viewport and final image rendering performance is now also similar. \| Render \| GPU Backend \| Path tracing \| Display \| \| ---------- \| --------------- \| ---------------- \| ----------- \| \| Viewport \| OpenGL \| 2.7 \| 0.06 \| \| Viewport \| Vulkan \| 2.7 \| 0.04 \| \| Image \| OpenGL \| 3.9 \| 0.02 \| \| Image \| Vulkan \| 3.9 \| 0.02 \| Tested on: ``` Operating system: Linux-6.8.0-49-generic-x86_64-with-glibc2.39 64 Bits, X11 UI Graphics card: AMD Radeon Pro W7700 (RADV NAVI32) Advanced Micro Devices radv Mesa 24.3.1 - kisak-mesa PPA Vulkan Backend ``` Pull Request: https://projects.blender.org/blender/blender/pulls/133485	2025-01-23 14:58:49 +01:00
Jeroen Bakker	ff804882bd	Refactor: Vulkan: Store large data in separate vectors VKRenderGraphNode is 892 bytes and most of the bytes are used for specific nodes. By storing large structs in separate vectors we can reduce the needed memory and improve cache pre-fetching. With this change the VKRenderGraphNode is reduced to 64 bytes. On a (50 frames shader_balls.blend) the end user performance is improved by 2%. \| Platform \| Before \| After \| \| ---------------- \| ---------- \| --------- \| \| AMD W7700 \| 1409 ms \| 1383 ms \| \| NVIDIA RTX 6000 \| 1443 ms \| 1428 ms \| Pull Request: https://projects.blender.org/blender/blender/pulls/133317	2025-01-21 14:49:23 +01:00
Jeroen Bakker	781aeb1b3f	Fix #133155 : Vulkan: Initial uniform buffer upload failing In some cases the initial uniform buffer upload fails. Using the indirect upload (via render graph) does succeed. This PR uses the render graph approach. Pull Request: https://projects.blender.org/blender/blender/pulls/133293	2025-01-20 11:00:27 +01:00
Campbell Barton	90b03d2344	Cleanup: spelling in comments	2025-01-20 11:19:23 +11:00
Jeroen Bakker	390ca01685	Cleanup: Vulkan: Remove resource ownership Images used to be tracked with ownership in order to reset swap chain images to its original layout. This isn't used anymore as we always mark them in VK_IMAGE_LAYOUT_UNDEFINED to make the first pipeline barrier a nop. This change reduces unneeded complexity and safe a few CPU cycles. Pull Request: https://projects.blender.org/blender/blender/pulls/133197	2025-01-17 14:46:22 +01:00
Jeroen Bakker	2f18e4fe29	Vulkan: Add debug group for swapchain Improves debugging swapchains when using renderdoc. Pull Request: https://projects.blender.org/blender/blender/pulls/133190	2025-01-17 11:40:11 +01:00
Jeroen Bakker	56f14a0083	Vulkan: Add support for GPU_DATA_UBYTE to F16 data conversion This data conversion is needed to download a HDR framebuffer for color-picking and saving screenshots. Pull Request: https://projects.blender.org/blender/blender/pulls/133187	2025-01-17 10:58:28 +01:00
Jeroen Bakker	80ec04b4ef	Cleanup: Vulkan: Use full surface format GHOST_ContextVK used to pass only the surface texture format to the GPU backend, it didn't pass the color space. This PR also includes the color space. Pull Request: https://projects.blender.org/blender/blender/pulls/133185	2025-01-17 10:28:22 +01:00
Jeff Moguillansky	ea4d01923b	Fix: Vulkan: local_read incorrect attachment load/store ops This fixes a rendering issue when local read enabled. Before fix, the output image is too bright. This is due to incorrect load/store. With this fix, the logic for attachment load/store ops with local_read on matches the logic with local_read off inside subpass_transition_impl... Pull Request: https://projects.blender.org/blender/blender/pulls/133111	2025-01-16 08:21:14 +01:00
Campbell Barton	a29fe64e53	Cleanup: quiet compiler warning	2025-01-15 17:29:31 +11:00

1 2 3 4 5 ...

605 Commits