griefith/test

Author	SHA1	Message	Date
Jeroen Bakker	4429cc7e84	Fix: Vulkan: Incorrect framebuffer selection When swap chain is updated the logic could select an incorrect framebuffer. This isn't actually the case during normal usage, but has been detected during the development of OpenXR support. Here it did matter. Pull Request: https://projects.blender.org/blender/blender/pulls/136115	2025-03-18 11:49:52 +01:00
Jeroen Bakker	5a3fd4522c	Fix #135929 : Vulkan: Add support for line loops in immediate rendering Currently only implemented for immediate mode. When used it copies the first vertex to the last vertex to complete the loop. Pull Request: https://projects.blender.org/blender/blender/pulls/136083	2025-03-17 15:32:49 +01:00
Jeroen Bakker	c4feddefd7	Refactor: Vulkan: Split VKWorkarounds VKWorkarounds adds double negation. This PR splits the struct into workarounds and extensions to reduce confusing code. Pull Request: https://projects.blender.org/blender/blender/pulls/136064	2025-03-17 09:06:47 +01:00
Jeroen Bakker	7857d9e3bf	Fix: Vulkan: Std430 push constant packing When using vec3[] as push constants it selected the incorrect branch resulting in uploading incorrect data to the shader. This resulted in not seeing the clipping bounds in vulkan. Ref: #131111	2025-03-13 16:21:28 +01:00
Jeroen Bakker	330583961a	Fix: Vulkan: Incorrect background blending `GPU_BLEND_BACKGROUND` set incorrect blend mode, resulting in incorrect rendering when activating bordered rendering. Ref: #131111	2025-03-13 16:21:28 +01:00
Jeroen Bakker	15d88e544a	GPU: Storage buffer allocation alignment Since the introduction of storage buffers in Blender, the calling code has been responsible for ensuring the buffer meets allocation requirements. All backends require the allocation size to be divisible by 16 bytes. Until now, this was sufficient, but with GPU subdivision changes, an external library must also adhere to these requirements. For OpenSubdiv (OSD), some buffers are not 16-byte aligned, leading to potential misallocation. Currently, this is mitigated by allocating a few extra bytes, but this approach has the drawback of potentially reading unintended bytes beyond the source buffer. This PR adopts a similar approach to vertex buffers: the backend handles extra byte allocation while ensuring data uploads and downloads function correctly without requiring those additional bytes. No changes were needed for Metal, as its allocation size is already aligned to 256 bytes. Alternative solutions considered: - Copying the CPU buffer to a larger buffer when needed (performance impact). - Modifying OSD buffers to allocate extra space (requires changes to an external library). - Implementing GPU_storagebuf_update_sub. Ref #135873 Pull Request: https://projects.blender.org/blender/blender/pulls/135716	2025-03-13 15:05:16 +01:00
Jeroen Bakker	e1d2eee02b	Cleanup: Vulkan: Remove unused variable	2025-03-13 13:31:13 +01:00
Jeroen Bakker	1ea1f4c92c	Refactor: GHOST/Vulkan: Wrap handles in a struct Vulkan handles are currently only requested once. In the future OpenXR also needs acces to these handles and additional handles will be needed when introducing copy queues and async compute. This PR will collect the handles in a struct to ensure we don't need to alter the GHOST interface for every change. Pull Request: https://projects.blender.org/blender/blender/pulls/135905	2025-03-13 11:06:20 +01:00
Jeroen Bakker	cdc37b2235	GPU: Add support for GPU_vertbuf_update_sub `GPU_vertbuf_update_sub` is used by GPU based subdivision to integrate quads, triangles and edges. This is just an implementation to make it work as we are planning bigger changes to improve performance of uploading data to the GPU. Pull Request: https://projects.blender.org/blender/blender/pulls/135774	2025-03-11 10:14:00 +01:00
Jeroen Bakker	ba22e5e6be	Merge branch 'blender-v4.4-release'	2025-03-10 08:49:37 +01:00
Jeroen Bakker	eceb81b21f	GPU: Remove RDNA2 shader viewport workaround It has been confirmed that the latest release of AMD drivers has fixed issues for both OpenGL and Vulkan. Users should use AMD driver 25.3.1 or later. Removing the workaround as it has performance penalties on RDNA2 based GPUs. Reference: #135516 Pull Request: https://projects.blender.org/blender/blender/pulls/135630	2025-03-10 07:22:02 +01:00
Jeroen Bakker	be4f9c0ac8	Merge branch 'blender-v4.4-release'	2025-03-06 16:30:16 +01:00
Jeroen Bakker	37d781aa2a	Fix #135516 : Vulkan: Shader output viewport broken on RDNA2 When using the official RDNA2 driver +vulkan we see the same issue we as #123787. Adding the same workaround to vulkan as well. Pull Request: https://projects.blender.org/blender/blender/pulls/135565	2025-03-06 16:28:47 +01:00
Brecht Van Lommel	3dab100860	Fix: ASAN errors after addition of texture pool Same fix as #132504. Free the texture pool before the derived GPU context class, as that one is used as part of freeing the texture pool. Pull Request: https://projects.blender.org/blender/blender/pulls/135444	2025-03-04 16:54:05 +01:00
Bastien Montagne	318ae49f1e	Cleanup: Remove `void *` handling from `MEM_freen<T>`. Followup to `48e26c3afe`, and discussions in !134771 about keeping 'C-style' and 'C++ template type-safe style' implementations of our guardedalloc separated. And it makes `MEM_freeN<T>` code simpler. Also skip type-checking in `MEM_freeN<T>` only with MSVC, as clang-cl on windows-arm64 does work fine with DNA structs using `DNA_DEFINE_CXX_METHODS`. Pull Request: https://projects.blender.org/blender/blender/pulls/134861	2025-02-20 16:42:22 +01:00
Jeroen Bakker	f89a075015	Merge branch 'blender-v4.4-release'	2025-02-17 08:58:44 +01:00
Jeroen Bakker	0faba244a5	Fix: Vulkan: Async readback of storage buffers The vulkan backend was implemented with async in mind, however the one place where Blender uses for async was implemented blocking. This PR splits the readback into flushing the command and waiting for readback. Performance Improvement of animation playback performance of shader balls.blend is around 10%. Shader balls.blend frame: 1-100, 10 x animation playback \| Branch \| Total time \| Average time \| \| -------------------- \| ---------- \| ------------ \| \| blender-v4.4-release \| 26851 ms \| 2685 ms \| \| This PR \| 23675 ms \| 2367 ms \| Pull Request: https://projects.blender.org/blender/blender/pulls/134227	2025-02-17 08:58:06 +01:00
Clément Foucault	86b70143d5	Cleanup: GPU: Remove unused Transform Feedback implementation Most of the cleanup is inside the metal backend. Pull Request: https://projects.blender.org/blender/blender/pulls/134349	2025-02-10 17:30:42 +01:00
Campbell Barton	9154b5d14a	Cleanup: correct misleading variable name Don't mix up the "patch" version and the "subversion".	2025-02-06 10:12:39 +11:00
Jeroen Bakker	3d20d39115	Cleanup: Vulkan: Use `is_link_to_buffer` Previous implementation used the resource state tracker which is a hash table lookup. `is_link_to_buffer` is a bit cheaper as it is compares already loaded data.	2025-02-04 16:28:46 +01:00
Jeroen Bakker	aa535f1a5f	Cleanup: Vulkan: Remove resource locking when reordering nodes This PR changes the resource locking when reordering render graph nodes. Reordering could be done without locking resources. No measurable speedup detected. Pull Request: https://projects.blender.org/blender/blender/pulls/134032	2025-02-04 13:24:01 +01:00
Jeroen Bakker	7f04a4fef3	Fix: Vulkan: Stalling shader compilation This PR fixes an issue that shaders compilation could stall. This could be seen in the viewport (sometime not showing first EEVEE render) but was more prominent when running test cases. Pull Request: https://projects.blender.org/blender/blender/pulls/134020	2025-02-04 09:49:28 +01:00
Jeroen Bakker	27b9173081	Cleanup: Code-style Remove commented out parameter-name from header.	2025-02-03 08:06:10 +01:00
Brecht Van Lommel	c7502b092d	Cleanup: Various clang-tidy warnings in gpu Pull Request: https://projects.blender.org/blender/blender/pulls/133734	2025-01-31 17:03:18 +01:00
Jeroen Bakker	96c9153c5e	Fix: Vulkan: Use after free of VkBufferView VkBufferViews could be used after they were freed. The reason is that they were not managed by the discard pool. Detected when looking in failing render tests (pointcloud_motion.blend). This part of the API is used by motion blur in EEVEE. Fixes the next render tests - `eevee_next_motion_blur_vulkan` - `eevee_next_pointcloud_vulkan` - `eevee_next_hair_vulkan` Related: #133546 Pull Request: https://projects.blender.org/blender/blender/pulls/133856	2025-01-31 11:41:39 +01:00
Jeroen Bakker	4dbb9b34c6	Fix: Vulkan: Compositor cryptomatte When using cryptomatte the last identifier was never used due to a memory alignment issue. Scalar types should not be aligned, but they were. Pull Request: https://projects.blender.org/blender/blender/pulls/133815	2025-01-30 15:51:09 +01:00
Jeroen Bakker	4b99bc8515	Fix: Renderdoc: Corruption in debug stack In renderdoc the debug stack got corrupted when render graphs where reused. The previous usage didn't clear the stack. This PR clears the debug stack when render graphs are reset.	2025-01-30 13:46:21 +01:00
Campbell Barton	bd1ded952b	Cleanup: spelling in comments	2025-01-29 12:31:19 +11:00
Jeroen Bakker	3eaa70c251	Fix #133690 : Vulkan: Faster downloading of textures. Somehow incorrect memory is selected when not setting the host write random on a buffer that is only read. Pull Request: https://projects.blender.org/blender/blender/pulls/133721	2025-01-28 16:41:43 +01:00
Jeroen Bakker	2d3d1d249b	Cleanup: Remove compilation warnings GCC13 Pull Request: https://projects.blender.org/blender/blender/pulls/133702	2025-01-28 11:50:59 +01:00
Jeroen Bakker	e6b3cc8983	Vulkan: Device command builder This PR implements a new the threading model for building render graphs based on tests performed last month. For out workload multithreaded command building will block in the driver or device. So better to use a single thread for command building. Details of the internal working is documented at https://developer.blender.org/docs/features/gpu/vulkan/render_graph/ - When a context is activated on a thread the context asks for a render graph it can use by calling `VKDevice::render_graph_new`. - Parts of the GPU backend that requires GPU commands will add a specific render graph node to the render graph. The nodes also contains a reference to all resources it needs including the access it needs and the image layout. - When the context is flushed the render graph is submitted to the device by calling `VKDevice::render_graph_submit`. - The device puts the render graph in `VKDevice::submission_pool`. - There is a single background thread that gets the next render graph to send to the GPU (`VKDevice::submission_runner`). - Reorder the commands of the render graph to comply with Vulkan specific command order rules and reducing possible bottlenecks. (`VKScheduler`) - Generate the required barriers `VKCommandBuilder::groups_extract_barriers`. This is a separate step to reduce resource locking giving other threads access to the resource states when they are building the render graph nodes. - GPU commands and pipeline barriers are recorded to a VkCommandBuffer. (`VKCommandBuilder::record_commands`) - When completed the command buffer can be submitted to the device queue. `vkQueueSubmit` - Render graphs that have been submitted can be reused by a next thread. This is done by pushing the render graph to the `VKDevice::unused_render_graphs` queue. Pull Request: https://projects.blender.org/blender/blender/pulls/132681	2025-01-27 08:55:23 +01:00
Jeroen Bakker	e2dddea124	Fix: Vulkan: Thread safe cache folder Vulkan shader compiler accesses the cache folder via multiple threads. GHOST part isn't thread safe and can return and overwrite the returned cache path. This resulted into crashes when performing background rendering and failing test cases, loading of incorrect shaders etc. This PR fixes this to cache the cache folder location in the VKShaderCompiler, which is loaded via the main thread when the vulkan backend is initialized. Pull Request: https://projects.blender.org/blender/blender/pulls/133535	2025-01-24 12:20:43 +01:00
Jeroen Bakker	2feb435780	Fix: Vulkan: Memory allocation on no-rebar capable platforms Memory areas was requested to be preferable host visible. On some platforms this would fail to allocate. Best is to not add preferable host visible for typically large allocations. This PR also gives the caller the responsibility to set the allocation flags. Pull Request: https://projects.blender.org/blender/blender/pulls/133528	2025-01-24 11:54:59 +01:00
Miguel Pozo	8d392d41c2	Fix: GPU: GPU_indexbuf_bind_as_ssbo Make the behavior consistent across all backends. Clean up the GL function. Follow-up from #132712. Pull Request: https://projects.blender.org/blender/blender/pulls/133383	2025-01-23 15:40:45 +01:00
Jeroen Bakker	2bd4e101a0	Fix #130106 : Vulkan: Pixelbuffer performance Cycles uses pixel buffers to update the display. Due to making things work the vulkan backend downloaded the GPU allocated pixel buffer to the CPU, Copied it to a GPU allocated staging buffer and update the display texture using the staging buffer. Needless to say that a (CPU->)GPU->CPU->GPU roundtrip is a bottleneck. This PR fixes this by allowing the pixel buffer to act as a staging buffer as well. Viewport and final image rendering performance is now also similar. \| Render \| GPU Backend \| Path tracing \| Display \| \| ---------- \| --------------- \| ---------------- \| ----------- \| \| Viewport \| OpenGL \| 2.7 \| 0.06 \| \| Viewport \| Vulkan \| 2.7 \| 0.04 \| \| Image \| OpenGL \| 3.9 \| 0.02 \| \| Image \| Vulkan \| 3.9 \| 0.02 \| Tested on: ``` Operating system: Linux-6.8.0-49-generic-x86_64-with-glibc2.39 64 Bits, X11 UI Graphics card: AMD Radeon Pro W7700 (RADV NAVI32) Advanced Micro Devices radv Mesa 24.3.1 - kisak-mesa PPA Vulkan Backend ``` Pull Request: https://projects.blender.org/blender/blender/pulls/133485	2025-01-23 14:58:49 +01:00
Jeroen Bakker	ff804882bd	Refactor: Vulkan: Store large data in separate vectors VKRenderGraphNode is 892 bytes and most of the bytes are used for specific nodes. By storing large structs in separate vectors we can reduce the needed memory and improve cache pre-fetching. With this change the VKRenderGraphNode is reduced to 64 bytes. On a (50 frames shader_balls.blend) the end user performance is improved by 2%. \| Platform \| Before \| After \| \| ---------------- \| ---------- \| --------- \| \| AMD W7700 \| 1409 ms \| 1383 ms \| \| NVIDIA RTX 6000 \| 1443 ms \| 1428 ms \| Pull Request: https://projects.blender.org/blender/blender/pulls/133317	2025-01-21 14:49:23 +01:00
Jeroen Bakker	781aeb1b3f	Fix #133155 : Vulkan: Initial uniform buffer upload failing In some cases the initial uniform buffer upload fails. Using the indirect upload (via render graph) does succeed. This PR uses the render graph approach. Pull Request: https://projects.blender.org/blender/blender/pulls/133293	2025-01-20 11:00:27 +01:00
Campbell Barton	90b03d2344	Cleanup: spelling in comments	2025-01-20 11:19:23 +11:00
Jeroen Bakker	390ca01685	Cleanup: Vulkan: Remove resource ownership Images used to be tracked with ownership in order to reset swap chain images to its original layout. This isn't used anymore as we always mark them in VK_IMAGE_LAYOUT_UNDEFINED to make the first pipeline barrier a nop. This change reduces unneeded complexity and safe a few CPU cycles. Pull Request: https://projects.blender.org/blender/blender/pulls/133197	2025-01-17 14:46:22 +01:00
Jeroen Bakker	2f18e4fe29	Vulkan: Add debug group for swapchain Improves debugging swapchains when using renderdoc. Pull Request: https://projects.blender.org/blender/blender/pulls/133190	2025-01-17 11:40:11 +01:00
Jeroen Bakker	56f14a0083	Vulkan: Add support for GPU_DATA_UBYTE to F16 data conversion This data conversion is needed to download a HDR framebuffer for color-picking and saving screenshots. Pull Request: https://projects.blender.org/blender/blender/pulls/133187	2025-01-17 10:58:28 +01:00
Jeroen Bakker	80ec04b4ef	Cleanup: Vulkan: Use full surface format GHOST_ContextVK used to pass only the surface texture format to the GPU backend, it didn't pass the color space. This PR also includes the color space. Pull Request: https://projects.blender.org/blender/blender/pulls/133185	2025-01-17 10:28:22 +01:00
Jeff Moguillansky	ea4d01923b	Fix: Vulkan: local_read incorrect attachment load/store ops This fixes a rendering issue when local read enabled. Before fix, the output image is too bright. This is due to incorrect load/store. With this fix, the logic for attachment load/store ops with local_read on matches the logic with local_read off inside subpass_transition_impl... Pull Request: https://projects.blender.org/blender/blender/pulls/133111	2025-01-16 08:21:14 +01:00
Campbell Barton	a29fe64e53	Cleanup: quiet compiler warning	2025-01-15 17:29:31 +11:00
Jeroen Bakker	264681344f	Vulkan: Provide more control on memory allocation This change give access to preferred and required allocation flags allowing better control over where memory is allocated. Pull Request: https://projects.blender.org/blender/blender/pulls/133059	2025-01-14 17:41:35 +01:00
Jeroen Bakker	04e64b27ea	Vulkan: Ignore swapchain image layout/content Blender always updates all pixels of the swap chain. As an optimization we can skip the initial layout transition from present to transfer destination as all pixels will be rewritten. Pull Request: https://projects.blender.org/blender/blender/pulls/133061	2025-01-14 17:08:04 +01:00
Jeroen Bakker	b898fd09b4	Fix: Vulkan: Incorrect swizzling Swizzling is supported when sampling. Outside samplers the swizzling must always be the initial swizzling. Detected when playing rain_restaurant.blend. EEVEE motion vectors use swizzling. Pull Request: https://projects.blender.org/blender/blender/pulls/133043	2025-01-14 14:09:57 +01:00
Jeroen Bakker	fbe05ac60b	Cleanup: Vulkan: Remove unused code/parameters Initial design had a more complex use case for render graphs. They are not really used and will not in the near term. This PR removes some code that doesn't do a thing Pull Request: https://projects.blender.org/blender/blender/pulls/133047	2025-01-14 14:09:50 +01:00
Jeroen Bakker	8a199dc77f	Vulkan: pipeline barriers extraction Pipeline barriers were extracted when recording commands. This works, but had the downside that it locked the device resources. Extracting pipeline barriers is fairly small task compared to recording commands. This PR will perform the extraction of pipelines separate from command recording. Code is easier to follow and when working with multiple threads this will reduce locking (enabling this will be done in separate PR). Original developed in !131965 Pull Request: https://projects.blender.org/blender/blender/pulls/132989	2025-01-14 09:50:42 +01:00
Jeroen Bakker	a4914b8972	Vulkan: Disable local read on non Qualcomm devices Only enable by default dynamic rendering local read on Qualcomm devices. NVIDIA, AMD and Intel performance is better when disabled (20%). On Qualcomm devices the improvement can be substantial (16% on shader_balls.blend). `--debug-gpu-vulkan-local-read` can be used to use dynamic rendering local read on any supported platform. Future: Check if bottleneck is during command building. If so we could fine-tune this after the device command building landed (#T132682). Pull Request: https://projects.blender.org/blender/blender/pulls/132981	2025-01-13 09:29:16 +01:00

1 2 3 4 5 ...

599 Commits