Quite obscure issue, seemingly caused by the fact that HIP-RT is passing a large (about 20 MB) global array to a different library (HIP driver, via hipModuleLoadData). Having global variables of such size seems to be always problematic as they can not be stored on stack and, possibly, extra mapping is involved here. It is not clear whether it is a quirk of the HIP driver, or Linux, or, maybe, something completely different. It is possible to work-around the problem by making a temporary copy of data on heap memory and pass it to the hipModuleLoadData(). This is how other areas are dealing with modules in Blender. This change contains patch against HIP-RT and the new HIP-RT library compiled with the patch. It seems to fix the problem reported in the report. This change does not resolve OIDN on HIP GPU which seems to have the same issue. However, it is not a recent regression and the bug with OIDN GPU denoising can be reproduced using Blender 4.3. Pull Request: https://projects.blender.org/blender/blender/pulls/135403
14 lines
456 B
Diff
14 lines
456 B
Diff
diff --git a/CMakeLists.txt b/CMakeLists.txt
|
|
index 50eb25e..b13d2da 100644
|
|
--- a/CMakeLists.txt
|
|
+++ b/CMakeLists.txt
|
|
@@ -503,7 +503,7 @@ if ( BAKE_COMPILED_KERNEL )
|
|
|
|
# Create the 'bake_compiled_kernels' project
|
|
add_custom_target(bake_compiled_kernels ALL
|
|
- DEPENDS ${KERNEL_HIPRT_H} ${KERNEL_OROCHI_H}
|
|
+ DEPENDS ${KERNEL_HIPRT_H} ${KERNEL_OROCHI_H} precompile_kernels
|
|
)
|
|
|
|
add_dependencies(${HIPRT_NAME} precompile_kernels bake_compiled_kernels)
|