griefith/test

Author	SHA1	Message	Date
Brecht Van Lommel	ae28d90578	Fix T93350: Cycles renders shows black during rendering huge resolutions The root of the issue is caused by Cycles ignoring OpenGL limitation on the maximum resolution of textures: Cycles was allocating texture of the final render resolution. It was exceeding limitation on certain GPUs and driver. The idea is simple: use multiple textures for the display, each of which will fit into OpenGL limitations. There is some code which allows the display driver to know when to start the new tile. Also added some code to allow force graphics interop to be re-created. The latter one ended up not used in the final version of the patch, but it might be helpful for other drivers implementation. The tile size is limited to 8K now as it is the safest size for textures on many GPUs and OpenGL drivers. This is an updated fix with a workaround for freezing with the NVIDIA driver on Linux. Differential Revision: https://developer.blender.org/D13385	2022-01-07 17:20:04 +01:00
Brecht Van Lommel	204ae33d75	Revert "Fix T93350: Cycles renders shows black during rendering huge resolutions" This reverts commit `5e37f70307`. It is leading to freezing of the entire desktop for a few seconds when stopping 3D viewport rendering on my Linux / NVIDIA system.	2021-12-07 20:49:34 +01:00
Sergey Sharybin	5e37f70307	Fix T93350: Cycles renders shows black during rendering huge resolutions The root of the issue is caused by Cycles ignoring OpenGL limitation on the maximum resolution of textures: Cycles was allocating texture of the final render resolution. It was exceeding limitation on certain GPUs and driver. The idea is simple: use multiple textures for the display, each of which will fit into OpenGL limitations. There is some code which allows the display driver to know when to start the new tile. Also added some code to allow force graphics interop to be re-created. The latter one ended up not used in the final version of the patch, but it might be helpful for other drivers implementation. The tile size is limited to 8K now as it is the safest size for textures on many GPUs and OpenGL drivers. Differential Revision: https://developer.blender.org/D13385	2021-12-07 19:01:42 +01:00
Patrick Mours	e14f8c2dd7	Cycles: Reintroduce device-only memory handling that got lost in Cycles X merge Somehow only a part of rBf4f8b6dde32b0438e0b97a6d8ebeb89802987127 ended up in Cycles X, causing the issue that commit fixed, "OPTIX_ERROR_INVALID_VALUE" when the system is out of memory, to show up again. This adds the missing changes to fix that problem. Maniphest Tasks: T93620 Differential Revision: https://developer.blender.org/D13488	2021-12-07 18:50:10 +01:00
Campbell Barton	ac447ba1a3	Cleanup: clang-format, trailing space	2021-11-30 10:15:17 +11:00
Michael Jones	98a5c924fc	Cycles: Metal readiness: Specify DeviceQueue::enqueue arg types This patch adds new arg-type parameters to `DeviceQueue::enqueue` and its overrides. This is in preparation for the Metal backend which needs this information for correct argument encoding. Ref T92212 Reviewed By: brecht Maniphest Tasks: T92212 Differential Revision: https://developer.blender.org/D13357	2021-11-29 14:56:06 +00:00
Sergey Sharybin	1706bf7780	Merge branch 'blender-v3.0-release'	2021-11-22 17:32:23 +01:00
Sergey Sharybin	336ca6796a	Fix T90308: Cycles crash copying memory from device to host Happens when device runs out of memory and Cycles is moving some textures to the host memory. The delayed memory free for OptiX BVH was moving data from one device_memory to another, leaving the original device memory in an invalid state. This was ruining the allocation map in the CUDA device which is using pointer to the device_memory. This change makes it so the memory pointer is stolen from BVH into the delayed memory free list. Additionally, forbid copying and moving instances of device_memory and added sanity checks in the device implementation. Differential Revision: https://developer.blender.org/D13316	2021-11-22 17:26:59 +01:00
Sebastian Herholz	d9bc8f189c	Cycles: add build option to enable a debugging feature for MIS This patch adds a CMake option "WITH_CYCLES_DEBUG" which builds cycles with a feature that allows debugging/selecting the direct-light sampling strategy. The same option may later be used to add other debugging features that could affect performance in release builds. The three options are: * Forward path tracing (e.g., via BSDF or phase function) * Next-event estimation * Multiple importance sampling combination of the previous two methods Such a feature is useful for debugging light different sampling, evaluation, and pdf methods (e.g., for light sources and BSDFs). Differential Revision: https://developer.blender.org/D13152	2021-11-17 18:03:56 +01:00
Thomas Dinges	83a4d51997	Cleanup: Remove unused show_samples() device code in Cycles.	2021-11-17 11:16:48 +01:00
Thomas Dinges	25e7365d0d	Cleanup CUDA / HIP comments Remove outdated CUDA comments for bindless textures and cleanup some HIP comments that still mentioned CUDA. Differential Revision: https://developer.blender.org/D13189	2021-11-11 16:37:29 +01:00
Clément Foucault	3f0991266f	Merge branch 'blender-v3.0-release'	2021-11-01 12:15:09 +01:00
Thomas Dinges	5327413b37	Cleanup: Remove Cycles device checks for half float. All supported devices support half float now, so we can remove the check. Differential Revision: https://developer.blender.org/D13021	2021-11-01 10:18:30 +01:00
Brecht Van Lommel	806521f703	Fix T92671: confusing Cycles debug logs about CPU architecture Instead of printing debug flags listing various CPU and GPU settings that may or may not be used, print when we are using them. This include CPU kernel types, OptiX debugging and CUDA and HIP adaptive compilation. BVH type was already printed.	2021-11-01 08:36:50 +01:00
Brecht Van Lommel	fd25e883e2	Cycles: remove prefix from source code file names Remove prefix of filenames that is the same as the folder name. This used to help when #includes were using individual files, but now they are always relative to the cycles root directory and so the prefixes are redundant. For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:37:04 +02:00
Brecht Van Lommel	d7d40745fa	Cycles: changes to source code folders structure * Split render/ into scene/ and session/. The scene/ folder now contains the scene and its nodes. The session/ folder contains the render session and associated data structures like drivers and render buffers. * Move top level kernel headers into new folders kernel/camera/, kernel/film/, kernel/light/, kernel/sample/, kernel/util/ * Move integrator related kernel headers into kernel/integrator/ * Move OSL shaders from kernel/shaders/ to kernel/osl/shaders/ For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:36:39 +02:00
Brecht Van Lommel	df00463764	Cycles: add shadow path compaction for GPU rendering Similar to main path compaction that happens before adding work tiles, this compacts shadow paths before launching kernels that may add shadow paths. Only do it when more than 50% of space is wasted. It's not a clear win in all scenes, some are up to 1.5% slower. Likely caused by different order of scheduling kernels having an unpredictable performance impact. Still feels like compaction is just the right thing to avoid cases where a few shadow paths can hold up a lot of main paths. Differential Revision: https://developer.blender.org/D12944	2021-10-21 15:38:03 +02:00
Brecht Van Lommel	39810b3f51	Cleanup: make HIP and CUDA code more consistent Ref D12834	2021-10-21 13:08:10 +02:00
Brecht Van Lommel	001f548227	Cycles: reduce kernel reserved local memory when not using shader raytracing Ref T87836	2021-10-20 17:50:31 +02:00
Brecht Van Lommel	a754e35198	Cycles: refactor API for GPU display * Split GPUDisplay into two classes. PathTraceDisplay to implement the Cycles side, and DisplayDriver to implement the host application side. The DisplayDriver is now a fully abstract base class, embedded in the PathTraceDisplay. * Move copy_pixels_to_texture implementation out of the host side into the Cycles side, since it can be implemented in terms of the texture buffer mapping. * Move definition of DeviceGraphicsInteropDestination into display driver header, so that we do not need to expose private device headers in the public API. * Add more detailed comments about how the DisplayDriver should be implemented. The "driver" terminology might not be obvious, but is also used in other renderers. Differential Revision: https://developer.blender.org/D12626	2021-09-30 20:48:08 +02:00
Brecht Van Lommel	a6b53ef994	Cycles: print name of kernels on errors in CUDA queue, for debugging	2021-09-27 15:24:12 +02:00
Brecht Van Lommel	ab8f24811d	Cleanup: remove unused device code and includes	2021-09-24 16:34:14 +02:00
Brecht Van Lommel	d7f803f522	Fix T91641: crash rendering with 16k environment map in Cycles Protect against integer overflow.	2021-09-23 17:48:16 +02:00
Campbell Barton	4d66cbd140	Cleanup: spelling in comments	2021-09-22 14:54:01 +10:00
Brecht Van Lommel	0803119725	Cycles: merge of cycles-x branch, a major update to the renderer This includes much improved GPU rendering performance, viewport interactivity, new shadow catcher, revamped sampling settings, subsurface scattering anisotropy, new GPU volume sampling, improved PMJ sampling pattern, and more. Some features have also been removed or changed, breaking backwards compatibility. Including the removal of the OpenCL backend, for which alternatives are under development. Release notes and code docs: https://wiki.blender.org/wiki/Reference/Release_Notes/3.0/Cycles https://wiki.blender.org/wiki/Source/Render/Cycles Credits: * Sergey Sharybin * Brecht Van Lommel * Patrick Mours (OptiX backend) * Christophe Hery (subsurface scattering anisotropy) * William Leeson (PMJ sampling pattern) * Alaska (various fixes and tweaks) * Thomas Dinges (various fixes) For the full commit history, see the cycles-x branch. This squashes together all the changes since intermediate changes would often fail building or tests. Ref T87839, T87837, T87836 Fixes T90734, T89353, T80267, T80267, T77185, T69800	2021-09-21 14:55:54 +02:00
Brecht Van Lommel	073bf8bf52	Cycles: remove WITH_CYCLES_DEBUG, add WITH_CYCLES_DEBUG_NAN WITH_CYCLES_DEBUG was used for rendering BVH debugging passes. But since we mainly use Embree an OptiX now, this information is no longer important. WITH_CYCLES_DEBUG_NAN will enable additional checks for NaNs and invalid values in the kernel, for Cycles developers. Previously these asserts where enabled in all debug builds, but this is too likely to crash Blender in scenes that render fine regardless of the NaNs. So this is behind a CMake option now. Fixes T90240	2021-07-28 19:27:57 +02:00
Brecht Van Lommel	cf74cd9367	Cycles: upgrade CUDA to 11.4 This fixes a performance regression on Ampere cards, on specific scenes like classroom. For cycles-x there is little difference, but this is still helpful for LTS releases, and we need to upgrade at some point anyway.	2021-07-26 19:46:51 +02:00
Patrick Mours	f4f8b6dde3	Cycles: Change device-only memory to actually only allocate on the device This patch changes the `MEM_DEVICE_ONLY` type to only allocate on the device and fail if that is not possible anymore because out-of-memory (since OptiX acceleration structures may not be allocated in host memory). It also fixes high peak memory usage during OptiX acceleration structure building. Reviewed By: brecht Maniphest Tasks: T85985 Differential Revision: https://developer.blender.org/D10535	2021-03-11 14:12:35 +01:00
Campbell Barton	17e1e2bfd8	Cleanup: correct spelling in comments	2021-02-05 16:23:34 +11:00
James Horsley	4fbeb3e6be	Fix T85089: Crash when rendering scene that does not fit into GPU memory with CUDA/OptiX The "cuda_mem_map_mutex" was potentially being locked recursively during the call to "CUDADevice::move_textures_to_host", which crashed. This moves around the locking and unlocking of "cuda_mem_map_mutex", so that it doesn't call a function that locks it while still holding the lock. Reviewed By: pmoursnv Maniphest Tasks: T85089, T84734 Differential Revision: https://developer.blender.org/D10219	2021-01-27 15:27:57 +01:00
Lukas Stockner	688e5c6d38	Fix T82351: Cycles: Tile stealing glitches with adaptive sampling In my testing this works, but it requires me to remove the min(start_sample...) part in the adaptive sampling kernel, and I assume there's a reason why it was there? Reviewed By: brecht Maniphest Tasks: T82351 Differential Revision: https://developer.blender.org/D9445	2021-01-11 21:04:49 +01:00
Campbell Barton	001f2c5d50	Cleanup: spelling	2020-12-15 12:34:25 +11:00
Patrick Mours	bfb6fce659	Cycles: Add CPU+GPU rendering support with OptiX Adds support for building multiple BVH types in order to support using both CPU and OptiX devices for rendering simultaneously. Primitive packing for Embree and OptiX is now standalone, so it only needs to be run once and can be shared between the two. Additionally, BVH building was made a device call, so that each device backend can decide how to perform the building. The multi-device for instance creates a special multi-BVH that holds references to several sub-BVHs, one for each sub-device. Reviewed By: brecht, kevindietrich Differential Revision: https://developer.blender.org/D9718	2020-12-11 13:24:29 +01:00
Patrick Mours	3df90de6c2	Cycles: Add NanoVDB support for rendering volumes NanoVDB is a platform-independent sparse volume data structure that makes it possible to use OpenVDB volumes on the GPU. This patch uses it for volume rendering in Cycles, replacing the previous usage of dense 3D textures. Since it has a big impact on memory usage and performance and changes the OpenVDB branch used for the rest of Blender as well, this is not enabled by default yet, which will happen only after 2.82 was branched off. To enable it, build both dependencies and Blender itself with the "WITH_NANOVDB" CMake option. Reviewed By: brecht Differential Revision: https://developer.blender.org/D8794	2020-10-05 15:03:30 +02:00
Patrick Mours	a9644c812f	Cycles: Use pre-compiled PTX kernel for older generation when no matching one is found This patch changes the discovery of pre-compiled kernels, to look for any PTX, even if it does not match the current architecture version exactly. It works because the driver can JIT-compile PTX generated for architectures less than or equal to the current one. This e.g. makes it possible to render on a new GPU architecture even if no pre-compiled binary kernel was distributed for it as part of the Blender installation. Reviewed By: brecht Differential Revision: https://developer.blender.org/D8332	2020-07-20 19:25:27 +02:00
Brecht Van Lommel	93791381fe	Cleanup: reduce hardcoded numbers in denoising neighbor tiles code	2020-07-10 17:10:05 +02:00
Brecht Van Lommel	54e3487c9e	Cleanup: remove task pool stop() and finished()	2020-06-22 13:06:47 +02:00
Brecht Van Lommel	b10b7cdb43	Cleanup: use lambdas instead of functors for task pools, remove threadid	2020-06-22 13:06:47 +02:00
Brecht Van Lommel	ace3268482	Cleanup: minor refactoring around DeviceTask	2020-06-22 13:06:47 +02:00
Brecht Van Lommel	fc7c34e380	Cleanup: fix compiler warnings	2020-06-17 14:36:51 +02:00
Patrick Mours	b586f801fc	Cycles: Improve CUDA and OptiX error reporting in the viewport This patch makes the infamous "Cancel" error in the viewport a thing of the past. Instead it now shows a more useful error message and streamlines the error handling process in CUDA. Reviewed By: brecht Differential Revision: https://developer.blender.org/D8008	2020-06-12 18:24:15 +02:00
Patrick Mours	9f7d84b656	Cycles: Add support for P2P memory distribution (e.g. via NVLink) This change modifies the multi-device implementation to support memory distribution across devices, to reduce the overall memory footprint of large scenes and allow scenes to fit entirely into combined GPU memory that previously had to fall back to host memory. Reviewed By: brecht Differential Revision: https://developer.blender.org/D7426	2020-06-08 17:55:49 +02:00
Brecht Van Lommel	d9773edaa3	Cycles: code refactor to bake using regular render session and tiles There should be no user visible change from this, except that tile size now affects performance. The goal here is to simplify bake denoising in D3099, letting it reuse more denoising tiles and pass code. A lot of code is now shared with regular rendering, with the two main differences being that we read some render result passes from the bake API when starting to render a tile, and call the bake kernel instead of the path trace kernel. With this kind of design where Cycles asks for tiles from the bake API, it should eventually be easier to reduce memory usage, show tiles as they are baked, or bake multiple passes at once, though there's still quite some work needed for that. Reviewers: #cycles Subscribers: monio, wmatyjewicz, lukasstockner97, michaelknubben Differential Revision: https://developer.blender.org/D3108	2020-05-15 20:25:24 +02:00
Brecht Van Lommel	d97c83712c	Cycles: mark CUDA 10.2 as officially supported It appears to work fine after a recent bugfix and testing for the past few weeks.	2020-05-05 15:06:49 +02:00
Dalai Felinto	2d1cce8331	Cleanup: `make format` after SortedIncludes change	2020-03-19 09:33:58 +01:00
Brecht Van Lommel	26bea849cf	Cleanup: add device_texture for images, distinct from other global memory There was too much image texture specific stuff in device_memory, and too much code duplication between devices.	2020-03-12 17:28:55 +01:00
Brecht Van Lommel	f01bc597a8	Cleanup: stop encoding image data type in slot index This is legacy code from when we had a fixed number of textures.	2020-03-11 17:07:17 +01:00
Campbell Barton	8574d68aa0	Cleanup: spelling	2020-03-06 11:52:32 +11:00
Stefan Werner	51e898324d	Adaptive Sampling for Cycles. This feature takes some inspiration from "RenderMan: An Advanced Path Tracing Architecture for Movie Rendering" and "A Hierarchical Automatic Stopping Condition for Monte Carlo Global Illumination" The basic principle is as follows: While samples are being added to a pixel, the adaptive sampler writes half of the samples to a separate buffer. This gives it two separate estimates of the same pixel, and by comparing their difference it estimates convergence. Once convergence drops below a given threshold, the pixel is considered done. When a pixel has not converged yet and needs more samples than the minimum, its immediate neighbors are also set to take more samples. This is done in order to more reliably detect sharp features such as caustics. A 3x3 box filter that is run periodically over the tile buffer is used for that purpose. After a tile has finished rendering, the values of all passes are scaled as if they were rendered with the full number of samples. This way, any code operating on these buffers, for example the denoiser, does not need to be changed for per-pixel sample counts. Reviewed By: brecht, #cycles Differential Revision: https://developer.blender.org/D4686	2020-03-05 12:21:38 +01:00
Patrick Mours	af54bbd61c	Cycles: Rework tile scheduling for denoising This fixes denoising being delayed until after all rendering has finished. Instead, tile-based denoising is now part of the "RENDER" task again, so that it is all in one task and does not cause issues with dedicated task pools where tasks are serialized. Reviewed By: brecht Differential Revision: https://developer.blender.org/D6940	2020-02-28 16:12:29 +01:00

1 2

55 Commits