Author	SHA1	Message	Date
Brecht Van Lommel	f99f958c47	Refactor: Cycles: Add host_alloc/free to device API This may be used by the device to do host memory allocation in a way that is more efficient for copying the host memory to the device. Also rename and group device memory allocation functions for clarity. Pull Request: https://projects.blender.org/blender/blender/pulls/134412	2025-02-13 19:58:56 +01:00
Brecht Van Lommel	e8ebcb3ee3	Fix: Cycles: Check if memory is host mapped without access to device_mem_map This avoids concurrency issues. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:23 +01:00
Brecht Van Lommel	8b7fce492e	Refactor: Cycles: Change API so host and device memory are freed together With host mapped memory these can be shared, and we can't get back the original host pointer unless we make a copy which is inefficient. Also add asserts to verify this doesn't happen. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:19 +01:00
Brecht Van Lommel	1ec04e0eec	Fix: Cycles: Only move textures to host on one device at a time This was not thread safe. And it's better to do them one by one to avoid moving more than is needed, when another thread already freed up enough. Thanks to Jorn Visser for investigating and finding this problem. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:09 +01:00
Brecht Van Lommel	cd3d3b2646	Refactor: Cycles: Delay load_texture_info() to enqueue Doing it immediately after moving textures to the host is less efficient, and interacts in confusing ways. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:12:06 +01:00
Brecht Van Lommel	2cfe2e0bfe	Fix: Cycles: Re-copy memory from host to device without realloc Should be a bit more efficient, and it fixes host memory fallback bugs, where host memory was incorrectly freed during re-copy. For the case where memory should get reallocated on the host, a new mem_move_to_host was added. Thanks to Jorn Visser for investigating and finding this problem. Pull Request: https://projects.blender.org/blender/blender/pulls/132912	2025-01-29 14:11:50 +01:00
Brecht Van Lommel	57ff24cb99	Refactor: Cycles: Add const keyword to more function parameters Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:24 +01:00
Brecht Van Lommel	689633d802	Refactor: Cycles: Avoid unsafe memcpy and memcmp Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:23:15 +01:00
Brecht Van Lommel	d0c2e68e5f	Refactor: Cycles: Automated clang-tidy fixups in Cycles * Use .empty() and .data() * Use nullptr instead of 0 * No else after return * Simple class member initialization * Add override for virtual methods * Include C++ instead of C headers * Remove some unused includes * Use default constructors * Always use braces * Consistent names in definition and declaration * Change typedef to using Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:22:55 +01:00
Brecht Van Lommel	f53e13411b	Refactor: Cycles: Use #pragma once Pull Request: https://projects.blender.org/blender/blender/pulls/132361	2025-01-03 10:22:45 +01:00
Sergey Sharybin	6c3f3a7fb6	Fix: Proper forward declaration for friend class Turns out it is possible to have code to pick up wrong class when defining a friend: ``` intern\cycles\device/memory.h(255): warning C4099: 'GPUDevice': type name first seen using 'struct' now seen using 'class' source\blender\gpu\GPU_platform.hh(69): note: see declaration of 'GPUDevice' ``` Now made it so the classes have forward declaration in the CCL namespace, avoiding possible conflict with the classes with the same name in the global namespace. Pull Request: https://projects.blender.org/blender/blender/pulls/128485	2024-10-04 09:56:54 +02:00
Sergey Sharybin	92733a9415	Fix: Cycles memory leak in HIP-RT Some of the device memory objects had their host_pointer overwritten with another CPU-side buffer after allocation. This leads to a leak of host memory allocated by the device_memory. There are a few remaining places where the host_pointer is assigned, and those seem to be fine because the memory was not yet allocated with an alloc() call. While the approach in this change is not ideal, it is small and could potentially be ported to the LTS tracks. A better solution would be to utilize device_vector::give_data(). Pull Request: https://projects.blender.org/blender/blender/pulls/126788	2024-08-27 12:46:54 +02:00
Nikita Sirgienko	759bb6c768	Cycles: oneAPI: Enable host memory migration This enables scenes with all textures not fitting in GPU memory to finally render. For scenes that are fitting, no functional change or performance change is expected. Pull Request: https://projects.blender.org/blender/blender/pulls/122385	2024-05-28 19:04:19 +02:00
Michael Jones	e82d69daa1	Cycles: Disambiguate shadow integrator state buffer names This patch adds a "shadow" prefix & array index suffixes to the shadow integrator state buffer names. This eliminates confusion when looking at GPU traces etc. Pull Request: https://projects.blender.org/blender/blender/pulls/121745	2024-05-15 23:19:24 +02:00
Campbell Barton	c12994612b	License headers: use SPDX-FileCopyrightText in intern/cycles	2023-06-14 16:53:23 +10:00
Sergey Sharybin	d32d787f5f	Clang-Format: Allow empty functions to be single-line For example ``` OIIOOutputDriver::~OIIOOutputDriver() { } ``` becomes ``` OIIOOutputDriver::~OIIOOutputDriver() {} ``` Saves quite some vertical space, which is especially handy for constructors. Pull Request: https://projects.blender.org/blender/blender/pulls/105594	2023-03-29 16:50:54 +02:00
William Leeson	6c03339e48	Cycles: reduce mesh memory usage by unflattening This improves mesh upload speeds and reduces the size of the scene data, which allows larger scenes to be rendered. The meshes in Cycles are currently stored as flattened meshes, where each triangle is stored as a set of 3 vertices. Flattening writes out the vertices in a list according to the index buffer. This uses a lot of memory and for current hardware does not provide a noticeable benefit. This change unflattens the mesh by using the mesh's vertex and index buffers directly and skipping the flattening step. This allows for larger scenes and reduces the size of the meshes. It also decreases the time it takes to upload the data to a GPU, which is especially important when multiple GPUs are used in a single machine. Pull Request #105173	2023-02-27 10:39:19 +01:00
Nikita Sirgienko	6dcfb6df9c	Cycles: Abstract host memory fallback for GPU devices Host memory fallback in CUDA and HIP devices is almost identical. We remove duplicated code and create a shared generic version that other devices (oneAPI) will be able to use. Reviewed By: brecht Differential Revision: https://developer.blender.org/D17173	2023-02-06 22:19:32 +01:00
Brecht Van Lommel	ff1883307f	Cleanup: renaming and consistency for kernel data * Rename "texture" to "data array". This has not used textures for a long time, there are just global memory arrays now. (On old CUDA GPUs there was a cache for textures but not global memory, so we used to put all data in textures.) * For CUDA and HIP, put globals in KernelParams struct like other devices. * Drop __ prefix for data array names, no possibility for naming conflict now that these are in a struct.	2022-06-20 12:30:48 +02:00
Brecht Van Lommel	9cfc7967dd	Cycles: use SPDX license headers * Replace license text in headers with SPDX identifiers. * Remove specific license info from outdated readme.txt, instead leave details to the source files. * Add list of SPDX license identifiers used, and corresponding license texts. * Update copyright dates while we're at it. Ref D14069, T95597	2022-02-11 17:47:34 +01:00
Sergey Sharybin	04d55038ee	Fix size_t -> int -> size_t round trip in Cycles There are two things achieved by this change: - No possible downcast of size_t to int when calculating motion steps. - Disambiguate call to `min()` which was for some reason considered ambiguous on 32bit platforms `min(int, unsigned int)`. - Do the same for the `max()` call to keep them symmetrical. On the implementation side, `min()` is defined for a fixed width integer type to disambiguate uint from size_t on 32bit platforms, and yet be able to use it for 32bit operands on 64bit platforms without upcast. This ended up being a somewhat bigger change, as conditionally compiling in functions is easiest if the functions are templated. Making the functions templated required removing the other source of ambiguity, which is `algorithm.h`, which was pulling min/max from std. Now `math.h` is the source of truth for min/max. There was only one place which was relying on `algorithm.h` for these functions, hence the choice of `math.h` as the safest and least intrusive. Fixes 32bit platforms (such as i386) in Debian package build system. Differential Revision: https://developer.blender.org/D14062	2022-02-10 12:39:41 +01:00
Michael Jones	9558fa5196	Cycles: Metal host-side code This patch adds the Metal host-side code: - Add all core host-side Metal backend files (device_impl, queue, etc) - Add MetalRT BVH setup files - Integrate with Cycles device enumeration code - Revive `path_source_replace_includes` in util/path (required for MSL compilation) This patch also includes a couple of small kernel-side fixes: - Add an implementation of `lgammaf` for Metal [Nemes, Gergő (2010), "New asymptotic expansion for the Gamma function", Archiv der Mathematik](https://users.renyi.hu/~gergonemes/) - include "work_stealing.h" inside the Metal context class because it accesses state now Ref T92212 Reviewed By: brecht Maniphest Tasks: T92212 Differential Revision: https://developer.blender.org/D13423	2021-12-07 15:52:21 +00:00
Campbell Barton	76471dbd5e	Cleanup: capitalize NOTE tag	2021-11-30 10:15:17 +11:00
Sergey Sharybin	1706bf7780	Merge branch 'blender-v3.0-release'	2021-11-22 17:32:23 +01:00
Sergey Sharybin	336ca6796a	Fix T90308: Cycles crash copying memory from device to host Happens when the device runs out of memory and Cycles is moving some textures to the host memory. The delayed memory free for OptiX BVH was moving data from one device_memory to another, leaving the original device memory in an invalid state. This was ruining the allocation map in the CUDA device, which uses a pointer to the device_memory. This change makes it so the memory pointer is stolen from BVH into the delayed memory free list. Additionally, forbid copying and moving instances of device_memory and add sanity checks in the device implementation. Differential Revision: https://developer.blender.org/D13316	2021-11-22 17:26:59 +01:00
Brecht Van Lommel	9937d5379c	Cycles: add packed_float3 type for storage Introduce a packed_float3 type for smaller storage that is exactly 3 floats, instead of 4. For computation float3 is still used since it can use SIMD instructions. Ref T92212 Differential Revision: https://developer.blender.org/D13243	2021-11-17 17:29:41 +01:00
Brecht Van Lommel	fd25e883e2	Cycles: remove prefix from source code file names Remove prefix of filenames that is the same as the folder name. This used to help when #includes were using individual files, but now they are always relative to the cycles root directory and so the prefixes are redundant. For patches and branches, git merge and rebase should be able to detect the renames and move over code to the right file.	2021-10-26 15:37:04 +02:00

27 Commits