griefith/test

Author	SHA1	Message	Date
Brecht Van Lommel	79f1cc601c	Cycles: improve ray tracing precision near triangle edges Detect cases where a ray-intersection would miss the current triangle, which if the intersection is strictly watertight, implies that a neighboring triangle would incorrectly be hit instead. When that is detected, apply a ray-offset. The idea being that we only want to introduce potential error from ray offsets if we really need to. This work for BVH2 and Embree, as we are able to match the ray-interesction bit-for-bit, though doing so for Embree requires ugly hacks. Tiny differences like fused-multiply-add or dot product intrinstics in matrix inversion and ray intersection needed to be matched exactly, so this is fragile. Unfortunately we're not able to do the same for OptiX or MetalRT, since those implementations are unknown (and possibly impossible to match as hardware instructions). Still artifacts are much reduced, though not eliminated. Ref T97259 Differential Revision: https://developer.blender.org/D15559	2022-08-09 18:42:01 +02:00
Brecht Van Lommel	230f9ade64	Cycles: make transform inverse match Embree exactly Helps improve ray-tracing precision. This is a bit complicated as it requires different implementation depending on the CPU architecture.	2022-08-09 16:59:05 +02:00
Brecht Van Lommel	286e535071	Cleanup: simplify CPU instruction checking The performance of this will be slightly more important for upcoming changes. Also removed an unused function and changed includes so these system.h can be included in more places.	2022-08-09 16:59:05 +02:00
Sergey Sharybin	cefd6140f3	Fix T100119: Cycles light object's parametric vector distorted Caused by `38af5b0501`. Adjust barycentric coordinates used for intersection result in the ray-to-rectangle intersection check. Differential Revision: https://developer.blender.org/D15592	2022-08-09 15:56:03 +02:00
Brecht Van Lommel	2e6fcd9aad	Fix Cycles crash after recent bugfix for image saving	2022-08-08 16:59:37 +02:00
Nikita Sirgienko	1382514bf2	Fix: Error in oneAPI image code for texture access with clip extension	2022-08-08 10:47:11 +02:00
Xavier Hallade	e4938b163e	Cycles: re-enable zebin format for Intel GPUs on Linux zebin format is critical for the compatibility of AoT graphics binaries across driver versions. It was previously disabled on Linux due to runtime issues that are now fixed in https://github.com/intel/compute-runtime/releases/tag/22.31.23852. The minimum supported driver version isn't bumped to this one yet as current codebase with current IGC compiler does actually run fine on earlier drivers and is not running into these issues anymore.	2022-08-07 22:54:15 +02:00
Brecht Van Lommel	18b703dc05	Fix T99991: Cycles wrong intercept with new hair curves Copy the improved hair curves sync implementation from D14942. That patch is not ready as a whole but this part was verified to match the old hair particles can be used already.	2022-08-05 19:33:53 +02:00
Brecht Van Lommel	d95100e821	Fix T98769: Cycles crash with multi-device rendering and BVH refits	2022-08-05 17:19:19 +02:00
Brecht Van Lommel	43a124bc1c	Fix T99179: holdout does not affect transparency without transparent background This was by design, but maybe not so useful in practice. It's always possible to set alpha to 1 in compositing if needed.	2022-08-05 16:32:13 +02:00
Brecht Van Lommel	fa514564b0	Fix T99201: Cycles render difference with 3D hair curves between OptiX and Emrbee It should consistently use the Cycles pirmitive ID for self intersection detection, not the one from the OptiX or Embree acceleration structure. Differential Revision: https://developer.blender.org/D15632	2022-08-05 15:03:47 +02:00
Brecht Van Lommel	b29d6de77a	Fix T99055: Cycles vertex color bake + denoising gives bad result No denoising is supported in this case, it only works on images.	2022-08-04 19:28:09 +02:00
Brecht Van Lommel	1c2c468abc	Fix T100134: Cycles faceted triangle normals with motion blur After recent changes to change barycentric coordinate convention.	2022-08-04 19:08:45 +02:00
Brecht Van Lommel	91d365f6df	Fix T100205: Cycles wrong volume shading with two materials in object Assume that all faces using the smae material form a closed mesh, so that joining meshes gives the same result as separate meshes. It does mean that using different materials on different sides of one closed mesh do not work, but the meaning of that is poorly defined anyway if there is a volume interior.	2022-08-04 19:02:25 +02:00
Julian Eisel	585dd63c6e	Cleanup: Move RNA path functions into own C++ file NOTE: This is committed to the 3.3 branch as part of D15606, which we decided should go to this release still (by Bastien, Dalai and me). That is because these are important usability fixes/improvements to have for the LTS release. Adds `rna_path.cc` and `RNA_path.h`. `rna_access.c` is a quite big file, which makes it rather hard and inconvenient to navigate. RNA path functions form a nicely coherent unit that can stand well on it's own, so it makes sense to split them off to mitigate the problem. Moreover, I was looking into refactoring the quite convoluted/overloaded `rna_path_parse()`, and found that some C++ features may help greatly with that. So having that code compile in C++ would be helpful to attempt that. Differential Revision: https://developer.blender.org/D15540 Reviewed by: Brecht Van Lommel, Campbell Barton, Bastien Montagne	2022-08-04 16:13:00 +02:00
Nikita Sirgienko	76169472d3	Cycles: Resolve recent performance regression in oneAPI implementation for Intel® Arc™ GPUs Recently, performance with oneAPI have regressed due some recent changes in Blender itself. This commit's changes is resolving this and also improve compilation time for oneAPI backend first execution (or Blender compilation time in case of AoT). Regression have appeared after `5152c7c152` and not related to the changes itself, but increase of kernels complexity introduced with it. Changes in this commit is marking some Blender functions as noinlined for oneAPI backend, which helps GPU compiler to deal with this complexity without any negative side-effects on performance.	2022-08-01 12:45:34 +02:00
Tianhao Chai	b862cf0b9f	Fix Cycles build error with CUDA on arm64 Checking arm64 assembly support before CUDA/Metal would cause NVCC to generate inline arm64 assembly. Differential Revision: https://developer.blender.org/D15569	2022-07-29 14:57:09 +02:00
Brecht Van Lommel	38af5b0501	Cycles: switch Cycles triangle barycentric convention to match Embree/OptiX Simplifies intersection code a little and slightly improves precision regarding self intersection. The parametric texture coordinate in shader nodes is still the same as before for compatibility.	2022-07-27 21:03:33 +02:00
Brecht Van Lommel	69f2732a13	Cleanup: remove unnecessary bvh_instance_motion_pop	2022-07-27 21:02:21 +02:00
Brecht Van Lommel	cd47d1b2ed	Fix broken BVH2 on CPU after recent changes Runtime switching between Embree and BVH2 got lost.	2022-07-27 20:58:02 +02:00
Xavier Hallade	d706d0460c	Cycles oneAPI: simplify num_concurrent_states selection The number of Execution Units and resident "threads" (simd width * threads per EUs) are now exposed and used to select the number of states using a simplified heuristic.	2022-07-27 09:45:33 +02:00
Campbell Barton	f1f89ca751	Cleanup: spelling in comments	2022-07-26 13:21:21 +10:00
Brecht Van Lommel	4cf6524731	Fix Cycles Metal build errors after recent changes float8 is a reserved type in Metal, but is not implemented. So rename to float8_t for now. Also move back intersection handlers to kernel.metal, they can't be in the class that encapsulates the other Metal kernel functions.	2022-07-26 00:17:37 +02:00
Clément Foucault	46dbfce7fc	Cycles: Nishita Sky: Fix sun disk imprecision for large elevation The issue was introduced by rBad5e3d30a2d2 which made possible to use unbounded elevation angle. In order to not touch the shading code, we just remap the value to the expected range the shading code expects. This means that elevation angles above +/-PI/2 effectively flip the sun rotation angle.	2022-07-25 19:26:12 +02:00
Brecht Van Lommel	f26aa186b2	Cleanup: remove __KERNEL_CPU__ This was tested in some places to check if code was being compiled for the CPU, however this is only defined in the kernel. Checking __KERNEL_GPU__ always works.	2022-07-25 17:43:35 +02:00
Andrii Symkin	793d203139	Cycles: add math functions for float8 This patch adds required math functions for float8 to make it possible using float8 instead of float3 for color data. Differential Revision: https://developer.blender.org/D15525	2022-07-25 17:36:58 +02:00
Brecht Van Lommel	7a74d91e32	Cleanup: move device BVH code to kernel/device/*/bvh.h Having the OptiX/MetalRT/Embree/MetalRT implementations all in one file with many #ifdefs became too confusing. Instead split it up per device, and also move it together with device specific hit/filter/intersect functions and associated data types.	2022-07-25 16:34:22 +02:00
Brecht Van Lommel	881ef0548a	Fix wrong Cycles SSS intersection distance after ray distance changes No need anymore to have a difference between CPU/GPU, all distances remain in world space.	2022-07-25 15:19:29 +02:00
Brecht Van Lommel	484ad31653	Cycles: simplify handling of ray distance in GPU rendering All our intersections functions now work with unnormalized ray direction, which means we no longer need to transform ray distance between world and object space, they can all remain in world space. There doesn't seem to be any real performance difference one way or the other, but it does simplify the code.	2022-07-25 13:27:40 +02:00
Brecht Van Lommel	023eb2ea7c	Cycles: more closely match some math and intersection operations in Embree This helps with debugging, and gives a slightly closer match between CPU and CUDA/HIP/Metal renders when it comes to ray tracing precision.	2022-07-25 13:27:40 +02:00
Brecht Van Lommel	cd9ebc816e	Fix build error with WITH_CYCLES_KERNEL_NATIVE_ONLY on macOS Arm -march=native is not supported for all architectures.	2022-07-25 11:23:25 +02:00
Lukas Stockner	d26c29d8e4	Fix T98367: Light group passes do not work when shadow catcher is used	2022-07-24 20:36:46 +02:00
Campbell Barton	185eeeaaac	GHOST/Wayland: Fix mouse wheel events for Sway Compositor (use seat v5) Bump the requested seat version to v5, use discreet scroll callback. Tested with gnome, river & sway.	2022-07-22 22:15:00 +10:00
Sebastiano Barrera	a5c2d0018c	Fix T91932: number sliders wrap around when dragged for long distance on X11 The value of number sliders (e.g. the "end frame" button) wrap around to their pre-click value when dragging them for a very long distance (e.g. by lifting the mouse off the desk and placing it back on to keep dragging in the same direction). The problem is X11-specific, and due to XTranslateCoordinates using a signed int16 behind the curtains, while its signature and the rest of Blender uses int32. The solution is to only use XTranslateCoordinates on (0, 0) to get the delta between the screen and client reference systems, and applying the delta in a second step. Differential Revision: https://developer.blender.org/D15507	2022-07-21 19:02:03 +02:00
Campbell Barton	03338e0270	GHOST/Wayland: fix cursor glitch after grabbing while hidden When the cursor grabbing was disabled, Blender's internal location (wmWindow.eventstate) kept the location before un-hiding. This caused the paint cursor to show in the wrong location after adjusting the color wheel for e.g.	2022-07-21 21:47:41 +10:00
Campbell Barton	46a2592eef	Cleanup: spelling in comments, typos in tool-tips	2022-07-21 13:21:53 +10:00
Hans Goudey	410a6efb74	Point Cloud: Remove redundant custom data pointers Similar to `e9f82d3dc7`, but for point clouds instead. Differential Revision: https://developer.blender.org/D15487	2022-07-19 18:06:56 -05:00
Hans Goudey	e9f82d3dc7	Curves: Remove redundant custom data pointers These mutable pointers present problems with ownership in relation to proper copy-on-write for attributes. The simplest solution is to just remove them and retrieve the layers from `CustomData` when they are needed. This also removes the complexity and redundancy of having to update the pointers as the curves change. A similar change will apply to meshes and point clouds. One downside of this change is that it makes random access with RNA slower. However, it's simple to just use the RNA attribute API instead, which is unaffected. In this patch I updated Cycles to do that. With the future attribute CoW changes, this generic approach makes sense because Cycles can just request ownership of the existing arrays. Differential Revision: https://developer.blender.org/D15486	2022-07-19 18:01:04 -05:00
Hans Goudey	597955d0a8	Cleanup: Remove compile option for curves object After `becb1530b1` the new curves object type isn't hidden behind an experimental flag anymore, and other areas depend on this, so disabling curves at compile time doesn't make sense anymore.	2022-07-19 12:10:45 -05:00
Campbell Barton	7ebd1f4b79	Cleanup: quiet compiler warnings	2022-07-19 13:32:53 +10:00
Brecht Van Lommel	3407ed5f9b	Cleanup: change internal Cycles compact BVH default to match UI	2022-07-18 15:34:13 +02:00
Brecht Van Lommel	c7f788b877	Subdiv: remove unused GPU device choice, fix crash with libepoxy on init openSubdiv_init() would detect available evaluators before any OpenGL context exists, causing a crash with libepoxy. This test however is redundant as we already check the requirements on the Blender side through the GPU API. To simplify things, completely remove the device detection in the opensubdiv module and reduce the evaluators to just CPU and GPU. The plan here is to move to the GPU module abstraction over OpenGL/Metal/Vulkan and so all these different backends no longer make sense. This also removes the user preference for OpenSubdiv compute device, which was not used for the new GPU subdivision implementation. Ref D15291 Differential Revision: https://developer.blender.org/D15470	2022-07-18 13:59:08 +02:00
Campbell Barton	49babc7caa	Cleanup: early exit MEM_lockfree_freeN when called with NULL pointer Simplify logic for freeing a NULL pointer. While no null-pointer de-reference was performed, this wasn't as so obvious as the pointer was passed to MEM_lockfree_allocN_len before checking for NULL. NOTE: T99744 claimed the a NULL pointer free was a vulnerability, while I can't see evidence for this - exiting early makes it clearer the memory isn't accessed. Details - Add MEMHEAD_LEN macro, avoids redundant NULL check. - Use "UNLIKELY(..)" hint's for error cases (freeing NULL pointer and checking if `leak_detector_has_run`).	2022-07-16 17:23:42 +10:00
Brecht Van Lommel	5152c7c152	Cycles: refactor rays to have start and end distance, fix precision issues For transparency, volume and light intersection rays, adjust these distances rather than the ray start position. This way we increment the start distance by the smallest possible float increment to avoid self intersections, and be sure it works as the distance compared to be will be exactly the same as before, due to the ray start position and direction remaining the same. Fix T98764, T96537, hair ray tracing precision issues. Differential Revision: https://developer.blender.org/D15455	2022-07-15 18:46:24 +02:00
Brecht Van Lommel	bb376da6df	Fix Cycles MetalRT error after recent specialization changes	2022-07-15 18:28:13 +02:00
Brecht Van Lommel	011d3c75a7	Cleanup: compiler warning	2022-07-15 15:20:53 +02:00
Brecht Van Lommel	523bbf7065	Cycles: generalize shader sorting / locality heuristic to all GPU devices This was added for Metal, but also gives good results with CUDA and OptiX. Also enable it for future Apple GPUs instead of only M1 and M2, since this has been shown to help across multiple GPUs so the better bet seems to enable rather than disable it. Also moves some of the logic outside of the Metal device code, and always enables the code in the kernel since other devices don't do dynamic compile. Time per sample with OptiX + RTX A6000: new old barbershop_interior 0.0730s 0.0727s bmw27 0.0047s 0.0053s classroom 0.0428s 0.0464s fishy_cat 0.0102s 0.0108s junkshop 0.0366s 0.0395s koro 0.0567s 0.0578s monster 0.0206s 0.0223s pabellon 0.0158s 0.0174s sponza 0.0088s 0.0100s spring 0.1267s 0.1280s victor 0.0524s 0.0531s wdas_cloud 0.0817s 0.0816s Ref D15331, T87836	2022-07-15 13:42:47 +02:00
Michael Jones	da4ef05e4d	Cycles: Apple Silicon optimization to specialize intersection kernels The Metal backend now compiles and caches a second set of kernels which are optimized for scene contents, enabled for Apple Silicon. The implementation supports doing this both for intersection and shading kernels. However this is currently only enabled for intersection kernels that are quick to compile, and already give a good speedup. Enabling this for shading kernels would be faster still, however this also causes a long wait times and would need a good user interface to control this. M1 Max samples per minute (macOS 13.0): PSO_GENERIC PSO_SPECIALIZED_INTERSECT PSO_SPECIALIZED_SHADE barbershop_interior 83.4 89.5 93.7 bmw27 1486.1 1671.0 1825.8 classroom 175.2 196.8 206.3 fishy_cat 674.2 704.3 719.3 junkshop 205.4 212.0 257.7 koro 310.1 336.1 342.8 monster 376.7 418.6 424.1 pabellon 273.5 325.4 339.8 sponza 830.6 929.6 1142.4 victor 86.7 96.4 96.3 wdas_cloud 111.8 112.7 183.1 Code contributed by Jason Fielder, Morteza Mostajabodaveh and Michael Jones Differential Revision: https://developer.blender.org/D14645	2022-07-15 13:40:04 +02:00
Michael Jones	5653c5fcdd	Cycles: keep track of SVM nodes used in kernels To be used for specialization in Metal, to automatically leave out unused nodes from the kernel. Ref D14645	2022-07-15 13:40:04 +02:00
Brecht Van Lommel	79da7f2a8f	Cycles: refactor to move part of KernelData definition to template header To be used for specialization on Metal in a following commit, turning these members into compile time constants. Ref D14645	2022-07-15 13:40:04 +02:00

1 2 3 4 5 ...

11776 Commits