griefith/test

Author	SHA1	Message	Date
Alexander Gavrilov	a7f6f900f3	Cycles: avoid making NaNs in Vector Math node by normalizing zero vectors. Since inputs are user controlled, the node can't assume they aren't zero.	2016-08-09 13:20:22 +03:00
Sergey Sharybin	6353ecb996	Cycles: Tweaks to support CUDA 8 toolkit All the changes are mainly giving explicit tips on inlining functions, so they match how inlining worked with previous toolkit. This make kernel compiled by CUDA 8 render in average with same speed as previous kernels. Some scenes are somewhat faster, some of them are somewhat slower. But slowdown is within 1% so far. On a positive side it allows us to enable newer generation cards on buildbots (so GTX 10x0 will be officially supported soon).	2016-08-01 15:54:29 +02:00
Brecht Van Lommel	9b6ed3a42b	Cycles: refactor kernel closure storage to use structs per closure type. Reviewed By: dingto, sergey Differential Revision: https://developer.blender.org/D2127	2016-07-31 02:34:43 +02:00
Mai Lavelle	c96ae81160	Cycles microdisplacement: ngons and attributes for subdivision meshes This adds support for ngons and attributes on subdivision meshes. Ngons are needed for proper attribute interpolation as well as correct Catmull-Clark subdivision. Several changes are made to achieve this: - new primitive `SubdFace` added to `Mesh` - 3 more textures are used to store info on patches from subd meshes - Blender export uses loop interface instead of tessface for subd meshes - `Attribute` class is updated with a simplified way to pass primitive counts around and to support ngons. - extra points for ngons are generated for O(1) attribute interpolation - curves are temporally disabled on subd meshes to avoid various bugs with implementation - old unneeded code is removed from `subd/` - various fixes and improvements Reviewed By: brecht Differential Revision: https://developer.blender.org/D2108	2016-07-29 03:36:30 -04:00
Lukas Stockner	23c276832b	Cycles: Add multi-scattering, energy-conserving GGX as an option to the Glossy, Anisotropic and Glass BSDFs This commit adds a new distribution to the Glossy, Anisotropic and Glass BSDFs that implements the multiple-scattering microfacet model described in the paper "Multiple-Scattering Microfacet BSDFs with the Smith Model". Essentially, the improvement is that unlike classical GGX, which only models single scattering and assumes the contribution of multiple bounces to be zero, this new model performs a random walk on the microsurface until the ray leaves it again, which ensures perfect energy conservation. In practise, this means that the "darkening problem" - GGX materials becoming darker with increasing roughness - is solved in a physically correct and efficient way. The downside of this model is that it has no (known) analytic expression for evalation. However, it can be evaluated stochastically, and although the correct PDF isn't known either, the properties of MIS and the balance heuristic guarantee an unbiased result at the cost of slightly higher noise. Reviewers: dingto, #cycles, brecht Reviewed By: dingto, #cycles, brecht Subscribers: bliblubli, ace_dragon, gregzaal, brecht, harvester, dingto, marcog, swerner, jtheninja, Blendify, nutel Differential Revision: https://developer.blender.org/D2002	2016-06-23 22:57:26 +02:00
Lukas Stockner	7a5a02509b	Cycles: Use faster ray-quad-intersection test The original quad intersection test works by just testing against the two triangles that define the quad. However, in this case it's actually faster to use the same test that's also used for portals: Determining the distance to the plane in which the quad lies, calculating the hitpoint and checking whether it's in the quad by projecting onto the sides. Reviewers: brecht, sergey, dingto Reviewed By: dingto Differential Revision: https://developer.blender.org/D2045	2016-06-06 23:38:50 +02:00
Sergey Sharybin	3165e8740b	Fix T48139: Checker texture strange behavior in cycles Seems particular CUDA implementations has some precision issues, which made integer coordinate (which was expected to always be positive) to go negative.	2016-04-15 15:30:30 +02:00
Sergey Sharybin	b30ab24fb8	Cycles: Avoid re-definition of math cnstants with MSVC	2016-02-20 14:06:05 +05:00
Lukas Stockner	6995b4d8d9	Cycles: Adding Hilbert Spiral as a tile order for rendering This patch adds the "Hilbert Spiral", a custom-designed continuous space-filling curve, as a tile order for rendering in Cycles. It essentially works by dividing the tiles into tile blocks which are processed in a spiral outwards from the center. Inside each block, the tiles are processed in a regular Hilbert curve pattern. By rotating that pattern according to the spiral direction, a continuous curve is obtained, which helps with cache coherency and therefore rendering speed. The curve is a compromise between the faster-rendering Bottom-to-Top etc. orders and the Center order, which is a bit slower, but starts with the more important areas. The Hilbert Spiral also starts in the center (unless huge tiles are used) and is still marginally slower than Bottom-to-Top, but noticeably faster than Center. Reviewers: sergey, #cycles, dingto Reviewed By: #cycles, dingto Subscribers: iscream, gregzaal, sergey, mib2berlin Differential Revision: https://developer.blender.org/D1166	2016-01-10 00:13:53 +01:00
Lukas Stockner	8512e284a0	Fix T46906: Cycles syntax error while compiling OpenCL kernels The safe normalization was using a float as a condition, now the intended non-zero test is explicit.	2015-12-01 13:53:29 +01:00
Sergey Sharybin	c18e6fd87c	Cycles: Remove 32bit cuda workaroudn and disable cubins for buildbot Recent changes to kernel broke compilation of the kernels again, need some other kind of solution for this issue. Don't have much time for this currently, but will be addressed before the release. Meanwhile it's better to have some buildbot builds instead of totally failing one.	2015-08-04 18:50:37 +02:00
Sergey Sharybin	7973363e34	Cycles: Final-ish tweaks for 32bit cubin compilation	2015-07-27 16:55:50 +02:00
Sergey Sharybin	61e4800b45	Cycles: One more attempt to fix compilation of 32bit CUDA kernels	2015-07-27 14:18:20 +02:00
Sergey Sharybin	41d817f15d	Fix T44548: Cycles Tube Mapping off / not compatible with BI Was a typo in original implementation, probably a result of some code reshuffle happened for optimization reasons.	2015-04-30 14:27:16 +05:00
Sergey Sharybin	ae7d84dbc1	Cycles: Use native saturate function for CUDA This more a workaround for CUDA optimizer which can't optimize clamp(x, 0, 1) into a single instruction and uses 4 instructions instead. Original patch by @lockal with own modification: Don't make changes outside of the kernel. They don't make any difference anyway and term saturate() has a bit different meaning outside of kernel. This gives around 2% of speedup in Barcelona file, but in more complex shader setups with lots of math nodes with clamping speedup could be much nicer. Subscribers: dingto Projects: #cycles Differential Revision: https://developer.blender.org/D1224	2015-04-28 00:38:32 +05:00
Sergey Sharybin	5ff132182d	Cycles: Code cleanup, spaces around keywords This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.	2015-03-28 00:15:15 +05:00
Sergey Sharybin	3e534833e3	Cycles: Make sphere and tube image mapping friendly with OpenCL OpenCL doesn't let you to get address of vector components, which is kinda annoying. On the other hand, maybe now compiler will have more chances to optimize something out.	2015-02-19 12:52:48 +05:00
Sergey Sharybin	bf4c44491a	Cycles: Some more constants fixes for fast math	2015-02-06 15:40:07 +05:00
Sergey Sharybin	9617446be2	Cycles: Fix compilation error with some compilers Not sure why this was not visible previously, but the change is logical anyway.	2015-01-22 17:04:01 +05:00
Sergey Sharybin	dda355442d	Cycles: Support tube projection for images This way Cycles finally becomes feature-full on image projections compared to Blender Internal and Gooseberry Project Team could finally finish the movie.	2015-01-22 00:41:42 +05:00
Sergey Sharybin	4f2583ee13	Fix T43027: OpenCL kernel compilation broken after QBVH OpenCL apparently does not support templates, so the idea of generic function for swapping is a bit of a failure. Now it is either inlined into the code (in triangle intersection) or has specific implementation for QBVH. This is probably even better, because we can't create QBVH-specific function in util_math anyway.	2015-01-02 14:58:01 +05:00
Thomas Dinges	ee36e75b85	Cleanup: Fix Cycles Apache header. This was already mixed a bit, but the dot belongs there.	2014-12-25 02:50:24 +01:00
Sergey Sharybin	ab8d9c4b88	Cycles: Add some utility functions and structures Most of them are not currently used but are essential for the further work. - CPU kernels with SSE2 support will now have sse3b, sse3f and sse3i - Added templatedversions of min4, max4 which are handy to use with register variables. - Added util_swap function which gets arguments by pointers. So hopefully it'll be a portable version of std::swap.	2014-12-25 02:50:49 +05:00
Sergey Sharybin	f770bc4757	Cycles: Implement watertight ray/triangle intersection Using this paper: Sven Woop, Watertight Ray/Triangle Intersection http://jcgt.org/published/0002/01/05/paper.pdf This change is expected to address quite reasonable amount of reports from the bug tracker, plus it might help reducing the noise in some scenes. Unfortunately, it's currently about 7% slower than the previous solution with pre-computed triangle plane equations, but maybe with some smart tweaks to the code (tests reshuffle, using SIMD in a nice way or so) we can avoid the speed regression. But perhaps smartest thing to do here would be to change single triangle / ray intersection with multiple triangles / ray intersections. That's how Embree does this and it's watertight single ray intersection is not any faster that this. Currently only triangle intersection is modified accordingly to the paper, in the future we would also want to modify the node / ray intersection. Reviewers: brecht, juicyfruit Subscribers: dingto, ton Differential Revision: https://developer.blender.org/D819	2014-12-25 02:50:49 +05:00
Campbell Barton	7b873b0662	Add safe_normalize to cycles, avoid checking length first This won't give any big speedup, just avoids redundant sqrtf and may be useful in future. Differential Revision: https://developer.blender.org/D880	2014-11-08 13:37:42 +01:00
Campbell Barton	106ea0b20b	Cleanup: sync map_to_sphere, UNLIKELY xy zero case	2014-09-16 12:41:16 +10:00
Thomas Dinges	03ce9882af	Fix T41839, OpenCL error. Also some style fixes, we don't do the "put as much as possible in 1 line" contest.	2014-09-15 14:22:39 +02:00
Thomas Dinges	00acf4b816	Cleanup: Use function call and delete obsolete comment.	2014-09-02 23:26:49 +02:00
Thomas Dinges	e3ed13cbd4	Cleanup: Remove special code for Visual Studio 2008. Goodbye VC2008, it has been a pleasure (more or less) :D SCons / CMake cleaenup will follow. Differential Revision: https://developer.blender.org/D715	2014-08-07 13:52:15 +02:00
Sergey Sharybin	946f291c46	Fix T41174: Tangent space required UV map in Cycles Now Cycles behaves in the same way as BI in terms of using sphere projection of orco coordinates if there's no UV map when calculating tangent space.	2014-07-29 16:08:47 +06:00
Thomas Dinges	be182d9704	Code cleanup.	2014-06-13 22:26:20 +02:00
Campbell Barton	65d54f34b1	Code cleanup: spelling/indentation	2014-05-08 04:53:05 +10:00
Matt Heimlich	3fbc984b06	Nodes: add absolute value operation to all math nodes Reviewed By: dingto, brecht Differential Revision: https://developer.blender.org/D507	2014-05-07 16:43:59 +02:00
Campbell Barton	d828d44d7a	Cycles: use LIKELY/UNLIKELY macros Gives overall ~3% speedup in own tests for BMW scene.	2014-05-05 03:49:22 +10:00
Sv. Lockal	ab32a1807d	Cycles: SSE optimization for Voronoi cells texture Gives 5-6% speedup for Caterpillar_PatazStudio.blend. Reviewed By: brecht, dingto Differential Revision: https://developer.blender.org/D419	2014-04-03 23:35:10 +04:00
Brecht Van Lommel	3847d0c0df	Cycles code internals: add initial implementation of decoupled ray marching. This basically records all volumes steps, which can then later be used multiple time to take scattering samples, without having to step through the volume again. From the paper: "Importance Sampling Techniques for Path Tracing in Participating Media" This works only on the CPU, due to usage of malloc/free.	2014-03-29 13:03:50 +01:00
Brecht Van Lommel	6020d00990	Cycles: add support for mesh deformation motion blur.	2014-03-29 13:03:47 +01:00
Brecht Van Lommel	934767cf7f	Cycles code refactor: change curve key to float4 for easier storage as attribute.	2014-03-29 13:03:46 +01:00
Sv. Lockal	1c49eb0072	Cycles, Code cleanup: simplify code for color linear interpolation and float math Reviewed By: brecht Differential Revision: https://developer.blender.org/D215	2014-01-14 22:55:02 +04:00
Campbell Barton	a288644b1e	Code Cleanup: WIN32 defines, check for _MSC_VER instead of !FREE_WINDOWS	2014-01-03 20:46:12 +11:00
Martijn Berger	1c8a12ee61	Fix T37987: MSVC 2013 has C99 headers and warns for out define hypot _hypot for good reason it seems	2014-01-02 22:19:10 +01:00
Martijn Berger	e3a79258d1	Cycles: test code for sse 4.1 kernel and alignment for some vector types. This is mostly work towards enabling the __KERNEL_SSE__ option to start using SIMD operations for vector math operations. This 4.1 kernel performes about 8% faster with that option but overall is still slower than without the option. WITH_CYCLES_OPTIMIZED_KERNEL_SSE41 is the cmake flag for testing this kernel. Alignment of int3, int4, float3, float4 to 16 bytes seems to give a slight 1-2% speedup on tested systems with the current kernel already, so is enabled now.	2013-11-22 14:42:41 +01:00
Brecht Van Lommel	c18712e868	Cycles: change __device and similar qualifiers to ccl_device in kernel code. This to avoids build conflicts with libc++ on FreeBSD, these __ prefixed values are reserved for compilers. I apologize to anyone who has patches or branches and has to go through the pain of merging this change, it may be easiest to do these same replacements in your code and then apply/merge the patch. Ref T37477.	2013-11-18 08:48:15 +01:00
Brecht Van Lommel	b9ce231060	Cycles: relicense GNU GPL source code to Apache version 2.0. More information in this post: http://code.blender.org/ Thanks to all contributes for giving their permission!	2013-08-18 14:16:15 +00:00
Brecht Van Lommel	d43682d51b	Cycles: Subsurface Scattering New features: * Bump mapping now works with SSS * Texture Blur factor for SSS, see the documentation for details: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Shaders#Subsurface_Scattering Work in progress for feedback: Initial implementation of the "BSSRDF Importance Sampling" paper, which uses a different importance sampling method. It gives better quality results in many ways, with the availability of both Cubic and Gaussian falloff functions, but also tends to be more noisy when using the progressive integrator and does not give great results with some geometry. It works quite well for the non-progressive integrator and is often less noisy there. This code may still change a lot, so unless you're testing it may be best to stick to the Compatible falloff function. Skin test render and file that takes advantage of the gaussian falloff: http://www.pasteall.org/pic/show.php?id=57661 http://www.pasteall.org/pic/show.php?id=57662 http://www.pasteall.org/blend/23501	2013-08-18 14:15:57 +00:00
Brecht Van Lommel	3d847ed6e6	Fix #36064 : cycles direct/indirect light passes with materials that have zero RGB color components gave non-grey results when you might no expect it. What happens is that some of the color channels are zero in the direct light pass because their channel is zero in the color pass. The direct light pass is defined as lighting divided by the color pass, and we can't divide by zero. We do a division after all samples are added together to ensure that multiplication in the compositor gives the exact combined pass even with antialiasing, DoF, .. Found a simple tweak here, instead of setting such channels to zero it will set it to the average of other non-zero color channels, which makes the results look like the expected grey.	2013-07-08 23:31:45 +00:00
Thomas Dinges	c6ce8de20e	Code cleanup / Cycles: * Some cleanup for castings.	2013-06-27 15:48:16 +00:00
Brecht Van Lommel	484d765bd4	Cycles: attempt to fix internal compile error with some visual studio builds	2013-06-18 13:19:16 +00:00
Brecht Van Lommel	d835d2f4e6	Code cleanup: avoid some warnings due to implicit uint/int/float/double conversion.	2013-06-07 16:06:17 +00:00
Thomas Dinges	9e4914e055	Cycles: * Revert r57203 (len() renaming) There seems to be a problem with nVidia OpenCL after this and I haven't figured out the real cause yet. Better to selectively enable native length() later, after figuring out what's wrong. This fixes [#35612].	2013-06-04 17:20:00 +00:00

1 2

89 Commits