griefith/test

Author	SHA1	Message	Date
Lukas Stockner	1272ee455e	Cycles: Implement texture coordinates for Point, Spot and Area Lamps When using the Normal output of the Texture Coordinate node on Point and Spot lamps, the coordinates now depend on the rotation of the lamp. On Area lamps, the Parametric output of the Geometry node now returns UV coordinates on the area lamp. Credit for the Area lamp part goes to Stefan Werner (from D1995).	2016-10-29 19:24:08 +02:00
Sergey Sharybin	f11298692b	Cycles: More workarounds for weird crashes on AVX2 Oh man, is it a compiler bug? Is it something we do stupid? For now more crap to prevent crashes. During the conference will talk to Maxyn about how can we troubleshoot such weird issues.	2016-10-27 12:51:03 +02:00
Sergey Sharybin	7e380ad4c0	Cycles: Another attempt to fix crashes on AVX2 processors Basically don't use rcp() in areas which seems to be critical after second look. Also disabled some multiplication operators, not sure yet why they might be a problem. Tomorrow will be setting up a full test with all cases which were buggy in our farm to see if this fix is complete.	2016-10-26 22:14:41 +02:00
Sergey Sharybin	de22e55291	Cycles: Fix compilation error of AVX2 kernel without SSE math	2016-10-26 20:49:33 +02:00
Sergey Sharybin	35f152358b	Cycles: Completely disable transform SSE for now Was causing issues on another frame. On a tight schedule, disabling for now so artists are happy. Still looking into root of the issue!	2016-10-26 15:23:58 +02:00
Sergey Sharybin	7c7d23691f	Cycles: Fix crashes after recent optimization commits There is some precision issues for big magnitude coordinates which started to give weird behavior of release builds. Some weird memory usage in BVH which is tricky to nail down because only happens in release builds and GDB reports all variables as optimized out when trying to use RelWithDebInfo. There are two things in this commit: - Attempt to make vectorized code closer to original one, hoping that it'll eliminate precision issue. This seems to work for transform_point(). - Similar trick did not work for transform_direction() even tho absolute error here is much smaller. For now disabled that function, need a more careful look here.	2016-10-26 14:30:25 +02:00
Sergey Sharybin	f523fb43f9	Cycles: Fix for fix (tm) Sorry guys, for some reason read the expression back-to-front and did wrong fix :S	2016-10-25 18:29:13 +02:00
Sergey Sharybin	5c4113a3e4	Cycles: Fix typo in previous commit for BVH improvements	2016-10-25 18:06:38 +02:00
Aaron Carlisle	cf9a6b416c	API: Fix Links Self-explanatory. to find broken links run `sphinx-build -b linkcheck sphinx-in sphinx-out` Reviewers: mont29 Tags: #bf_blender, #python, #infrastructure:_websites Differential Revision: https://developer.blender.org/D2297	2016-10-25 17:34:01 +02:00
Sergey Sharybin	c54381488b	Cycles: Enable SSE math optimization for AVX kernels This gives about 5% speedup for AVX processors. Benefit of such optimization on other microarchitectures is still under investigation.	2016-10-25 16:10:47 +02:00
Sergey Sharybin	8c761ff838	Cycles: Use new SSE version of offset calculation for all QBVH flavors Gives up to ~1% speedup again. While it seems to be small, still nice since the code now is actually more clean that it used to be before.	2016-10-25 15:27:50 +02:00
Sergey Sharybin	f7cf2f659a	Cycles: Move QBVH near/far offset calculation to an utility function Just preparing for new optimization to be used in all traversal implementation. Should be no measurable difference.	2016-10-25 15:08:33 +02:00
Sergey Sharybin	064caae7b2	Cycles: BVH-related SSE optimization Several ideas here: - Optimize calculation of near_{x,y,z} in a way that does not require 3 if() statements per update, which avoids negative effect of wrong branch prediction. - Optimization of direction clamping for BVH. - Optimization of point/direction transform. Brings ~1.5% speedup again depending on a scene (unfortunately, this speedup can't be sum across all previous commits because speedup of each of the changes varies from scene to scene, but it still seems to be nice solid speedup of few percent on Linux and bigger speedup was reported on Windows). Once again ,thanks Maxym for inspiration! Still TODO: We have multiple places where we need to calculate near x,y,z indices in BVH, for now it's only done for main BVH traversal. Will try to move this calculation to an utility function and see if that can be easily re-used across all the BVH flavors.	2016-10-25 14:47:34 +02:00
Sergey Sharybin	81c9e0d295	Cycles: Avoid branching in SSE version of intersection pre-calculation Similar to the previous commit, avoid negative effect of bad branch prediction. Gives measurable performance up to ~2% in tests here. Once again, thanks to Maxym Dmytrychenko!	2016-10-25 14:18:32 +02:00
Sergey Sharybin	af411d918e	Cycles: Implement SSE-optimized path of util_max_axis() The idea here is to avoid if statements which could cause wrong branch prediction. Gives a bit of measurable speedup up to ~1%. Still nice :) Inspired by Maxym Dmytrychenko, thanks!	2016-10-25 13:54:17 +02:00
Sergey Sharybin	10a25b655a	Cycles: Add AVX2 path to subsurface triangle intersection Similar to regular triangle intersection case. Gives about 3% speedup rendering SSS object on my desktop, Question: how to avoid such a code duplication in a nice way without speed loss?	2016-10-24 16:56:41 +02:00
Sergey Sharybin	da8f5d6eac	Cycles: Don't use guarded vector for statically initialized data This will confuse hell of a guarded allocators because it is possible to have allocation happened prior to Blender's guarded allocator is fully initialized. This was causing crashes and assert failures when running blender with fully guarded memory allocator.	2016-10-24 14:18:22 +02:00
Sergey Sharybin	14a55bc059	Cycles: Fix shadowing variable which also causes use of uninitialized variable Was causing wrong aperture for panorama cameras. Seems to be a regression in `371d357`.	2016-10-24 14:04:31 +02:00
Sergey Sharybin	cde18cf3b3	Cycles: Fix static initialization order fiasco Initialization order of global stats and node types was not strictly defined and it was possible to have node types initialized first and stats after that. This will zero out memory which was allocated from the statistics causing assert failure when de-initializing node types.	2016-10-24 13:47:39 +02:00
Sergey Sharybin	963aa7e270	Cycles: Fix uninitialized variable from the previous commit	2016-10-24 12:54:24 +02:00
Sergey Sharybin	80a6e5beb5	Cycles: Remove explicit std:: from types where possible We have our own abstraction level on top of the STL's implementation. This commit will guarantee our tweaks are used for all cases.	2016-10-24 12:31:11 +02:00
Sergey Sharybin	48997d2e40	Cycles: Cleanup, style	2016-10-24 12:26:12 +02:00
Sergey Sharybin	3f29259676	Fix T49818: Crash when rendering with motion blur It was possible to have non-initialized unaligned BVH split to be used when regular BVH split SAH was inf. Now we ensure that unaligned splitter is only used when it's really initialized. It's a regression and should be in 2.78a.	2016-10-24 11:47:32 +02:00
Sergey Sharybin	1e1811357d	Cycles: Cleanup, spaces	2016-10-24 11:47:32 +02:00
Hristo Gueorguiev	8905c5c874	Cycles: OpenCL 3d textures support. Note that volume rendering is not supported yet, this is a step towards that. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2299	2016-10-22 23:49:29 +02:00
Brecht Van Lommel	371d3570e0	Fix Cycles address space OpenCL error after recent fix.	2016-10-22 23:36:30 +02:00
Brecht Van Lommel	9d0ac94d52	Fix T49750: Cycles wrong ray differentials for perspective and stereo cameras.	2016-10-22 16:37:26 +02:00
Jörg Müller	132478d4b8	Fix T49657: Audio backend "Jack" should be named "JACK".	2016-10-22 14:20:47 +02:00
Jörg Müller	d5ee031f76	Fix T49764: Audio strips crackle when animating the volume - Implemented linear interpolation for volume changes in the software mixer. - Using this in the software device.	2016-10-22 13:39:55 +02:00
Lukas Stockner	f7ce482385	Cycles: Fix another OpenCL logging issue Previously an error message would be printed whenever the OpenCL build produced output. However, some frameworks seem to print extra information even if the build succeeded, so now the actual returned error is checked as well. When --debug-cycles is activated, the build output will always be printed, otherwise it only gets printed if there was an error.	2016-10-21 02:49:00 +02:00
Lukas Stockner	cd843409d3	Fix T49630: Cycles: Swapped shader and bake kernels The problem here was, as the title says, that the two kernels were swapped. Since shader evaluation is only used for building the samling map when World MIS is enabled, rendering without it would still work fine, although baking also was broken.	2016-10-17 12:28:01 +02:00
Lukas Stockner	d5dd12e56c	Cycles: Improve OpenCL kernel compilation logging The previous refactor changed the code to use a separate logging mechanism to support multithreaded compilation. However, since that's not supported by any frameworks yes, it just resulted in bad logging behaviour. So, this commit changes the logging to go diectly to stdout/stderr once again by default.	2016-10-17 11:51:18 +02:00
Scott Wu	7fec7eee20	Cycles: use near clipping distance in panorama camera. Reviewed By: sergey, brecht, dfelinto Differential Revision: https://developer.blender.org/D1952	2016-10-15 00:26:59 +02:00
Sergey Sharybin	0ddb8d9b13	Cycles: Disable optimization of operator / for float3 This was giving some speedup but made intersection tests to fail from watertight point of view. Needs deeper investigation, but need to quickly get it fixed for the studio.	2016-10-14 13:53:26 +02:00
Brecht Van Lommel	7f5441b916	Fix T49640: Cycles constant folding incorrect for texture coordinates.	2016-10-12 18:42:38 +02:00
Brecht Van Lommel	21e65d7457	Fix build error with WITH_CYCLES_NATIVE_ONLY and recent AVX2 changes.	2016-10-12 17:35:03 +02:00
Sergey Sharybin	22cdf44101	Cycles: Use const reference for register variables in non-OpenCL code This is something tested by @LazyDodo and suggested by Maxym to make MSVC happier.	2016-10-12 14:48:59 +02:00
Sergey Sharybin	e588106d45	Cycles: Use more SSE intrinsics for float3 type This gives about 5% speedup on AVX2 kernels (other kernels still have SSE disabled for math operations) and this solves the slowdown of koro scene mention in the previous commit. The title says it all actually. This commit also contains changes to pass float3 as const reference in affected functions. This should make MSVC happier without breaking OpenCL because it's only done in areas which are ifdef-ed for non-OpenCL. Another patch based on inspiration from Maxym Dmytrychenko, thanks!	2016-10-12 14:43:00 +02:00
Sergey Sharybin	42aeb608e7	Cycles: Implement AVX2 version of triangle_intersect This commit basically vectorizes existing code using AVX2 instructions (without modifying algorithm itself). This gives quite nice speedups: BMW: -8% Classroom: -5% Cat: -5% Koro: +1% Barcelona: -8% That's on Linux machine, reported performance improvement on Windows goes up to 20%. Not currently sure why Koro is somewhat slower because it mainly uses curve intersection tests, could be a time noise? Or osmething with the cache utilization perhaps? In any case speedup in other scenes makes me thinking that current state is acceptable for initial implementation. This is again inspired by Maxym Dmytrychenko.	2016-10-12 14:11:55 +02:00
Sergey Sharybin	6a4ec3ca43	Cycles: Add new avxf vectorized data type Based on existing ssef data type and to my knowledge it's also what happens in Embree nowadays. Inspired by Maxym Dmytrychenko and required for the upcoming triangle intersection commit. Hopefully the copyright message is correct.	2016-10-12 13:54:13 +02:00
Sergey Sharybin	fa62a989b4	Cycles: Enable SSE options of math module for AVX2 kernels Currently this does not give measurable difference, but is required ground work for some upcoming further optimization of AVX2 kernels.	2016-10-12 12:54:31 +02:00
Sergey Sharybin	87d08a5dc1	Cycles: Get rid of ifdef-ed noinline policy	2016-10-12 12:15:24 +02:00
Sergey Sharybin	cc95172667	Cycles: Fix use of uninitialized variable in SSS When ray hits curve segment with SSS shader it was possible to have uninitialized hit_P variable used for sampling. Seems that was a reason of our headache of difference between AVX2 and SSE4 render results here, so now we can revert all the nasty ifdef-ed inline policies.	2016-10-12 12:12:28 +02:00
Sergey Sharybin	edd9d89673	Cycles: Cleanup, style	2016-10-12 11:54:33 +02:00
Lukas Stockner	9ea71bc674	Cycles: Split device_opencl.cpp into multiple files for easier maintenance There are no user-visible changes, just some internal restructuring. Differential Revision: https://developer.blender.org/D2231	2016-10-09 15:49:50 +02:00
Brecht Van Lommel	74e0f900c5	Fix a few compile errors with C++11 on macOS.	2016-10-08 15:03:53 +02:00
Lukas Stockner	2dccf5a6e8	Cycles: Fix OpenCL split kernel compilation after recent CUDA 8 performance fix	2016-10-07 18:50:43 +02:00
Brecht Van Lommel	5a0f397eaa	Fix T49523: very slow normal map tangent computation for rendering in 2.78.	2016-10-06 03:12:04 +02:00
Brecht Van Lommel	b4f9766ed1	Cycles CUDA: make CUDA 8.0 the officially supported version for all platforms.	2016-10-03 22:15:26 +02:00
Brecht Van Lommel	a3abb020e3	Fix Cycles CUDA performance on CUDA 8.0. Mostly this is making inlining match CUDA 7.5 in a few performance critical places. The end result is that performance is now better than before, possibly due to less register spilling or other CUDA 8.0 compiler improvements. On benchmarks scenes, there are 3% to 35% render time reductions. Stack memory usage is reduced a little too. Reviewed By: sergey Differential Revision: https://developer.blender.org/D2269	2016-10-03 22:15:25 +02:00

1 2 3 4 5 ...

6306 Commits