griefith/test

Author	SHA1	Message	Date
Thomas Dinges	4eab0e72b3	Cleanup: Update some comments and add ToDo.	2015-04-29 23:56:46 +02:00
Thomas Dinges	b3def11f5b	Cycles: Record all possible volume intersections for SSS and camera checks This replaces sequential ray moving followed with scene intersection with single BVH traversal, which gives us all possible intersections. Only implemented for CPU, due to qsort and a bigger memory usage on GPU which we rather avoid. GPU still uses the regular bvh volume intersection code, while CPU now uses the new code. This improves render performance for scenes with: a) Camera inside volume mesh b) SSS mesh intersecting a volume mesh/domain In simple volume files (not much geometry) performance is roughly the same (slightly faster). In files with a lot of geometry, the performance increase is larger. bmps.blend with a volume shader and camera inside the mesh, it renders ~10% faster here. Patch by Sergey and myself. Differential Revision: https://developer.blender.org/D1264	2015-04-29 23:31:06 +02:00
Sergey Sharybin	7aab5c6ca9	Cycles: Fix wrong termination criteria in SSS volume stack update Another issue spotted with Thomas.	2015-04-30 01:20:17 +05:00
Thomas Dinges	5e423775da	Cleanup: Move Cycles volume stack update for subsurface into kernel_volume.h.	2015-04-28 11:20:27 +02:00
Thomas Dinges	58a2b10a65	Cycles: Initialize portal variable directly, so we can avoid the one NULL check.	2015-04-27 23:12:53 +02:00
Lukas Stockner	f478c2cfbd	Cycles: Added support for light portals This patch adds support for light portals: objects that help sampling the environment light, therefore improving convergence. Using them tor other lights in a unidirectional pathtracer is virtually useless. The sampling is done with the area-preserving code already used for area lamps. MIS is used both for combination of different portals and for combining portal- and envmap-sampling. The direction of portals is considered, they aren't used if the sampling point is behind them. Reviewers: sergey, dingto, #cycles Reviewed By: dingto, #cycles Subscribers: Lapineige, nutel, jtheninja, dsisco11, januz, vitorbalbio, candreacchio, TARDISMaker, lichtwerk, ace_dragon, marcog, mib2berlin, Tunge, lopataasdf, lordodin, sergey, dingto Differential Revision: https://developer.blender.org/D1133	2015-04-28 01:30:16 +05:00
Sergey Sharybin	ae7d84dbc1	Cycles: Use native saturate function for CUDA This more a workaround for CUDA optimizer which can't optimize clamp(x, 0, 1) into a single instruction and uses 4 instructions instead. Original patch by @lockal with own modification: Don't make changes outside of the kernel. They don't make any difference anyway and term saturate() has a bit different meaning outside of kernel. This gives around 2% of speedup in Barcelona file, but in more complex shader setups with lots of math nodes with clamping speedup could be much nicer. Subscribers: dingto Projects: #cycles Differential Revision: https://developer.blender.org/D1224	2015-04-28 00:38:32 +05:00
Thomas Dinges	bc160d8a85	Cleanup: Code style.	2015-04-26 00:42:26 +02:00
Lukas Stockner	60c5a2f2d2	Cycles: Add Mirror ball mapping to camera panorama options The projection code was already in place, so this just exposes the option. Differential Revision: https://developer.blender.org/D1079	2015-04-25 23:51:56 +02:00
Campbell Barton	b82d571c85	Cleanup: style	2015-04-21 15:53:32 +10:00
Sergey Sharybin	828abaf11c	Cycles: Split BVH nodes storage into inner and leaf nodes This way we can get rid of inefficient memory usage caused by BVH boundbox part being unused by leaf nodes but still being allocated for them. Doing such split allows to save 6 of float4 values for QBVH per leaf node and 3 of float4 values for regular BVH per leaf node. This translates into following memory save using 01.01.01.G rendered without hair: Device memory size Device memory peak Global memory peak Before the patch: 4957 5051 7668 With the patch: 4467 4562 7332 The measurements are done against current master. Still need to run speed tests and it's hard to predict if it's faster or not: on the one hand leaf nodes are now much more coherent in cache, on the other hand they're not so much coherent with regular nodes anymore. Reviewers: brecht, juicyfruit Subscribers: venomgfx, eyecandy Differential Revision: https://developer.blender.org/D1236	2015-04-20 17:29:51 +05:00
Sergey Sharybin	bf11e362c5	Fix T44046: Cycles speed regression in 2.74 (CPU only) Issue was caused by MSVC not being able to optimize some code out in the same way as GCC/Clang does, so now that parts of code are explicitly unfolded in order to help compilers out. This makes speed loss much less drastic on my laptop. That's probably as good as we can do with MSVC without investing infinite amount of time looking trying to workaround the optimizer.	2015-04-08 18:47:25 +05:00
Sergey Sharybin	09a746b857	Cycles: Cleanup, typos	2015-04-08 01:15:38 +05:00
Sergey Sharybin	858f54f16e	Cycles: Cleanup, indentation	2015-04-07 22:41:08 +05:00
Sergey Sharybin	e2354e64d2	Cycles: Cleanup, spaces around assignment operator Did some bad spacing in recent commits, better to get rid of those so they does not confuse those who're working on sources.	2015-04-07 00:25:54 +05:00
Sergey Sharybin	c1d8ddacaf	Cycles: Avoid doing paranoid checks in filepath of builtin images Originally we thought it's needed in order to distinguish builtin file from filename which starts with '@', but the filepath is actually full path there and it's unlikely to have file system where '@' is a proper root character. Surprisingly this does not give visible speed differences, but it's still nice to get rid of redundant check.	2015-04-07 00:11:47 +05:00
Sergey Sharybin	7c19239bf9	Cycles: Support bultin 3d textures with OSL backend	2015-04-06 23:29:29 +05:00
Sergey Sharybin	a9bb8d8a73	Cycles: de-duplicate fast/approximate erf function calculation Our own implementation is in fact the same performance as in fast_math from OpenShadingLanguage, but implementation from fast_math is using explicit madd function, which increases chance of compiler deciding to use intrinsics.	2015-04-06 12:49:44 +05:00
Sergey Sharybin	ab2d05d958	Fix T44269: Typo in volume_attribute_float:geom_volume.h Was rather harmless typo since we either pass both dx,dy or pass both NULL.	2015-04-05 19:07:45 +05:00
Sergey Sharybin	b06962fcfe	Cycles: Avoid using lookup table for Beckmann slopes on GPU This patch is based on some work done in D788 and re-formulation from Beckmann implementation in OpenShadingLanguage. Skipping texture lookup helps a lot on GPUs where it's more expensive to access texture memory than to do some extra calculation in threads. CPU code still uses lookup-table based approach since this seems to be still faster (at least on computers i've got access to). This change gives about 2% speedup on BMW scene with GTX560TI.	2015-04-05 19:07:45 +05:00
Sergey Sharybin	252b36ce77	Cycles: Remove unused Beckmann slope sampling code It did not preserve stratification too well and lookup-table approach was working much better. There are now also some more interesting forumlation from Wenzel and OpenShadingLanguage which should work better than old code.	2015-04-05 19:07:44 +05:00
Thomas Dinges	e5392069cc	Cleanup: Typo fix in HSV code.	2015-04-04 07:50:09 +02:00
Sergey Sharybin	f1494edf78	Cycles: Make SSS intersection closer to regular triangle intersection	2015-04-01 21:20:04 +05:00
Sergey Sharybin	394b947a50	Cycles: Remove unused direction from triangle intersection functions This argument was unused and got nicely optimized out. But once it starts to be using registers are getting stressed really crazy, causing slow down of render.	2015-04-01 21:08:12 +05:00
Sergey Sharybin	af399884e1	Fix T44113: Ashikhmin-Shirley distribution of glossy shader at 0 roughness causes artifacts when background uses MIS Was a division by zero error, solved in the same way as beckmann/ggx deals with small roughness values.	2015-04-01 14:21:21 +05:00
Sergey Sharybin	79918e0577	Cycles: Avoid float/int conversion in few places	2015-03-31 19:52:14 +05:00
Sergey Sharybin	7da4c2637d	Cycles: Fix typo in distance heuristic for shadow rays It's not that bad because this typo could only caused not really efficient BVH traversal, causing higher render times. Not as if it was causing render artifacts.	2015-03-31 19:52:14 +05:00
Sergey Sharybin	afbc45ed93	Cycles: Attempt to fix osl+scons compilation Defines (and other cflags) are not inherited by scons to the subdirectories, need to take care of them in all nested SConscripts.	2015-03-30 14:00:03 +05:00
Sergey Sharybin	5ff132182d	Cycles: Code cleanup, spaces around keywords This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.	2015-03-28 00:15:15 +05:00
Sergey Sharybin	6cd82dbf57	CMake: Enable strict flags for C++	2015-03-27 18:23:31 +05:00
Sergey Sharybin	585dd26120	Cycles: Code cleanup, prepare for strict C++ flags	2015-03-27 18:23:31 +05:00
Jens Verwiebe	9fc1a29de3	Fix 2 typos ( shakin' hands )	2015-03-25 16:56:51 +01:00
Sergey Sharybin	22dfb50622	Fix T44128: Ray visibility only enables diffuse if glossy is also enabled Issue was caused by accident in `c8a9a56` which not only disabled glossy reflection if Glossy visibility is disabled, but also Diffuse reflection. Quite safe and should go to final release branch.	2015-03-25 14:53:20 +05:00
Sergey Sharybin	87cff57207	Fix T44123: Cycles SSS renders black in recent builds Issue was introduced in 01ee21f where i didn't notice *_setup() function only doing partial initialization, and some of parameters are expected to be initialized by callee function. This was hitting only some setups, so tests with benchmark scenes didn't unleash issues. Now it should all be fine. This is to go to the 2.74 branch and we actually might re-AHOY.	2015-03-25 02:33:49 +05:00
Sergey Sharybin	ed7e593a4b	Fix T43926: Volume scatter: intersecting objects GPU rendering artifacts Fix T44007: Cycles Volumetrics: block artifacts with overlapping volumes The issue was caused by uninitialized parameters of some closures, which lead to unpredictable behavior of shader_merge_closures().	2015-03-23 12:48:33 +05:00
Sergey Sharybin	61eab743f1	Cycles: Optimization for CMJ in CUDA kernels Two things: - Use intrinsics for clz/ctz (ctz is implemented via ffs()). - Use faster sqrt() function which precision is enough for integer values.	2015-03-13 12:38:14 +05:00
Thomas Dinges	3db0e1ef6a	Cycles: Simplify volume light connect code.	2015-03-13 00:09:13 +01:00
Thomas Dinges	0ed914a194	Cleanup: Use differential helper class.	2015-03-12 23:35:01 +01:00
Sergey Sharybin	dce16d57dc	Revert "Fix T43865: Cycles: Watertight rendering produces artifacts on a huge plane" The fix was really flacky, in terms during speed benchmarks i had abort() in the fallback block to be sure it never runs in production scenes, but that affected on the optimization as well. Without this abort there's quite bad slowdown of 5-7% on the renders even tho the Pleucker fallback was never run. This is all weird and for now reverting the change which affects on all the production scenes and will look into alternative fixes for the original issue with precision loss on huge planes. This reverts commit `9489205c5c`.	2015-03-12 18:24:53 +05:00
Thomas Dinges	064fa4baae	Cycles / Decoupled Ray Marching: Skip consecutive empty steps. This merges consecutive empty steps in the decoupled record function, which can lead to fewer iterations in the scatter functions. Only helps slightly though (1%), but doesn't hurt to have this. Differential Revision: https://developer.blender.org/D873	2015-03-12 13:50:12 +01:00
Sv. Lockal	c8fb488b08	Fix T41066: An actual fix for curve intersection on FMA-enabled CPUs	2015-03-07 16:20:34 +00:00
Sergey Sharybin	9489205c5c	Fix T43865: Cycles: Watertight rendering produces artifacts on a huge plane The issue was caused by numerical instability whrn having ray origin close to a huge triangle, which could have aused bad ray distance check. Watertight Woop intersection isn't really addressing such cases, it's dealing with small triangles far away from the ray origin instead, so it's a bit tricky yo make it working reliably. While we're quite close to the release it's safer to do check in Pleaucker coordinates if ray close to a huge triangle. Likely this additional check combined with some other tweaks to the code doesn't cause measurable slowdown in the scenes tested here. After the release we can play a bit more with this code in order to make it more stable without Pleucker fallback.	2015-03-05 18:55:30 +05:00
Sergey Sharybin	d544bc5cd5	Cycles: Fix embarrassing type remained after getting rid of utility SWAP()	2015-03-04 00:16:21 +05:00
Sergey Sharybin	ed5df50192	Cycles: Fix/workaround for toggling world MIS causing CUDA to fail Seems it's just another issue with the compiler, worked around by explicitly telling not to inline some function. In theory we can unify this with CPU, but we're quite close to the release so better be safe than sorry.	2015-03-03 18:48:37 +05:00
Thomas Dinges	60679a171d	Revert "Cleanup: Simplify camera sample motion blur code." This reverts commit `8197f0bb64`.	2015-02-26 13:27:02 +01:00
Thomas Dinges	8197f0bb64	Cleanup: Simplify camera sample motion blur code.	2015-02-26 10:30:01 +01:00
Sergey Sharybin	a585cbd2af	Fix T43783: Cycles clipping doesn't match viewport when camera is inside volume Ray length adjustment got lost in some refactor commit back to 2.71 days.	2015-02-24 13:07:52 +05:00
Dalai Felinto	abd630de62	Disable Bake Jitter code (recently added) The following commits were supposed to add anti-alias and help with OSL baking: `7b16fda379` `1b92dfa961` However they introduced other issues (artifacts mostly), see T43550 . Leaving the code ifdef'ed for now.	2015-02-23 17:50:44 -03:00
Thomas Dinges	97422ea64f	Cleanup: Simplify brick texture code a bit.	2015-02-23 16:49:50 +01:00
Sergey Sharybin	578cc2143d	Cycles: Add note about autodiff in OSL wireframe shader	2015-02-21 17:31:41 +05:00

1 2 3 4 5 ...

1137 Commits