test2

Author	SHA1	Message	Date
Sergey Sharybin	34c3beb339	Cycles: Fix missing node distance update when only two child intersected in QBVH	2015-06-12 10:06:46 +02:00
Sergey Sharybin	596eadf0e1	Cycles: Add debug pass which shows number of instance pushes during camera ray intersection TODO: We might want to refactor debug passes into PASS_DEBUG and some debug_type (similar to Blender's side passes) to avoid issue of running out of bits.	2015-06-12 00:12:03 +02:00
Sergey Sharybin	b3cc602adc	Cycles: Remove meaningless debug traversal steps increment from QBVH volume code	2015-06-11 23:54:57 +02:00
Sergey Sharybin	f6748183a2	Cycles: Enable transparent shadows for experimental AMD kernel They're working just fine on AMD Tonga GPU and probably other architectures, let's enable it under the experimental feature set and see what exact system configuration gives issues.	2015-06-11 23:49:21 +02:00
Thomas Dinges	6a0a205cb4	Cycles: Simplify volume_phase_eval(). This simplification is safe, as the call to volume_phase_eval() is guarded behind a CLOSURE_IS_PHASE check, which is equal to CLOSURE_VOLUME_HENYEY_GREENSTEIN_ID. I don't think we will add more phase functions anytime soon, if at all.	2015-06-11 15:18:33 +02:00
Sergey Sharybin	2bd6de5bbb	Cycles: Add debug pass showing average number of ray bounces per pixel Quite straightforward implementation, but still needs some work for the split kernel. Includes both regular and split kernel implementation for that. The pass is not exposed to the interface yet because it's currently not really easy to have same pass listed in the menu multiple times.	2015-06-11 14:53:15 +02:00
Sergey Sharybin	c6c06285a7	Cycles: Remove requirement of using experimental kernel for hair and blur on AMD Those features are not selectively compiled, so there's no real benefit of hiding them under the experimental feature set.	2015-06-08 11:15:39 +02:00
Sergey Sharybin	27ed75271c	Cycles: Make hair, object and motion blur selectively compiled into OpenCL These features are now based on the scene settings, so scenes without those features used are rendered even faster. This gives about 30% speedup on the AMD A10 APU here, but at the same time it does not mean such an improvement will happen on all the hardware. That being said, the Tonga device here seems to have no measurable difference. In any case it seems handy to have for the future, when we'll want to support SSS in the kernel or to port selective compilation/split kernel to CUDA devices.	2015-06-08 11:15:39 +02:00
Sergey Sharybin	f0a0b1eaac	Cycles: Assert in the cases when SVM node was not handled This will help figuring out cases when node was not properly handled by the SVM by aborting execution on CPU, where all the nodes are expected to be supported.	2015-06-01 19:49:52 +05:00
Sergey Sharybin	ecd4ee75af	Cycles: Implement selective nodes compilation This commit finishes initial selective nodes compilation into kernel, which helps a lot performance-wise for AMD OpenCL kernels. Split by node groups is based on statistics from simple scenes like BMW and more complex scenes like mango and gooseberry production files. Further tweaks are always possible, but it should be a good starting point. TODO: Still need to ignore unused nodes when calculating requested shader features.	2015-06-01 19:49:52 +05:00
Sergey Sharybin	c0235da53c	Cycles: Fix some typos in the selective modes compilation	2015-06-01 19:49:52 +05:00
Sergey Sharybin	4d8cf1329d	Cycles: Add bump feature for selective nodes compilation For now it is unused in the kernel, actual usage will come with the next commits.	2015-06-01 19:49:52 +05:00
Thomas Dinges	3511e2d6ae	Cycles: Enable Object Motion on AMD OpenCL. Like Camera Motion, only available in the Experimental kernel. This should be it for the upcoming release, we now support almost everything, apart from Transparent Shadows, SSS and Volume.	2015-05-28 22:10:53 +02:00
Thomas Dinges	46d8bcb617	Cleanup: Remove unused Noise Basis texture code. Same as last commit, code is unused and this one actually would have required some fixes, as these variants output values outside the 0-1 value range, which doesn't fit Cycles shader design.	2015-05-28 01:07:37 +02:00
Thomas Dinges	20f6a0f2d7	Cleanup: Remove unused Voronoi texture code. Let's finally delete this code, after 4 years of being unused, there really is no excuse anymore. If we decide to extend the procedural textures in SVM, we can do this anytime in the future.	2015-05-28 00:36:33 +02:00
Sergey Sharybin	92022218c2	Cycles: Code cleanup, split kernel	2015-05-27 13:08:17 +05:00
Sergey Sharybin	84ad20acef	Fix T44833: Can't use ccl_local space in non-kernel functions This commit re-shuffles code in split kernel once again and makes it so the common parts which are in the headers are only responsible for doing all the work needed for the specified ray index. Getting the ray index, checking for its validity and enqueuing tasks are now happening in the device-specific part of the kernel. This actually makes sense because enqueuing is indeed device-specific and e.g. with CUDA we'll want to enqueue kernels from kernel and avoid CPU roundtrip. TODO: - Kernel comments are still placed in the common header files, but since queue related stuff is not passed to those functions those comments might need to be split as well. Just currently read them considering that they're also covering the way how all devices are invoking the common code path. - Arguments might need to be wrapped into KernelGlobals, so we don't need to pass all of them around as function arguments.	2015-05-26 22:54:02 +05:00
Sergey Sharybin	6245f4a39c	Cycles: Enable advanced shading for NVidia OpenCL kernel It was kept disabled due to render artifacts which were in fact caused by bad memory access, which is fixed in the previous commit. We now also can make it enabled in regular AMD split kernel after someone tests the updated code.	2015-05-26 21:29:21 +05:00
Campbell Barton	2c3c477223	Cleanup: warning, spelling	2015-05-26 16:46:33 +10:00
Sergey Sharybin	62f2d9b566	Cycles: Fix compilation error of split kernel The code was failing to compile on runtime because of some path differences, and it seems we don't need to specify full path to the file which originally seemed to be needed to make include directives expansion working correct.	2015-05-25 14:18:01 +05:00
Thomas Dinges	a3ef51bba5	Fix T44833, OpenCL compile error on AMD. This was broken after the kernel file restructure. Variables allocated in the __local address space can only be defined inside a __kernel function. We probably need to solve this a bit differently once we do the CUDA kernel split, but this fix should be good enough until then.	2015-05-25 01:02:06 +02:00
Sergey Sharybin	2c503d8303	Cycles: Restructure kernel files organization Since the kernel split work we're now having quite a few of new files, majority of which are related to the kernel entry points. Keeping those files in the root kernel folder will eventually make it really hard to follow which files are actual implementation of Cycles kernel. Those files are now moved to kernel/kernels/<device_type>. This way adding extra entry points will be less noisy. It is also nice to have all device-specific files grouped together. Another change is in the way how split kernel invokes logic. Previously all the logic was implemented directly in the .cl files, which makes it a bit tricky to re-use the logic across other devices. Since we'll likely be looking into doing same split work for CUDA devices eventually it makes sense to move logic from .cl files to header files. Those files are stored in kernel/split. This does not mean the header files will not give error messages when tried to be included from other devices and their arguments will likely be changed, but having such separation is a good start anyway. There should be no functional changes. Reviewers: juicyfruit, dingto Differential Revision: https://developer.blender.org/D1314	2015-05-22 16:31:34 +05:00
Thomas Dinges	53eab562b4	Cleanup: Remove some outdated comments related to split kernel.	2015-05-21 20:32:20 +02:00
Sergey Sharybin	7938bd1877	Cycles: Remove OSL from split headers Split kernel is mainly useful for GPUs which can not support OSL in visible future anyway.	2015-05-21 16:12:50 +05:00
Sergey Sharybin	329f704601	Cycles: Move utility atomics function to util_atomic.h No functional changes, just better to keep all atomic function in a single place, they might become handy later.	2015-05-21 16:12:50 +05:00
Sergey Sharybin	148ed4e05e	Cycles: Cleanup, synchronize name across file name, program and kernel names	2015-05-20 23:10:07 +05:00
Thomas Dinges	dae566894a	Cycles / OpenCL: Enable Camera Motion and Hair for AMD. Only enabled for the Experimental kernel though, so the feature set must be changed in the UI to use the features.	2015-05-17 18:46:25 +02:00
Campbell Barton	daeb3069cf	Cleanup: typos	2015-05-17 16:09:32 +10:00
Campbell Barton	31e96cbf96	Cleanup: style, spelling	2015-05-15 23:38:53 +10:00
Sergey Sharybin	c86a6f3efb	Cycles: Enable CMJ for Intel/NVidia experimental split kernels It is still disabled for AMD devices since can't test if it works fine on this hardware.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	2ab909a88c	Cycles: Make experimental kernel build option more generic Previously it was explicitly mentioning it's NVidia kernel related option, but in fact it's also handy for the OpenCL kernel.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	c9e8888f87	Cycles: Disable bake OpenCL kernel for NVidia devices prior to sm_30 Driver fails to compile kernel in reasonable time for those devices here, so for easier testing of the OpenCL split kernel work disabling bake kernel for now.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	3c10ec96b5	Cycles: Enable object motion blur on Intel OpenCL platform This required allocating some memory related to object transform needed by ShaderData and currently it is done for all the platforms. Since we're targeting full feature-complete platforms this is rather acceptable at this point and in the future we'll do selective NO_HAIR/NO_SSS/NO_BLUR kernels. This is experimental still and in fact there're some major issues on NVidia platform and it's not really clear if it's a bug in compiler, some uninitialized variable or other kind of issue.	2015-05-15 00:48:12 +05:00
Sergey Sharybin	f6c6dd44de	Cycles: Remove meaningless ifdef checks for features in device_opencl This file was actually checking for features enabled on CPU and surely all of them were enabled, so removing them does not cause any difference. Ideally we'll need to do runtime feature detection and just pass some stuff as NULL to the kernel, or maybe also have variadic kernel entry points which is also possible quite easily.	2015-05-14 23:44:19 +05:00
Sergey Sharybin	5c34266383	Cycles: Enable camera motion blur in split kernel for Intel/NVidia It's good for testing and seems to work quite reliably here. This is probably not totally cheap in terms of performance, but we could solve this quite easily by selective kernel compilation once other things are tested/proved to be reliable.	2015-05-14 23:35:19 +05:00
Sergey Sharybin	3d3d805b64	Cycles: Prepare code for OpenCL camera/motion blur The kernels are now compiling just fine, but there're some issues during rendering. This is still to be investigated.	2015-05-14 18:48:56 +05:00
Sergey Sharybin	5a63edb929	Cycles: Use special _auto versions of transform function in motion blur code Doing this as a separate commit so it's easier to revert in the future, once OpenCL 2.0 is becoming our requirement.	2015-05-14 18:48:56 +05:00
Sergey Sharybin	79aa50dc53	Cycles: Enable hair for split kernels when using Intel or NVidia drivers Apart from simply enabling these features, the needed changes to the code were made. Technical change, replacing SD access from "simple" structure to SOA.	2015-05-14 18:48:56 +05:00
Thomas Dinges	fc31bae66f	Cleanup: Avoid temp variable in portal sampling code.	2015-05-13 19:54:52 +02:00
Thomas Dinges	0a6e32173e	Cleanup / Cycles: De-Duplicate Portal data fetch and side check.	2015-05-13 16:05:30 +02:00
Sergey Sharybin	583fd3af65	Cycles: Fix typo in global space version of normal transform It was using direction transform, which is obviously wrong.	2015-05-10 00:53:32 +05:00
Sergey Sharybin	2840a5de8f	Cycles: Workaround for AMD compiler crashing building the split kernel It's a bug in the compiler, but it's nice to have a working kernel until that bug is fixed.	2015-05-09 19:56:38 +05:00
George Kyriazis	7f4479da42	Cycles: OpenCL kernel split This commit contains all the work related to the AMD megakernel split work which was mainly done by Varun Sundar, George Kyriazis and Lenny Wang, plus some help from Sergey Sharybin, Martijn Berger, Thomas Dinges and likely someone else which we're forgetting to mention. Currently only AMD cards are enabled for the new split kernel, but it is possible to force split opencl kernel to be used by setting the following environment variable: CYCLES_OPENCL_SPLIT_KERNEL_TEST=1. Not all the features are supported yet, and that being said no motion blur, camera blur, SSS and volumetrics for now. Also transparent shadows are disabled on AMD device because of some compiler bug. This kernel also only implements regular path tracing and supporting branched one will take a bit. Branched path tracing is exposed to the interface still, which is a bit misleading and will be hidden there soon. More features will be enabled once they're ported to the split kernel and tested. Neither regular CPU nor CUDA has any difference, they're generating the same exact code, which means no regressions/improvements there. Based on the research paper: https://research.nvidia.com/sites/default/files/publications/laine2013hpg_paper.pdf Here's the documentation: https://docs.google.com/document/d/1LuXW-CV-sVJkQaEGZlMJ86jZ8FmoPfecaMdR-oiWbUY/edit Design discussion of the patch: https://developer.blender.org/T44197 Differential Revision: https://developer.blender.org/D1200	2015-05-09 19:52:40 +05:00
Sergey Sharybin	6fc1669679	Cycles: Initial work towards selective nodes support compilation The goal is to be able to compile kernel with nodes which are actually needed to render current scene, hence improving performance of the kernel. The idea is: - Have few node groups, starting with a group which contains nodes which are used really often, and then couple of groups which will be extension of this one. - Have feature-based nodes disabling, so it's possible to disable nodes related to features which are not used with the currently used nodes group. This commit only lays down needed routines for this approach, actual split will happen later after gathering statistics from bunch of production scenes.	2015-05-09 19:22:16 +05:00
Sergey Sharybin	5068f7dc01	Cycles: Add utility function to graph to query number of closures used in it Currently unused but will be needed soon for the split kernel work.	2015-05-09 19:13:32 +05:00
Sergey Sharybin	d69c80f717	Cycles: Presumably correct workaround for addrspace in camera motion blur	2015-05-09 19:04:19 +05:00
Sergey Sharybin	c9133778cf	Cycles: Add CPU compat headers to some of the OSL implementation files This header was already included into some of the implementation files, and this change is needed for some upcoming changes in the way how kernel_types.h works.	2015-05-09 19:04:16 +05:00
Thomas Dinges	900fc43bb4	Cleanup: Remove unused ray type flags. They were added for completeness, but it seems we don't need them.	2015-05-08 12:10:26 +02:00
Sergey Sharybin	9ca2b76a9f	Cycles: Cleanup, make it more clear what endif closes what ifdef	2015-05-07 15:02:43 +05:00
Campbell Barton	165598e49e	Correct typo: ifdef'd now, but obviously wrong	2015-05-07 10:12:12 +10:00

1188 Commits