test2

Author	SHA1	Message	Date
Thomas Dinges	a613290775	Cycles / CUDA: Workaround to make sm_52 (Maxwell) cards work. * sm_52 can run a sm_50 kernel, so tell runtime detection to use that until we build a dedicated sm_52 kernel.	2014-10-05 04:13:40 +02:00
Thomas Dinges	dde740bcd7	Cycles / CUDA: Change inline rules for BVH intersection functions. * On sm_30 and above there is no change (was not inlined already before), this just fixes a speed regression from yesterday. `6359c36ba4` * On sm_2x (tested with sm_21), I get a nice 8% speedup in the bmw scene with this. As a bonus, cubin compilation time and memory usage is significantly reduced. Regular cubin size went from 2.5MB to 2.0MB, Experimental one from 3.8MB to 2.5MB.	2014-10-05 03:53:51 +02:00
Sergey Sharybin	15969e8a30	Cycles: Fix wrong ifdef check around shadows record all	2014-10-04 16:21:05 +02:00
Sergey Sharybin	27d660ad20	Cycles: Add support for debug passes Currently only summed number of traversal steps and intersections used by the camera ray intersection pass is implemented, but in the future we will support more debug passes which would help checking what things makes the scene slow. Example of such extra passes could be number of bounces, time spent on the shader tree evaluation and so. Implementation from the Cycles side is pretty much straightforward, could only mention here that it's a build-time option disabled by default. From the blender side it's implemented as a PASS_DEBUG with several subtypes possible. This way we don't need to create an extra DNA pass type for each of the debug passes, saving us a bits. Reviewers: campbellbarton Reviewed By: campbellbarton Differential Revision: https://developer.blender.org/D813	2014-10-04 19:00:26 +06:00
Thomas Dinges	6359c36ba4	Cycles: Remove a workaround for Titan GPUs, not needed anymore with the latest CUDA compiler.	2014-10-04 01:29:08 +02:00
Thomas Dinges	cdbac018a2	Cycles, some tweaks to scene_intersect_shadow_all() * Function returns a bool, not an uint. * Remove GPU ifdefs, this is CPU only due to malloc / qsort.	2014-10-03 20:41:38 +02:00
Thomas Dinges	02ffed4052	Cleanup: Remove some unused / unreferenced functions for perdiodic perlin noise.	2014-10-03 18:00:45 +02:00
Thomas Dinges	3aa65574f5	Cycles / OSL: Make the signed/unsigned Perlin parameter more self explaining.	2014-10-03 17:51:21 +02:00
Thomas Dinges	dc1ca0c94f	Cycles: Fix OpenCL compile after new Volume BVH introduction and add some comments.	2014-10-03 17:23:45 +02:00
Thomas Dinges	5e10392e9f	Cycles: Missing volume traversal header in cmake for GPU compilation.	2014-10-03 17:11:00 +02:00
Thomas Dinges	4b2fadeaba	Cycles: Remove Westin closure. Was hooked up last year for testing purposes, as we already had some code for it, but the closure itself is not really good nor really useful, so let's remove it.	2014-10-03 16:03:49 +02:00
Thomas Dinges	02f58ac623	Cleanup: Spelling.	2014-10-03 15:28:52 +02:00
Sergey Sharybin	1e4d99368b	Cycles: Use more accurate implementation of erf() and erfinv() This functions are orders of magnitude more accurate than the old ones, and they're around the same complexity to compute.	2014-10-03 18:28:44 +06:00
Sergey Sharybin	0fa7e4c853	Cycles: Decouple object flags update to a separate update step This way there's much less cross-references between objects and meshes device update functions. The only thing remained s the object bounds calculation which is needed by bvh update. This could also be decoupled, but it's not that crucial yet because its's how it used to be for ages now.	2014-10-03 12:13:41 +02:00
Sergey Sharybin	502f6d538d	Fix T41920: Changing Use Alpha settings doesn't refresh viewport properly	2014-10-03 11:27:05 +02:00
Sergey Sharybin	a654512356	Cycles: Implement preliminary test for volume stack update from SSS This adds an AABB collision check for objects with volumes and if there's a collision detected then the object will have SD_OBJECT_INTERSECTS_VOLUME flag. This solves a speed regression introduced by the fix for T39823 by skipping volume stack update in cases no volumes intersects the current SSS object.	2014-10-03 10:52:04 +02:00
Sergey Sharybin	b86f199a98	Cycles: Fix for non-initialized variable	2014-10-03 10:44:24 +02:00
Sergey Sharybin	527d049c5c	Cycles: Make camera-in-volume an official feature This means it's no longer needed to enable experimental feature set in order to have proper camera in volume support. And this also means if there's something wrong going on, or if there's speed regression for cases when camera is obviously not in the volume -- this issues are to be reported and handled in the regular matter. Happy blending!	2014-10-03 12:55:31 +06:00
Sergey Sharybin	7dabfb2048	Cycles: Speedup of kernel side camera-in-volume detection The idea is to only count intersections with objects which has volumetric shader and ignore all other objects. This is probably as fast as we can go without involving some forth level magic.	2014-10-03 12:55:31 +06:00
Sergey Sharybin	faa10d1ced	Cycles: optimization of panoramic camera in volume Now we do much better preliminary check for panoramic camera is inside the volume object boundings. Also we're now cacheing the has_volume in the mesh, which makes it unneeded iterations for each object's shaders. Should be no functional changes, just faster sync and panoramic-in-volume rendering.	2014-10-02 20:45:30 +02:00
Campbell Barton	927099ceb8	Cleanup: style	2014-09-30 02:04:34 +10:00
Sergey Sharybin	d41f99ac57	Cycles: Correct object flags bitfield, was missing negative scale there It's quite a few of circumstances to be met to hit the case when render wouldn't be correct. Better to be ported to the final release.	2014-09-28 14:13:36 +06:00
Sergey Sharybin	21825c4359	Cycles: Avoid temp variable in camera-in-volume check Was a left-over from some experiments, no need it with the current implementation, and likely wouldn't need in the future.	2014-09-28 02:35:37 +06:00
Sergey Sharybin	53b05e4f06	Cycles: Cleanup of the SSS volume stack update code Was a leftover after the changed scene_intersect() which used to be ifdefed depending on the __HAIR__ in the original patch.	2014-09-28 02:19:17 +06:00
Sergey Sharybin	4832538ad0	Cycles: Keep STACK_MAX_HITS private in kernel_shadow This way adding record_all for other things becomes easier and doesn't lead to naming conflicts.	2014-09-26 14:23:48 +06:00
Thomas Dinges	ff4a867dc0	Code style.	2014-09-26 02:04:40 +02:00
Brecht Van Lommel	0b12e61040	OpenNL: modify SuperLU to use doubles rather than floats, for better precision. This helps to improve the accuracy of UV unwrapping and laplacian deform for high poly meshes, which could get warped quite badly. It's not much slower, doubles are pretty fast on modern CPUs, but it does double memory usage. This seems acceptable as otherwise high poly meshes would not work correctly anyway. Fixes T39004.	2014-09-26 00:04:10 +02:00
Brecht Van Lommel	32f83a298c	Fix build errors in atomic ops and warning in aligned malloc on OS X.	2014-09-25 23:59:38 +02:00
Sergey Sharybin	2307bd7174	Cycles: Keep ccl_always_inline always inlining the stuff It works around strange shading bug when building with MSVC. If such weirdeness continues, we perhaps would need to use proper inline flags all the time. Anyway, lets see how things will behave now.	2014-09-26 02:03:49 +06:00
Sergey Sharybin	0929821590	Cycles: Accidentally inverted the logic of NDEBUG macro	2014-09-26 01:34:43 +06:00
Sergey Sharybin	4735fdc280	Cycles: Better feedback about experimental features being used Instead of having a label which basically duplicated the information about experimental feature set being used (which had a bug because it claimed experimental GPU kernel is used even if compute device is CPU btw) now we've got an enum item icon. So once you switched to experimental feature set you'll see an exclamation mark icon in the enum, so you know something might be unstable or slow.	2014-09-26 01:02:28 +06:00
Sergey Sharybin	faf4f29cc0	Guardedalloc: Implement atomic peak memory update Updating maximum requires a bit of a cycle which usually does 1 iteration only, sometimes needs a bit more but seems there's no speed regressions. For now the code is commented out. This way it's easier for others to verify there's no speed regressions. Reviewers: campbellbarton Differential Revision: https://developer.blender.org/D626	2014-09-26 00:40:53 +06:00
Sergey Sharybin	37f3843ab0	Atomics: Add CAS (compare-and-swap) functions	2014-09-26 00:33:04 +06:00
Sergey Sharybin	b90d849171	Cycles: Fix for the MSVC which doesn't have default osteram constructor	2014-09-26 00:27:04 +06:00
Thomas Dinges	38a54f4e01	Cycles: Make CUDA backend aware of sm_52 (Maxwell). In order to compile the new kernel you need to specify sm_52 in SCons / CMake, and use CUDA Toolkit 6.5.19, from here: https://developer.nvidia.com/cuda-downloads-geforce-gtx9xx Note: sm_52 is not enabled per default yet, so it won't be bundled with the Buildbot builds. That will be addressed later.	2014-09-25 20:07:50 +02:00
Sergey Sharybin	fe731686fb	Cycles: Add support for cameras inside volume Basically the title says it all, volume stack initialization now is aware that camera might be inside of the volume. This gives quite noticeable render time regressions in cases camera is in the volume (didn't measure them yet) because this requires quite a few of ray-casting per camera ray in order to check which objects we're inside. Not quite sure if this might be optimized. But the good thing is that we can do quite a good job on detecting whether camera is outside of any of the volumes and in this case there should be no time penalty at all (apart from some extra checks during the sync state). For now we're only doing rather simple AABB checks between the viewplane and volume objects. This could give some false-positives, but this should be good starting point. Need to mention panoramic cameras here, for them it's only check for whether there are volumes in the scene, which would lead to speed regressions even if the camera is outside of the volumes. Would need to figure out proper check for such cameras. There are still quite a few of TODOs in the code, but the patch is good enough to start playing around with it checking whether there are some obvious mistakes somewhere. Currently the feature is only available in the Experimental feature sey, need to solve some of the TODOs and look into making things faster before considering the feature is ready for the official feature set. This would still likely happen in current release cycle. Reviewers: brecht, juicyfruit, dingto Differential Revision: https://developer.blender.org/D794	2014-09-25 23:28:01 +06:00
Sergey Sharybin	ccc5983e2b	Fix T39823: SSS scatter doesn't update volume stack, causing shading artifacts Basically the title says it all, we need to update volume stack when doing ray scatter for SSS. This leads to speed regressions in cases scene does have both volume and SSS (performance in case there's no SSS or no volume should be the same). We might try optimizing kernel_path_subsurface_update_volume_stack() a bit by either recording all intersections or using some more appropriate visibility flags. Reviewers: brecht, juicyfruit, dingto Differential Revision: https://developer.blender.org/D795	2014-09-25 23:17:45 +06:00
Sergey Sharybin	d165b1b266	Cycles: Add method to dump current shader graph to the graphiz file This is rather useful to see how good optimization went and so. Currently uses quite simple notation: shader nodes are nodes on the graph, connects between graph nodes are named by the sockets names, so i.e. connection between BSDF and Mix would be named bsdf:closure1. Could be improved in the feature to draw fancier graph, but it's good enough already. Use in the following way: - To create graphix file call graph->dump_graph("graph.dot") - To visualize the grapf call: dot -Tpng graph.dot -o graph.png	2014-09-25 17:08:32 +06:00
Sergey Sharybin	b3d414cc21	Cycles: Don't inline functions for debug CPU kernel Nobody will use debug mode for benchmarks anyway and this way it's much easier to set breakpoints on inlined functions to catch all their usages.	2014-09-25 17:08:32 +06:00
Sergey Sharybin	13d8671a1a	Cycles: Add support of Glog logging This commit makes it possible to use Glog library for the debug logging. For now only possible when using CMake and in order to use the logging the WITH_CYCLES_LOGGING configuration variable is to be enabled. When this option is not enabled or when using Scons there's no difference in Cycles behavior at all, when using logging and no output to the console impact is gonna to be minimal. This is done in order to make it possible to have debug logging persistent in code (without need to add it when troubleshooting some bug and removing it afterwards). For now actual logging is not placed yet, only all the functions needed for the logging are written and so.	2014-09-25 17:08:32 +06:00
Jens Verwiebe	faaf0c719f	OSX: ensure windows are restored at their saved position, meaning here we need to take docksize into account	2014-09-24 20:55:48 +02:00
Martijn Berger	25ec0d97f9	make "tri_shader" an int instead of a float tri_shader does no longer need to a float. Reviewers: dingto, sergey Reviewed By: dingto, sergey Subscribers: dingto Projects: #cycles Differential Revision: https://developer.blender.org/D789	2014-09-24 13:34:28 +02:00
Thomas Dinges	cbffc7499e	Cycles: Shader Graph Optimization for Mix RGB nodes. Basically the same as AC2c58e96685e8, but for Mix RGB Shaders, in case we use the Mix type. This way the node can be used as texture switch for example, setting the Factor to 0.0 or 1.0, without wasting extra memory / render time.	2014-09-24 12:52:36 +02:00
Thomas Dinges	1b5ec32ed9	Cleanup: Avoid some defines for scene_intersect(), related to Min Width.	2014-09-24 11:32:29 +02:00
Sergey Sharybin	362b0239fe	Fix typo in previous commit Buttons are too much close to each other on the keyboards!	2014-09-23 23:09:44 +06:00
Sergey Sharybin	e422e56db0	Move versioning code under the subversion check	2014-09-23 22:56:37 +06:00
Thomas Dinges	2ed1b67835	Fix T41912, OpenCL compile error when building without __SVM__ Thanks to Vitaliy Filippov for the patch.	2014-09-23 12:54:16 +02:00
Thomas Dinges	31da72545e	Cycles: Backward compatibility code for the Clamp splitting in 2.70. If an older file (< 270) had clamp enabled, with e.g. a value of 2.0, Direct and Indirect clamp are now automatically set to 2.0 as well.	2014-09-19 22:25:36 +02:00
Thomas Dinges	0542442310	Cycles: Add a UI warning, in case the experimental GPU kernel is used. The experimental kernel is slower and can cause issues on some cards still, so better communicate it well.	2014-09-19 22:25:35 +02:00
Thomas Dinges	75b61f5346	Cycles: Remove unused Mix Shaders from the ShaderGraph, instead of only relinking. Differential revision: https://developer.blender.org/D796	2014-09-19 13:21:25 +02:00

1 2 3 4 5 ...

4478 Commits