test2

Author	SHA1	Message	Date
Sergey Sharybin	b406b7be00	Cycles: Mark which CUDA device is used for display It is really handy to know which one is display when having two cards of same type in the machine.	2016-06-03 11:52:08 +02:00
Sergey Sharybin	d2bb0e660b	Fix T46207: Slow OpenCL GPU bake and blown out baking Cycles render	2016-05-31 17:48:42 +02:00
Mai Lavelle	4388b29e98	Cycles: Add human readable sizes to debug output Some of these values can get quite large and are hard to read, adding this makes it easy to read them at a glance. Reviewed By: sergey Differential Revision: https://developer.blender.org/D2039	2016-05-31 06:13:54 -04:00
Brecht Van Lommel	f7c28a66e2	Fix Cycles compile errors with GCC due to double promotion as errors.	2016-05-22 19:17:22 +02:00
Brecht Van Lommel	ec51175f1f	Code refactor: add generic Cycles node infrastructure. Differential Revision: https://developer.blender.org/D2016	2016-05-22 17:29:24 +02:00
Thomas Dinges	dedc995018	Cycles / CUDA: Don't use bundled kernel if Adaptive is enforced by the user.	2016-05-19 16:32:57 +02:00
Thomas Dinges	c9f1ed1e4c	Cycles: Add support for bindless textures. This adds support for CUDA Texture objects (also known as Bindless textures) for Kepler GPUs (Geforce 6xx and above). This is used for all 2D/3D textures, data still uses arrays as before. User benefits: * No more limits of image textures on Kepler. We had 5 float4 and 145 byte4 slots there before, now we have 1024 float4 and 1024 byte4. This can be extended further if we need to (just change the define). * Single channel textures slots (byte and float) are now supported on Kepler as well (1024 slots for each type). ToDo / Issues: * 3D textures don't work yet, at least don't show up during render. I have no idea what's wrong yet. * Dynamically allocate bindless_mapping array? I hope Fermi still works fine, but that should be tested on a Fermi card before pushing to master. Part of my GSoC 2016. Reviewers: sergey, #cycles, brecht Subscribers: swerner, jtheninja, brecht, sergey Differential Revision: https://developer.blender.org/D1999	2016-05-19 13:14:37 +02:00
Sergey Sharybin	7b356a8565	Cycles: Reduce amount of malloc() calls from the kernel This commit makes it so malloc() is only happening once per volume and once per transparent shadow query (per thread), improving scalability of the code to multiple CPU cores. Hard to measure this with a low-bottom i7 here currently, but from quick tests seems volume sampling gave about 3-5% speedup. The idea is to store allocated memory in kernel globals, which are per thread on CPU already. Reviewers: dingto, juicyfruit, lukasstockner97, maiself, brecht Reviewed By: brecht Subscribers: Blendify, nutel Differential Revision: https://developer.blender.org/D1996	2016-05-18 10:14:24 +02:00
Thomas Dinges	29a17d54da	Fix CUDA MEMCPY condition, it should only copy 3D, 2D or 1D. Found by Brecht, thanks!	2016-05-17 00:37:34 +02:00
Thomas Dinges	99d861169f	Cycles / Requested Features: Volume was missing in logging print.	2016-05-17 00:36:22 +02:00
Thomas Dinges	4a4f043bc4	Cycles: Add support for single channel float textures on CPU. Until now, single channel textures were packed into a float4, wasting 3 floats per pixel. Memory usage of such textures is now reduced by 3/4. Voxel Attributes such as density, flame and heat benefit from this, but also Bumpmaps with one channel. This commit also includes some cleanup and code deduplication for image loading. Example Smoke render from Cosmos Laundromat: http://www.pasteall.org/pic/show.php?id=102972 Memory here went down from ~600MB to ~300MB. Reviewers: #cycles, brecht Differential Revision: https://developer.blender.org/D1981	2016-05-11 21:58:34 +02:00
Sergey Sharybin	92774ff792	Cycles: Use explicit qualifier for single-argument constructors Almost in all cases we want such constructors to be explicit, there are exceptions but only in few places.	2016-05-11 16:51:14 +02:00
Thomas Dinges	d6555d936c	Cleanup: Avoid duplicative defines for CPU textures, use the ones from util_texture.h Also includes some further byte -> byte4 renaming, missed that in last commit.	2016-05-09 09:16:41 +02:00
Thomas Dinges	1c46ecd86b	Cleanup: Remove unneeded (void) line, we don't have ifdefs here anymore.	2016-05-07 15:55:28 +02:00
Thomas Dinges	4422b3f919	Some fixes for CUDA runtime compile: * When Baking wasn't used we got an error. * On top of Volume Nodes (NODES_FEATURE_VOLUME), we now also check if we need volume sampling code, so we can disable that as well and save some further compilation time.	2016-05-06 23:13:33 +02:00
Thomas Dinges	734d1aec3f	Cycles: Make CUDA adaptive feature compile a Debug flag. If the CUDA Toolkit is installed and the user is on Linux, adaptive, feature based CUDA runtime compile is now possible to enable via: * Environment flag CYCLES_CUDA_ADAPTIVE_COMPILE or * Debug menu (Debug value 256) in the Cycles UI.	2016-05-06 23:13:33 +02:00
Thomas Dinges	3807bcb3a8	Cleanup: Rename texture slots to float4 and byte, to distinguish from future float (single channel) and half_float slots. Should be no functional changes, tested CPU and CUDA.	2016-05-06 14:37:35 +02:00
Sergey Sharybin	e50d229273	Fix T47794: Point density sometime seems stretched when rendered on GPU	2016-04-20 14:42:19 +02:00
Sergey Sharybin	b20f12d835	Cycles: Some typo fixes	2016-03-12 15:01:20 +05:00
Sergey Sharybin	8cab327316	Cycles: Make CUDA 7.5 officially recommended This was a hard decision, because going newer CUDA toolkit makes rendering up to 5% slower. But on another hand, it solves major speed regressions (up to 30%) with branched path tracing on a top level cards. Neither of those regressions have a meaningful and sane workaround from the code itself. Toolkit 6.5 could still be used, but it's no longer recommended one.	2016-02-17 15:18:56 +01:00
Sergey Sharybin	c5e1781944	Cycles: Fix crash when trying to render after re-enabling the addon	2016-02-16 12:47:31 +01:00
Sergey Sharybin	1c4f21f85e	Cycles: Initial support of 3D textures for CUDA rendering Supports both smoke/fire and point density textures now. Reduces number of textures available for sm_20 and sm_21, but you have to compromise somewhere on such a limited hardware. Currently limited to linear interpolation only, and decoupled ray marching is not supported yet. Think those could be considered just a further improvement. Some quick example: https://developer.blender.org/F282934 Code is minimal and we can fully consider it a fix for missing support of 3D textures with CUDA. Reviewers: lukasstockner97, brecht, juicyfruit, dingto Reviewed By: brecht, juicyfruit, dingto Subscribers: mib2berlin Differential Revision: https://developer.blender.org/D1806	2016-02-15 21:26:29 +01:00
Sergey Sharybin	c8d2bc7890	Cycles: Always use guarded allocator of vectors We don't have vectors re-allocation happening multiple times from inside a loop anymore, so we can safely switch to a memory guarded allocator for vectors and keep track on the memory usage at various stages of rendering. Additionally, when building from inside Blender repository, Cycles will use Blender's guarded allocator, so actual memory usage will be displayed in the Space Info header. There are couple of tricky aspects of the patch: - TaskScheduler::exit() now explicitly frees memory used by `threads`. This is needed because `threads` is a static member which destructor isn't getting called on Blender's exit which caused memory leak print to happen. This shouldn't give any measurable speed issues, reallocation of that vector is only one of fewzillion other allocations happening during synchronization. - Use regular guarded malloc (not aligned one). No idea why it was made to be aligned in the first place. Perhaps some corner case tests or so. Vector was never expected to be aligned anyway. Let's see if we'll have actual bugs with this. Reviewers: dingto, lukasstockner97, juicyfruit, brecht Reviewed By: brecht Differential Revision: https://developer.blender.org/D1774	2016-02-12 15:43:26 +01:00
Sergey Sharybin	28604c46a1	Cycles: Make Blender importer more forward compatible Basically the idea is to make code robust against extending enum options in the future by falling back to a known safe default setting when RNA is set to something unknown. While this approach solves the issues similar to T47377, but it wouldn't really help when/if any of the RNA values gets ever deprecated and removed. There'll be no simple solution to that apart from defining explicit mapping from RNA value to Cycles one. Another part which isn't so great actually is that we now have to have some enum guards and give some explicit values to the enum items, but we can live with that perhaps. Reviewers: dingto, juicyfruit, lukasstockner97, brecht Reviewed By: brecht Differential Revision: https://developer.blender.org/D1785	2016-02-12 15:27:33 +01:00
Sergey Sharybin	10cc4ae359	Cycles: Fix typo in network device Spotted by jesterKing, thanks!	2016-02-11 13:05:55 +01:00
Sergey Sharybin	f25f7c8030	Cycles: Re-implement some utilities to avoid use of boost The title says it all actually, the idea is to make Cycles only requiring Boost via 3rd party dependencies like OIIO and OSL. So now there are only few places which still use Boost: - Foreach, function bindings and threading primitives. Those we can easily get rid with C++11 bump (which seems inevitable sooner or later if we'll want to use newer LLVM for OSL), - Networking devices There's no quick solution for those currently, but there are some patches around which improve serialization. Reviewers: juicyfruit, mont29, campbellbarton, brecht, dingto Reviewed By: brecht, dingto Differential Revision: https://developer.blender.org/D1764	2016-02-06 19:19:20 +01:00
Sergey Sharybin	3aa74828ab	Cycles: Cleanup, indentation and braces	2016-02-03 15:00:55 +01:00
Sergey Sharybin	9815f8a623	Cycles: Cleanup of OpenCL split kernel routines The idea is to switch from allocating separate buffers for shader data's structure of arrays to allocating one huge memory block and do some index trickery to make it accessed as SOA. This saves quite reasonable amount of lines of code in device_opencl and also makes it possible to get rid of special declaration of ShaderData structure. As a side effect it also makes it easier to experiment with SOA vs. AOS for split kernel. Works fine here on NVidia GTX580, Intel CPU and AMD Fiji cards. Reviewers: #cycles, brecht, juicyfruit, dingto Differential Revision: https://developer.blender.org/D1593	2016-01-30 00:23:06 +01:00
Sergey Sharybin	25aea19323	Cycles: Remove some unused variables from split kernel function	2016-01-29 18:54:46 +01:00
Sergey Sharybin	e2161ca854	Cycles: Remove few function arguments needed only for the split kernel Use KernelGlobals to access all the global arrays for the intermediate storage instead of passing all this storage things explicitly. Tested here with Intel OpenCL, NVIDIA GTX580 and AMD Fiji, didn't see any artifacts, so guess it's all good. Reviewers: juicyfruit, dingto, lukasstockner97 Differential Revision: https://developer.blender.org/D1736	2016-01-28 18:59:27 +01:00
Sergey Sharybin	5f31089957	Cycles: Make OpenCL's argument wrapper able to get int/float values directly	2016-01-28 15:03:42 +01:00
Sergey Sharybin	9163fc05a7	Cycles: Fix typo in flags check	2016-01-24 17:05:02 +05:00
Campbell Barton	3174254142	Cleanup: style	2016-01-24 12:13:37 +11:00
Sergey Sharybin	19adfd3176	Cycles: Fix OpenCL kernel compilation after the bake commit There is no function pointers in OpenCL specification. For as long as we want to support this platform we should follow the specifications. While the code is not totally optimal now, it should not be that huge of performance issue on CPU since it does jump tables just nicely, so it's not that much extra computation here.	2016-01-19 22:53:19 +01:00
Sergey Sharybin	52e34ffe33	Cycles: Pass missing shader filter argument to CUDA and OpenCL kernels	2016-01-19 22:53:19 +01:00
Dalai Felinto	9a76354585	Cycles-Bake: Custom Baking passes The combined pass is built with the contributions the user finds fit. It is useful for lightmap baking, as well as non-view dependent effects baking. The manual will be updated once we get closer to the 2.77 release. Meanwhile the new page can be found here: http://dalaifelinto.com/blender-manual/render/cycles/baking.html Reviewers: sergey, brecht Differential Revision: https://developer.blender.org/D1674	2016-01-15 13:00:56 -02:00
Sergey Sharybin	55926ad298	Cycles: Fix string compiler warnings after recent changes	2016-01-14 17:04:56 +05:00
Thomas Dinges	3ba9742be2	Cycles: Remove the experimental CUDA kernel. This commit removes the experimental CUDA kernel, making SSS and CMJ regular features. Several improvements have been made in the past few weeks (thanks Sergey!) which make SSS render several times faster (2-3x compared to 2.76b) on the GPU, and the increased VRAM usage has also been fixed. Therefore the experimental kernel is no longer needed. Differential Revision: https://developer.blender.org/D1726 Manual has been updated too: https://www.blender.org/manual/render/cycles/features.html	2016-01-14 12:56:08 +01:00
Sergey Sharybin	5af103fe00	Cycles: Reduce scope of some defines set in CMakeLists Should be no functional changes at all, just speeds up re-compilation when some features needs to be disabled for development purposes. For example, when running lots of Valgrind it's handy to disable any GPU devices because otherwise you'll be wasting quite some time in the driver while enumerating devices. Reviewers: dingto, lukasstockner97, brecht, juicyfruit Differential Revision: https://developer.blender.org/D1730	2016-01-14 13:12:50 +05:00
Sergey Sharybin	2af7637f20	Cycles: Add option to directly link against CUDA libraries The main purpose of such linking is to make Blender compatible with NVidia's debuggers and profilers which are doing some LD_PRELOAD magic to intercept some function calls. Such magic conflicts with our CUDA wrangler magic and causes segmentation faults. The option is disabled by default, so there's no effect on any of artists. In order to make Blender linked directly against CUDA library use the WITH_CUDA_DYNLOAD CMake option (it's marked as advanced).	2016-01-14 12:27:22 +05:00
Sergey Sharybin	ac7aefd7c2	Cycles: Use special debug panel to fine-tune debug flags This panel is only visible when debug_value is set to 256 and has no effect in other cases. However, if debug value is not set to this value, environment variables will be used to control which features are enabled, so there's no visible changes to anyone in fact. There are some changes needed to prevent devices re-enumeration on every Cycles session create. Reviewers: juicyfruit, lukasstockner97, dingto, brecht Reviewed By: lukasstockner97, dingto Differential Revision: https://developer.blender.org/D1720	2016-01-12 16:21:30 +05:00
Sergey Sharybin	02739bd051	Cycles: Cleanup, use "string_" prefix for functions in util_string No functional changes, just makes it easier to track where the function is coming from.	2016-01-07 11:47:58 +05:00
Thomas Dinges	3da0af1464	Cycles: Add utility function to convert bool to string.	2016-01-07 01:38:25 +01:00
Thomas Dinges	81a253a0d5	Cycles OpenCL: Change environment flags for testing. CYCLES_OPENCL_TEST was removed, there was an inconsistency between opencl_kernel_use_split() and opencl_get_usable_devices(). From now on, to test non whitelisted devices please use either CYCLES_OPENCL_MEGA_KERNEL_TEST or CYCLES_OPENCL_SPLIT_KERNEL_TEST.	2016-01-07 00:14:04 +01:00
Thomas Dinges	83e73a2100	Cycles: Refactor how we pass bounce info to light path node. This commit changes the way how we pass bounce information to the Light Path node. Instead of manually copying the bounces into ShaderData, we now directly pass PathState. This reduces the arguments that we need to pass around and also makes it easier to extend the feature. This commit also exposes the Transmission Bounce Depth to the Light Path node. It works similar to the Transparent Depth Output: Replace a Transmission lightpath after X bounces with another shader, e.g. a Diffuse one. This can be used to avoid black surfaces, due to low amount of max bounces. Reviewed by Sergey and Brecht, thanks for some help with this. I tested compilation and usage on CPU (SVM and OSL), CUDA, OpenCL Split and Mega kernel. Hopefully this covers all devices. :)	2016-01-06 23:43:29 +01:00
Sergey Sharybin	944b6322e6	Cycles: Log which optimizations are used for CPU kernels Not fully thread-safe, but is rather harmless. Just some messages might be logged several times.	2016-01-06 20:25:19 +05:00
Sergey Sharybin	e2846c999a	Cycles: Fix stupid mistake which was assigning kernel function in a loop	2016-01-06 20:05:33 +05:00
Sergey Sharybin	da49ee30b0	Fix T47100: OpenCL compilation warnings due to missing space in the argument list	2016-01-03 23:13:49 +05:00
Sergey Sharybin	3918c8b9a5	Cycles: Optionally output luminance from the shader evaluation kernel This makes it possible to move some parts of evaluation from host to the device and hopefully reduce memory usage by avoid having full RGBA buffer on the host. Reviewers: juicyfruit, lukasstockner97, brecht Reviewed By: lukasstockner97, brecht Differential Revision: https://developer.blender.org/D1702	2015-12-30 19:04:04 +05:00
Sergey Sharybin	0ae2ade17a	Cycles: Fix typo in the comment	2015-12-28 19:01:26 +05:00


343 Commits