griefith/test

Author	SHA1	Message	Date
lazydodo	5a8b5a0377	Land D2339 by bliblu bli	2016-12-09 08:28:04 -07:00
Lukas Stockner	a2ebc5268f	Cycles: Refactor Progress system to provide better estimates The Progress system in Cycles had two limitations so far: - It just counted tiles, but ignored their size. For example, when rendering a 600x500 image with 512x512 tiles, the right 88x500 tile would count for 50% of the progress, although it only covers 15% of the image. - Scene update time was incorrectly counted as rendering time - therefore, the remaining time started very long and gradually decreased. This patch fixes both problems: First of all, the Progress now has a function to ignore time spans, and that is used to ignore scene update time. The larger change is the tile size: Instead of counting samples per tile, so that the final value is num_samplesnum_tiles, the code now counts every sample for every pixel, so that the final value is num_samplesnum_pixels. Along with that, some unused variables were removed from the Progress and Session classes. Reviewers: brecht, sergey, #cycles Subscribers: brecht, candreacchio, sergey Differential Revision: https://developer.blender.org/D2214	2016-12-03 05:02:21 +01:00
Sergey Sharybin	9aa8d1bc45	Cycles: Fix strict compilation warnings Should be no functional changes.	2016-11-22 16:39:03 +01:00
Sergey Sharybin	af7343ae22	Cycles: Attempt to fix compilation error on ppc64el There is some define conflict between system headers and clew, so delay include of clew.h as much as possible.] This is something which needed to be done in the code before the refactor, hopefully such change will still work.	2016-11-21 13:32:41 +01:00
Lukas Stockner	dd921238d9	Cycles: Refactor Device selection to allow individual GPU compute device selection Previously, it was only possible to choose a single GPU or all of that type (CUDA or OpenCL). Now, a toggle button is displayed for every device. These settings are tied to the PCI Bus ID of the devices, so they're consistent across hardware addition and removal (but not when swapping/moving cards). From the code perspective, the more important change is that now, the compute device properties are stored in the Addon preferences of the Cycles addon, instead of directly in the User Preferences. This allows for a cleaner implementation, removing the Cycles C API functions that were called by the RNA code to specify the enum items. Note that this change is neither backwards- nor forwards-compatible, but since it's only a User Preference no existing files are broken. Reviewers: #cycles, brecht Reviewed By: #cycles, brecht Subscribers: brecht, juicyfruit, mib2berlin, Blendify Differential Revision: https://developer.blender.org/D2338	2016-11-07 03:19:29 +01:00
Martijn Berger	c02cce7b75	cycles, cuDeviceComputeCapability is deprecated as of cuda 5.0	2016-11-04 14:49:54 +01:00
Martijn Berger	4fdf68271c	Cycles standalone, compile fix UINT_MAX is not defined in device_cuda.cpp	2016-11-02 10:56:16 +01:00
Sergey Sharybin	80a6e5beb5	Cycles: Remove explicit std:: from types where possible We have our own abstraction level on top of the STL's implementation. This commit will guarantee our tweaks are used for all cases.	2016-10-24 12:31:11 +02:00
Sergey Sharybin	48997d2e40	Cycles: Cleanup, style	2016-10-24 12:26:12 +02:00
Lukas Stockner	f7ce482385	Cycles: Fix another OpenCL logging issue Previously an error message would be printed whenever the OpenCL build produced output. However, some frameworks seem to print extra information even if the build succeeded, so now the actual returned error is checked as well. When --debug-cycles is activated, the build output will always be printed, otherwise it only gets printed if there was an error.	2016-10-21 02:49:00 +02:00
Lukas Stockner	cd843409d3	Fix T49630: Cycles: Swapped shader and bake kernels The problem here was, as the title says, that the two kernels were swapped. Since shader evaluation is only used for building the samling map when World MIS is enabled, rendering without it would still work fine, although baking also was broken.	2016-10-17 12:28:01 +02:00
Lukas Stockner	d5dd12e56c	Cycles: Improve OpenCL kernel compilation logging The previous refactor changed the code to use a separate logging mechanism to support multithreaded compilation. However, since that's not supported by any frameworks yes, it just resulted in bad logging behaviour. So, this commit changes the logging to go diectly to stdout/stderr once again by default.	2016-10-17 11:51:18 +02:00
Lukas Stockner	9ea71bc674	Cycles: Split device_opencl.cpp into multiple files for easier maintenance There are no user-visible changes, just some internal restructuring. Differential Revision: https://developer.blender.org/D2231	2016-10-09 15:49:50 +02:00
Sergey Sharybin	80837d06de	Cycles: Support earlier tile rendering termination on cancel It will discard the whole tile, but it's still kind of more friendly than fully locked interface (sort of) for until tile is fully sampled. Sorry if it causes PITA to merge for the opencl split work, but this issue bothering a lot when collecting benchmarks.	2016-09-29 16:00:25 +02:00
Sergey Sharybin	333366dbcf	Cycles: Fix typo in shader cancel routines	2016-09-29 15:48:10 +02:00
Sergey Sharybin	2372e67dd6	Cycles: Don't sum up memory usage of all devices together for the stats	2016-09-23 12:43:23 +02:00
Sergey Sharybin	91e0a16f2f	Cycles: Use XDG's .cache folder for cached kernels Basically just moves cached kernels from ~/.config/blender/BLENDER_VERSION to ~/.cache/cycles/kernels. This has following benefits: - Follows XDG specification more closely, not as if it's totally crucial or measurable by users, but still nice. - Prevents unexpected sizes of config folder, makes disk space used in more predictable for users way. - Allows to share kernels across multiple Blender versions, which makes it easier debugging at the times close to release. - "Copy Previous Settings" operator will no longer be copying possibly gigabytes of cached kernels, which used to lead to really nast disk usage and annoying delays of copying settings. - In the future we can have some smart logic to clear old unused cached kernels. Currently only done for Linux and OSX. Windows still follows old "cache" folder logic, but it's not really important for now because we don't support kernel compilation on this platform yet. Reviewers: dingto, juicyfruit, brecht Reviewed By: brecht Differential Revision: https://developer.blender.org/D2197	2016-09-12 09:39:05 +02:00
Mai Lavelle	76b6c77f2c	Cycles microdisplacement: Allow kernels to be built without patch evaluation Kernels can now be built without patch evaluation when not needed by the scene (Catmull-Clark subdivision not in use), giving a performance boost for some devices.	2016-08-15 11:13:18 -04:00
Thomas Dinges	9d236ac06c	Cycles: Enable half float support (4 channels and 1 channel) on CUDA. Atm OpenEXR half files benefit from this and will use only 1/2 of the memory now. More space for HDRs! Part of my GSoC 2016.	2016-08-11 22:47:53 +02:00
Thomas Dinges	c2a7317d1f	CUDA: We don't support Toolkits < 7.5, update error message.	2016-08-09 11:41:25 +02:00
Sergey Sharybin	29dc04d9bb	Cycles: Report human-readable string of compilation error code It is possible that compilation will fail without giving anything in the log buffer. For this cases giving a tip about error code will be really handy. Patch by @Ilia, thanks!	2016-08-04 12:14:43 +02:00
Sergey Sharybin	b416168d85	Cycles: Cleanup, trailing whitespace	2016-08-02 14:09:34 +02:00
Sergey Sharybin	7b8b16a18c	Cycles: Some cleanup in CUDA device file	2016-08-02 14:09:34 +02:00
Sergey Sharybin	ad48f13099	Cycles: Include NVCC compiler flags into md5 hash This way we can easily switch between toolkits without worrying whether some kernel was compiled with old or new CUDA toolkit. It's also now possible to switch machine architecture and have proper cached kernel detected. Not as if it happens every day, but i did such a bitness switch back in the days :)	2016-08-02 14:09:34 +02:00
Sergey Sharybin	6353ecb996	Cycles: Tweaks to support CUDA 8 toolkit All the changes are mainly giving explicit tips on inlining functions, so they match how inlining worked with previous toolkit. This make kernel compiled by CUDA 8 render in average with same speed as previous kernels. Some scenes are somewhat faster, some of them are somewhat slower. But slowdown is within 1% so far. On a positive side it allows us to enable newer generation cards on buildbots (so GTX 10x0 will be officially supported soon).	2016-08-01 15:54:29 +02:00
Sergey Sharybin	274795045c	Cycles: Give better idea which OpenCL kernel is currently compiling	2016-07-14 12:49:20 +02:00
Sergey Sharybin	95a0bff83a	Cycles: Avoid strings passed by value in OpenCL device Also use more const qualifiers in the code.	2016-07-14 12:46:57 +02:00
Brecht Van Lommel	48caadfdd5	Fix Cycles assert after recent half changes.	2016-06-19 20:17:25 +02:00
Sergey Sharybin	b406b7be00	Cycles: Mark which CUDA device is used for display It is really handy to know which one is display when having two cards of same type in the machine.	2016-06-03 11:52:08 +02:00
Sergey Sharybin	d2bb0e660b	Fix T46207: Slow OpenCL GPU bake and blown out baking Cycles render	2016-05-31 17:48:42 +02:00
Mai Lavelle	4388b29e98	Cycles: Add human readable sizes to debug output Some of these values can get quite large and are hard to read, adding this makes it easy to read them at a glance. Reviewed By: sergey Differential Revision: https://developer.blender.org/D2039	2016-05-31 06:13:54 -04:00
Brecht Van Lommel	f7c28a66e2	Fix Cycles compile errors with GCC due to double promotion as errors.	2016-05-22 19:17:22 +02:00
Brecht Van Lommel	ec51175f1f	Code refactor: add generic Cycles node infrastructure. Differential Revision: https://developer.blender.org/D2016	2016-05-22 17:29:24 +02:00
Thomas Dinges	dedc995018	Cycles / CUDA: Don't use bundled kernel if Adaptive is enforced by the user.	2016-05-19 16:32:57 +02:00
Thomas Dinges	c9f1ed1e4c	Cycles: Add support for bindless textures. This adds support for CUDA Texture objects (also known as Bindless textures) for Kepler GPUs (Geforce 6xx and above). This is used for all 2D/3D textures, data still uses arrays as before. User benefits: * No more limits of image textures on Kepler. We had 5 float4 and 145 byte4 slots there before, now we have 1024 float4 and 1024 byte4. This can be extended further if we need to (just change the define). * Single channel textures slots (byte and float) are now supported on Kepler as well (1024 slots for each type). ToDo / Issues: * 3D textures don't work yet, at least don't show up during render. I have no idea whats wrong yet. * Dynamically allocate bindless_mapping array? I hope Fermi still works fine, but that should be tested on a Fermi card before pushing to master. Part of my GSoC 2016. Reviewers: sergey, #cycles, brecht Subscribers: swerner, jtheninja, brecht, sergey Differential Revision: https://developer.blender.org/D1999	2016-05-19 13:14:37 +02:00
Sergey Sharybin	7b356a8565	Cycles: Reduce amount of malloc() calls from the kernel This commit makes it so malloc() is only happening once per volume and once per transparent shadow query (per thread), improving scalability of the code to multiple CPU cores. Hard to measure this with a low-bottom i7 here currently, but from quick tests seems volume sampling gave about 3-5% speedup. The idea is to store allocated memory in kernel globals, which are per thread on CPU already. Reviewers: dingto, juicyfruit, lukasstockner97, maiself, brecht Reviewed By: brecht Subscribers: Blendify, nutel Differential Revision: https://developer.blender.org/D1996	2016-05-18 10:14:24 +02:00
Thomas Dinges	29a17d54da	Fix CUDA MEMCPY condition, it should only copy 3D, 2D or 1D. Found by Brecht, thanks!	2016-05-17 00:37:34 +02:00
Thomas Dinges	99d861169f	Cycles / Requested Features: Volume was missing in logging print.	2016-05-17 00:36:22 +02:00
Thomas Dinges	4a4f043bc4	Cycles: Add support for single channel float textures on CPU. Until now, single channel textures were packed into a float4, wasting 3 floats per pixel. Memory usage of such textures is now reduced by 3/4. Voxel Attributes such as density, flame and heat benefit from this, but also Bumpmaps with one channel. This commit also includes some cleanup and code deduplication for image loading. Example Smoke render from Cosmos Laundromat: http://www.pasteall.org/pic/show.php?id=102972 Memory here went down from ~600MB to ~300MB. Reviewers: #cycles, brecht Differential Revision: https://developer.blender.org/D1981	2016-05-11 21:58:34 +02:00
Sergey Sharybin	92774ff792	Cycles: Use explicit qualifier for single-argument constructors Almost in all cases we want such constructors to be explicit, there are exceptions but only in few places.	2016-05-11 16:51:14 +02:00
Thomas Dinges	d6555d936c	Cleanup: Avoid duplicative defines for CPU textures, use the ones from util_texture.h Also includes some further byte -> byte4 renaming, missed that in last commit.	2016-05-09 09:16:41 +02:00
Thomas Dinges	1c46ecd86b	Cleanup: Remove unneded (void) line, we don't have ifdefs here anymore.	2016-05-07 15:55:28 +02:00
Thomas Dinges	4422b3f919	Some fixes for CUDA runtime compile: * When Baking wasn't used we got an error. * On top of Volume Nodes (NODES_FEATURE_VOLUME), we now also check if we need volume sampling code, so we can disable that as well and save some further compilation time.	2016-05-06 23:13:33 +02:00
Thomas Dinges	734d1aec3f	Cycles: Make CUDA adaptive feature compile a Debug flag. If the CUDA Toolkit is installed and the user is on Linux, adaptive, feature based CUDA runtime compile is now possible to enable via: * Environment flag CYCLES_CUDA_ADAPTIVE_COMPILE or * Debug menu (Debug value 256) in the Cycles UI.	2016-05-06 23:13:33 +02:00
Thomas Dinges	3807bcb3a8	Cleanup: Rename texture slots to float4 and byte, to distinguish from future float (single channel) and half_float slots. Should be no functional changes, tested CPU and CUDA.	2016-05-06 14:37:35 +02:00
Sergey Sharybin	e50d229273	Fix T47794: Point density sometime seems stretched when rendered on GPU	2016-04-20 14:42:19 +02:00
Sergey Sharybin	b20f12d835	Cycles: Some typo fixes	2016-03-12 15:01:20 +05:00
Sergey Sharybin	8cab327316	Cycles: Make CUDA 7.5 officially recommended This was a hard decision, because going newer CUDA toolkit makes rendering up to 5% slower. But on another hand, it solves major speed regressions (up to 30%) with branched path tracing on a top level cards. Neither of those regressions have a meaningful and sane workaround from the code itself. Toolkit 6.5 could still be used, but it's no longer recommended one.	2016-02-17 15:18:56 +01:00
Sergey Sharybin	c5e1781944	Cycles: Fix crash when trying to render after re-enabling the addon	2016-02-16 12:47:31 +01:00
Sergey Sharybin	1c4f21f85e	Cycles: Initial support of 3D textures for CUDA rendering Supports both smoke/fire and point density textures now. Reduces number of textures available for sm_20 and sm_21, but you have to compromise somewhere on such a limited hardware. Currently limited to linear interpolation only, and decoupled ray marching is not supported yet. Think those could be considered just a further improvement. Some quick example: https://developer.blender.org/F282934 Code is minimal and we can fully consider it a fix for missing support of 3D textures with CUDA. Reviewers: lukasstockner97, brecht, juicyfruit, dingto Reviewed By: brecht, juicyfruit, dingto Subscribers: mib2berlin Differential Revision: https://developer.blender.org/D1806	2016-02-15 21:26:29 +01:00

1 2 3 4 5 ...

371 Commits