test2

Author	SHA1	Message	Date
Sergey Sharybin	e26f61a2b5	Cycles: Disable OpenCL clFlush workarounds This is something which was reported to work fine by Mai, Benjamin and confirmed by myself. Disabling this workaround gains us some speedup: Before Now bmw27 04:28.42 04:07.79 classroom 09:26.48 08:54.53 fishy_cat 08:44.01 08:18.70 koro 09:17.98 08:57.18 pavillon_barcelone 12:26.64 11:52.81 Test environment is: - Ubuntu 16.04, with all updates installed - AMD RX 480 GPU - amdgpu pro driver version 17.10-450821	2017-07-11 12:16:58 +02:00
Sergey Sharybin	fee7f688c3	Cycles: Fix ambiguity in call of min() function	2017-07-07 10:40:19 +02:00
Mai Lavelle	9c3f1ad003	Cycles: Add artificial memory limit debug option for OpenCL	2017-07-06 05:25:46 -04:00
Mai Lavelle	f9963f29e8	Cycles: Dont allow global size to fall to zero	2017-07-05 20:19:15 -04:00
Mai Lavelle	222b96e5c7	Cycles: Detect out of memory before buffer allocation in OpenCL devices	2017-07-05 20:19:12 -04:00
Sergey Sharybin	d37dd97e45	Cycles: Pass string by const reference rather than by value Some of the functions might have been inlined, but others i don't see how that was possible (don't think virtual functions can be inlined here). In any case, better be explicitly optimal in the code.	2017-07-05 12:27:41 +02:00
Mai Lavelle	56dcfcce05	Cycles: Disable baking in mega kernel when not in use to improve build times	2017-06-29 23:07:18 -04:00
Hristo Gueorguiev	04530c9383	Cycles: adjust supported driver version for AMD GPUs On Windows 17.Q1 and 17.Q2 return driver version 2236.10.	2017-06-11 23:17:46 +02:00
Mai Lavelle	eb293f59f2	Cycles: Pass all buffers to each kernel call for OpenCL Technically not passing all buffers used by a kernel is undefined behavior. We haven't had any issues with this so far on AMD or Nvidia, but it's known to be a problem with Intel and we received a report from AMD that this is a problem on newer hardware, so we need to make this change at some point. Unfortunately there a cost to being correct, about 5% for the benchmark scenes. For low sample counts it's even worse, I've seen up to 50% slowdown. For the latter case I think adjusting tile updating logic can help, but not sure what that would look like yet (it would be just a few lines change however).	2017-06-10 04:08:49 -04:00
Hristo Gueorguiev	1f0998baa7	Cycles: Blacklist unsupported OpenCL devices Due to various driver issues with AMD GCN 1 cards we can no longer support these GPUs. This patch makes them unavailable to select for Cycles rendering. GCN cards 2 and higher are still supported. Please use the most recent drivers available to ensure proper functionality. See here for a list to check which GPUs are supported: https://en.wikipedia.org/wiki/List_of_AMD_graphics_processing_units	2017-06-10 03:51:18 -04:00
Lukas Stockner	705c43be0b	Cycles Denoising: Merge outlier heuristic and confidence interval test The previous outlier heuristic only checked whether the pixel is more than twice as bright compared to the 75% quantile of the 5x5 neighborhood. While this detected fireflies robustly, it also incorrectly marked a lot of legitimate small highlights as outliers and filtered them away. This commit adds an additional condition for marking a pixel as a firefly: In addition to being above the reference brightness, the lower end of the 3-sigma confidence interval has to be below it. Since the lower end approximates how low the true value of the pixel might be, this test separates pixels that are supposed to be very bright from pixels that are very bright due to random fireflies. Also, since there is now a reliable outlier filter as a preprocessing step, the additional confidence interval test in the reconstruction kernel is no longer needed.	2017-06-09 03:46:11 +02:00
Sergey Sharybin	78c0f09d4f	Cycles: Cleanup, indentation	2017-06-08 12:03:08 +02:00
Sergey Sharybin	38a2bf665b	Cycles: Cleanup, style and unused arguments - Some arguments were inapproriatry tagged as unused using (void)foo semantic. Only use such semantic in tricky casses, when something needs to be ignored in release builds or something is dependent on tricky ifndef policy. For rest of the cases just use void foo(int /bar*/) semantic, which ensures variable is not used. Solves confusion and code running out of sync with later development. - Used proper unused semantic to some arguments. - Added braces to make code easier to follow, tricky indentation with ifdef, uh.	2017-05-20 05:21:27 -07:00
Lukas Stockner	740cd28748	Cycles Denoising: Add more robust outlier heuristic to avoid artifacts Extremely bright pixels in the rendered image cause the denoising algorithm to produce extremely noticable artifacts. Therefore, a heuristic is needed to exclude these pixels from the filtering process. The new approach calculates the 75% percentile of the 5x5 neighborhood of each pixel and flags the pixel if it is more than twice as bright. During the reconstruction process, flagged pixels are skipped. Therefore, they don't cause any problems for neighboring pixels, and the outlier pixels themselves are replaced by a prediction of their actual value based on their feature pass values and the neighboring pixels. Therefore, the denoiser now also works as a smarter despeckling filter that uses a more accurate prediction of the pixel instead of a simple average. This can be used even if denoising isn't wanted by setting the denoising radius to 1.	2017-05-18 21:55:56 +02:00
Lukas Stockner	43b374e8c5	Cycles: Implement denoising option for reducing noise in the rendered image This commit contains the first part of the new Cycles denoising option, which filters the resulting image using information gathered during rendering to get rid of noise while preserving visual features as well as possible. To use the option, enable it in the render layer options. The default settings fit a wide range of scenes, but the user can tweak individual settings to control the tradeoff between a noise-free image, image details, and calculation time. Note that the denoiser may still change in the future and that some features are not implemented yet. The most important missing feature is animation denoising, which uses information from multiple frames at once to produce a flicker-free and smoother result. These features will be added in the future. Finally, thanks to all the people who supported this project: - Google (through the GSoC) and Theory Studios for sponsoring the development - The authors of the papers I used for implementing the denoiser (more details on them will be included in the technical docs) - The other Cycles devs for feedback on the code, especially Sergey for mentoring the GSoC project and Brecht for the code review! - And of course the users who helped with testing, reported bugs and things that could and/or should work better!	2017-05-07 14:40:58 +02:00
Hristo Gueorguiev	b9fda4480f	Cycles: Show samples progress for OpenCL split kernel	2017-05-05 13:37:21 +02:00
Mai Lavelle	d187014675	Cycles: Remove extra clFinish from driver workaround These were causing problems with Nvidia OpenCL.	2017-05-02 14:26:46 -04:00
Hristo Gueorguiev	e91dc3a97c	Cycles: use safe compiler flags for OpenCL. Using -cl-fast-relaxed-math assumes no NaN/Inf values in any expression. This causes problems on overflow, division by zero, square root of negative number. Comparisons with NaN or infinite value are affected as well. This patch causes <2% slowdown on benchmark scenes. Fix T50985: Rendering volume scatter with GPU OpenCL comes to an halt after a few seconds	2017-04-25 20:10:51 +02:00
Sergey Sharybin	f970e859cf	Cycles: Cleanup, style	2017-04-18 11:39:21 +02:00
Sergey Sharybin	9539cfacca	Cycles: Apparently board name could be an empty string	2017-04-10 15:31:21 +02:00
Sergey Sharybin	867d311307	Cycles: Fix warning with MSVC	2017-04-07 18:28:38 +02:00
Mai Lavelle	5b45fff136	Cycles: Add missing flush	2017-04-07 06:06:08 -04:00
Mai Lavelle	4b7d95290f	Cycles: More fixes after include changes	2017-03-31 10:12:13 +02:00
Sergey Sharybin	a88801b99b	Cycles: Fix missing kernel re-compilation after recent changes Reported by Mai in IRC, thanks!	2017-03-30 11:45:30 +02:00
Sergey Sharybin	0579eaae1f	Cycles: Make all #include statements relative to cycles source directory The idea is to make include statements more explicit and obvious where the file is coming from, additionally reducing chance of wrong header being picked up. For example, it was not obvious whether bvh.h was refferring to builder or traversal, whenter node.h is a generic graph node or a shader node and cases like that. Surely this might look obvious for the active developers, but after some time of not touching the code it becomes less obvious where file is coming from. This was briefly mentioned in T50824 and seems @brecht is fine with such explicitness, but need to agree with all active developers before committing this. Please note that this patch is lacking changes related on GPU/OpenCL support. This will be solved if/when we all agree this is a good idea to move forward. Reviewers: brecht, lukasstockner97, maiself, nirved, dingto, juicyfruit, swerner Reviewed By: lukasstockner97, maiself, nirved, dingto Subscribers: brecht Differential Revision: https://developer.blender.org/D2586	2017-03-29 13:41:11 +02:00
Sergey Sharybin	a0f16e12a0	Cycles: Use more friendly GPU device name for AMD cards For example, for RX480 you'll no longer see "Ellesmere" but will see "AMD Radeon RX 480 Graphics" which makes more sense and allows to easily distinguish which exact card it is when having multiple different cards of Ellesmere codenames (i.e. RX480 and WX7100) in the same machine.	2017-03-21 12:01:11 +01:00
Sergey Sharybin	7780a108b3	Cycles: Simplify some extra OpenCL query code	2017-03-21 12:01:03 +01:00
Sergey Sharybin	fceb1d0781	Cycles: Cleanup, add some utility functions to shorten access to low level API Should be no functional changes.	2017-03-21 12:01:03 +01:00
Sergey Sharybin	3c4df13924	Fix T50268: Cycles allows to select un supported GPUs for OpenCL	2017-03-20 15:37:27 +01:00
Mai Lavelle	4833a71621	Cycles: Adjust global size for OpenCL CPU devices to make them faster	2017-03-16 06:11:42 -04:00
Sergey Sharybin	5ba51de84a	Cycles: Cleanup, indentation	2017-03-14 16:54:16 +01:00
Mai Lavelle	96868a3941	Fix T50888: Numeric overflow in split kernel state buffer size calculation Overflow led to the state buffer being too small and the split kernel to get stuck doing nothing forever.	2017-03-11 05:39:28 -05:00
Hristo Gueorguiev	9de9f25b24	Cycles: add single program debug option for split kernel Single program generally compiles kernels faster (2-3 times), loads faster, takes less drive space (2-3 times), and reduces the number of cached kernels.	2017-03-09 17:09:37 +01:00
Sergey Sharybin	97c4c2689f	Cycles: Make it more obvious message which initialization failed	2017-03-08 13:57:21 +01:00
Sergey Sharybin	ecfbfe478b	Cycles: Log which device kernels are being loaded for	2017-03-08 12:33:51 +01:00
Sergey Sharybin	712f7c3640	Cycles: Make it possible to access KernelGlobals from split data initialization function	2017-03-08 11:02:54 +01:00
Sergey Sharybin	ef7c36f5ed	Cycles: Cleanup, remove residue of previous split kernel data This is all in split data state array.	2017-03-08 10:26:29 +01:00
Mai Lavelle	64751552f7	Cycles: Fix indentation	2017-03-08 01:31:32 -05:00
Mai Lavelle	306034790f	Cycles: Calculate size of split state buffer kernel side By calculating the size of the state buffer in the kernel rather than the host less code is needed and the size actually reflects the requested features. Will also be a little faster in some cases because of larger global work size.	2017-03-08 01:31:30 -05:00
Mai Lavelle	18e50927f7	Cycles: Faster building of split kernel Simple change to make it so that only kernels that have been modified are rebuilt. Might only be useful during development.	2017-03-08 01:31:09 -05:00
Mai Lavelle	b78e543af9	Cycles: Add names to buffer allocations This is to help debug and track memory usage for generic buffers. We have similar for textures already since those require a name, but for buffers the name is only for debugging proposes.	2017-03-08 01:24:55 -05:00
Sergey Sharybin	a87766416f	Cycles: Report device maximum allocation and detected global size	2017-03-08 00:52:41 -05:00
Mai Lavelle	365a4239c5	Cycles: Workaround for driver hangs Simple workaround for some issues we've been having with AMD drivers hanging and rendering systems unresponsive. Unfortunately this makes things a bit slower, but its better than having to do hard reboots. Will be removed when drivers have been fixed. Define CYCLES_DISABLE_DRIVER_WORKAROUNDS to disable for testing purposes.	2017-03-08 00:52:41 -05:00
Mai Lavelle	230c00d872	Cycles: OpenCL split kernel refactor This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting, a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, as well as reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so they can later be used by other device types. - Replaced OpenCL specific APIs with new generic versions - Tiles can now be seen updating during rendering	2017-03-08 00:52:41 -05:00
Mai Lavelle	520b53364c	Cycles: Add OpenCL kernel for zeroing memory buffers Transferring memory to the device was very slow and there's really no need when only zeroing a buffer.	2017-03-08 00:52:41 -05:00
Mai Lavelle	0f56f7a811	Cycles: Allow device_memory to be used directly This is useful for when theres no host side memory attched to the buffer	2017-03-08 00:52:41 -05:00
Sergey Sharybin	2c30fd83f1	Cycles: Additionally report all OpenCL cflags This way we can control exact spaces and such added to the cflags which is crucial to troubleshoot certain drivers.	2017-02-22 10:06:02 +01:00
Lukas Stockner	a2ebc5268f	Cycles: Refactor Progress system to provide better estimates The Progress system in Cycles had two limitations so far: - It just counted tiles, but ignored their size. For example, when rendering a 600x500 image with 512x512 tiles, the right 88x500 tile would count for 50% of the progress, although it only covers 15% of the image. - Scene update time was incorrectly counted as rendering time - therefore, the remaining time started very long and gradually decreased. This patch fixes both problems: First of all, the Progress now has a function to ignore time spans, and that is used to ignore scene update time. The larger change is the tile size: Instead of counting samples per tile, so that the final value is num_samplesnum_tiles, the code now counts every sample for every pixel, so that the final value is num_samplesnum_pixels. Along with that, some unused variables were removed from the Progress and Session classes. Reviewers: brecht, sergey, #cycles Subscribers: brecht, candreacchio, sergey Differential Revision: https://developer.blender.org/D2214	2016-12-03 05:02:21 +01:00
Sergey Sharybin	9aa8d1bc45	Cycles: Fix strict compilation warnings Should be no functional changes.	2016-11-22 16:39:03 +01:00
Sergey Sharybin	af7343ae22	Cycles: Attempt to fix compilation error on ppc64el There is some define conflict between system headers and clew, so delay include of clew.h as much as possible.] This is something which needed to be done in the code before the refactor, hopefully such change will still work.	2016-11-21 13:32:41 +01:00

1 2

56 Commits