test2

Author	SHA1	Message	Date
Sergey Sharybin	e2c0197a96	Merge branch 'master' into blender2.8	2017-07-11 12:30:30 +02:00
Sergey Sharybin	e26f61a2b5	Cycles: Disable OpenCL clFlush workarounds This is something which was reported to work fine by Mai, Benjamin and confirmed by myself. Disabling this workaround gains us some speedup: Before Now bmw27 04:28.42 04:07.79 classroom 09:26.48 08:54.53 fishy_cat 08:44.01 08:18.70 koro 09:17.98 08:57.18 pavillon_barcelone 12:26.64 11:52.81 Test environment is: - Ubuntu 16.04, with all updates installed - AMD RX 480 GPU - amdgpu pro driver version 17.10-450821	2017-07-11 12:16:58 +02:00
Dalai Felinto	3615350957	Merge remote-tracking branch 'origin/master' into blender2.8	2017-07-07 11:27:48 +02:00
Sergey Sharybin	fee7f688c3	Cycles: Fix ambiguity in call of min() function	2017-07-07 10:40:19 +02:00
Sergey Sharybin	9d71ec5f8d	Merge branch 'master' into blender2.8	2017-07-06 12:21:21 +02:00
Mai Lavelle	9c3f1ad003	Cycles: Add artificial memory limit debug option for OpenCL	2017-07-06 05:25:46 -04:00
Mai Lavelle	f9963f29e8	Cycles: Dont allow global size to fall to zero	2017-07-05 20:19:15 -04:00
Mai Lavelle	222b96e5c7	Cycles: Detect out of memory before buffer allocation in OpenCL devices	2017-07-05 20:19:12 -04:00
Luca Rood	bdeeb29482	Merge branch 'master' into blender2.8	2017-07-05 15:50:01 +02:00
Sergey Sharybin	d37dd97e45	Cycles: Pass string by const reference rather than by value Some of the functions might have been inlined, but others i don't see how that was possible (don't think virtual functions can be inlined here). In any case, better be explicitly optimal in the code.	2017-07-05 12:27:41 +02:00
Bastien Montagne	f23ed929ee	Merge branch 'master' into blender2.8 Conflicts: source/blender/makesdna/DNA_particle_types.h	2017-07-04 13:13:49 +02:00
Lukas Stockner	6782a6076c	Cycles: Add missing split kernel to CPUDevice	2017-07-03 18:26:18 +02:00
Dalai Felinto	d97c3bc7ad	Merge branch 'master' into blender2.8	2017-07-03 15:18:46 +02:00
Mai Lavelle	56dcfcce05	Cycles: Disable baking in mega kernel when not in use to improve build times	2017-06-29 23:07:18 -04:00
Sergey Sharybin	0f4f4d8754	Merge branch 'master' into blender2.8	2017-06-12 15:12:36 +02:00
Hristo Gueorguiev	04530c9383	Cycles: adjust supported driver version for AMD GPUs On Windows 17.Q1 and 17.Q2 return driver version 2236.10.	2017-06-11 23:17:46 +02:00
Sergey Sharybin	e097fc4aa6	Cycles: Selectively include denoising in kernel	2017-06-10 04:45:13 -04:00
Mai Lavelle	eb293f59f2	Cycles: Pass all buffers to each kernel call for OpenCL Technically not passing all buffers used by a kernel is undefined behavior. We haven't had any issues with this so far on AMD or Nvidia, but it's known to be a problem with Intel and we received a report from AMD that this is a problem on newer hardware, so we need to make this change at some point. Unfortunately there a cost to being correct, about 5% for the benchmark scenes. For low sample counts it's even worse, I've seen up to 50% slowdown. For the latter case I think adjusting tile updating logic can help, but not sure what that would look like yet (it would be just a few lines change however).	2017-06-10 04:08:49 -04:00
Mai Lavelle	6238214159	Cycles: Faster split branched path tracing by sharing samples with inactive threads Unlike regular path tracing, branched path tracing is usually used with lower sample counts, at least for primary rays. This means that are less samples for the GPU to work on in parallel and rendering is slower. As there is less work overall there is also more inactive threads during rendering with BPT. This patch makes use of those inactive rays to render branched samples in parallel with other samples. Each thread that is preparing for a branched sample will attempt to find an inactive thread and if one is found the state for the sample is copied to that thread. Potentially, if there are enough inactive threads, 100s of branched samples could be generated from the same originating thread and ran in parallel giving large speed ups. Gives 70% faster render for pavillion midday scene. 20-60% faster on BMW with car paint replaced with SSS/volumes.	2017-06-10 04:08:49 -04:00
Mai Lavelle	ea846a4dfc	Cycles: Add kernel to enqueue inactive rays The queue will be used to make reuse of inactive threads to keep the GPU more busy.	2017-06-10 03:51:18 -04:00
Hristo Gueorguiev	1f0998baa7	Cycles: Blacklist unsupported OpenCL devices Due to various driver issues with AMD GCN 1 cards we can no longer support these GPUs. This patch makes them unavailable to select for Cycles rendering. GCN cards 2 and higher are still supported. Please use the most recent drivers available to ensure proper functionality. See here for a list to check which GPUs are supported: https://en.wikipedia.org/wiki/List_of_AMD_graphics_processing_units	2017-06-10 03:51:18 -04:00
Campbell Barton	bb773acd5f	Merge branch 'master' into blender2.8	2017-06-09 19:40:47 +10:00
Lukas Stockner	705c43be0b	Cycles Denoising: Merge outlier heuristic and confidence interval test The previous outlier heuristic only checked whether the pixel is more than twice as bright compared to the 75% quantile of the 5x5 neighborhood. While this detected fireflies robustly, it also incorrectly marked a lot of legitimate small highlights as outliers and filtered them away. This commit adds an additional condition for marking a pixel as a firefly: In addition to being above the reference brightness, the lower end of the 3-sigma confidence interval has to be below it. Since the lower end approximates how low the true value of the pixel might be, this test separates pixels that are supposed to be very bright from pixels that are very bright due to random fireflies. Also, since there is now a reliable outlier filter as a preprocessing step, the additional confidence interval test in the reconstruction kernel is no longer needed.	2017-06-09 03:46:11 +02:00
Campbell Barton	346619159a	Merge branch 'master' into blender2.8	2017-06-09 07:21:43 +10:00
Sergey Sharybin	45d3e22204	Cycles: Display optional board name in system info	2017-06-08 12:10:15 +02:00
Sergey Sharybin	78c0f09d4f	Cycles: Cleanup, indentation	2017-06-08 12:03:08 +02:00
Bastien Montagne	44f91a9a18	Merge branch 'master' into blender2.8 Conflicts: source/blender/blenloader/intern/versioning_270.c	2017-05-22 22:49:02 +02:00
Sergey Sharybin	34b689892b	Fix T51568: CUDA error in viewport render after fix for for OpenCL Seems re-loading module invalidates memory pointers by the looks of it, which gives an error on the next kernel call. Not sure how to move memory pointer from one CUDA module to another one, so for now simply disabling kernel re-load for CUDA devices. Not ideal, but better than failing render. Feature-selective option for CUDA is not an official feature anyway.	2017-05-22 12:28:21 +02:00
Sergey Sharybin	38a2bf665b	Cycles: Cleanup, style and unused arguments - Some arguments were inapproriatry tagged as unused using (void)foo semantic. Only use such semantic in tricky casses, when something needs to be ignored in release builds or something is dependent on tricky ifndef policy. For rest of the cases just use void foo(int /bar*/) semantic, which ensures variable is not used. Solves confusion and code running out of sync with later development. - Used proper unused semantic to some arguments. - Added braces to make code easier to follow, tricky indentation with ifdef, uh.	2017-05-20 05:21:27 -07:00
Bastien Montagne	1f46da922a	Merge branch 'master' into blender2.8 Conflicts: source/blender/blenloader/intern/versioning_270.c source/blender/depsgraph/intern/depsgraph_tag.cc source/blender/editors/mask/mask_draw.c	2017-05-19 09:36:14 +02:00
Lukas Stockner	ffd83a34ab	Fix T51502: Cycles denoising not using correctly aligned width for NLM on CUDA	2017-05-19 02:06:54 +02:00
Lukas Stockner	740cd28748	Cycles Denoising: Add more robust outlier heuristic to avoid artifacts Extremely bright pixels in the rendered image cause the denoising algorithm to produce extremely noticable artifacts. Therefore, a heuristic is needed to exclude these pixels from the filtering process. The new approach calculates the 75% percentile of the 5x5 neighborhood of each pixel and flags the pixel if it is more than twice as bright. During the reconstruction process, flagged pixels are skipped. Therefore, they don't cause any problems for neighboring pixels, and the outlier pixels themselves are replaced by a prediction of their actual value based on their feature pass values and the neighboring pixels. Therefore, the denoiser now also works as a smarter despeckling filter that uses a more accurate prediction of the pixel instead of a simple average. This can be used even if denoising isn't wanted by setting the denoising radius to 1.	2017-05-18 21:55:56 +02:00
Lukas Stockner	b3a3459e1a	Cycles Denoising: Fix wrong order of denoising feature passes	2017-05-18 21:55:56 +02:00
Dalai Felinto	75ba1826c8	Merge remote-tracking branch 'origin/master' into blender2.8	2017-05-09 17:56:16 +02:00
Sergey Sharybin	e20eb2dec0	Cycles: Properly free memory used by KernelGlobals Previous logic did not free memory used by vector classes which were storing images, causing memory leaks.	2017-05-09 17:07:17 +02:00
Julian Eisel	9181f13af7	Merge branch 'master' into blender2.8	2017-05-08 00:19:22 +02:00
Lukas Stockner	43b374e8c5	Cycles: Implement denoising option for reducing noise in the rendered image This commit contains the first part of the new Cycles denoising option, which filters the resulting image using information gathered during rendering to get rid of noise while preserving visual features as well as possible. To use the option, enable it in the render layer options. The default settings fit a wide range of scenes, but the user can tweak individual settings to control the tradeoff between a noise-free image, image details, and calculation time. Note that the denoiser may still change in the future and that some features are not implemented yet. The most important missing feature is animation denoising, which uses information from multiple frames at once to produce a flicker-free and smoother result. These features will be added in the future. Finally, thanks to all the people who supported this project: - Google (through the GSoC) and Theory Studios for sponsoring the development - The authors of the papers I used for implementing the denoiser (more details on them will be included in the technical docs) - The other Cycles devs for feedback on the code, especially Sergey for mentoring the GSoC project and Brecht for the code review! - And of course the users who helped with testing, reported bugs and things that could and/or should work better!	2017-05-07 14:40:58 +02:00
Campbell Barton	90ebf4832f	Merge branch 'master' into blender2.8	2017-05-06 22:54:28 +10:00
Hristo Gueorguiev	b9fda4480f	Cycles: Show samples progress for OpenCL split kernel	2017-05-05 13:37:21 +02:00
Campbell Barton	1c2b5430ca	Merge branch 'master' into blender2.8	2017-05-05 08:23:59 +10:00
Lukas Stockner	ed688e4843	Cycles: Fix crash when assigning KernelGlobals The memory isn't initialized during allocation, so calling the assignment operator is a bad idea.	2017-05-04 20:49:04 +02:00
Lukas Stockner	82e242cc72	Merge branch 'master' into blender2.8	2017-05-03 18:33:02 +02:00
Hristo Gueorguiev	6bf4115c13	Cycles: Split kernel - sort shaders Reduce thread divergence in kernel_shader_eval. Rays are sorted in blocks of 2048 according to shader->id. On R9 290 Classroom is ~30% faster, and Pabellon Barcelone is ~8% faster. No sorting for CUDA split kernel. Reviewers: sergey, maiself Reviewed By: maiself Differential Revision: https://developer.blender.org/D2598	2017-05-03 15:30:45 +02:00
Mai Lavelle	d187014675	Cycles: Remove extra clFinish from driver workaround These were causing problems with Nvidia OpenCL.	2017-05-02 14:26:46 -04:00
Mai Lavelle	299d839dc5	Cycles: Output split state element size	2017-05-02 14:26:46 -04:00
Mai Lavelle	915766f42d	Cycles: Branched path tracing for the split kernel This implements branched path tracing for the split kernel. General approach is to store the ray state at a branch point, trace the branched ray as normal, then restore the state as necessary before iterating to the next part of the path. A state machine is used to advance the indirect loop state, which avoids the need to add any new kernels. Each iteration the state machine recreates as much state as possible from the stored ray to keep overall storage down. Its kind of hard to keep all the different integration loops in sync, so this needs lots of testing to make sure everything is working correctly. We should probably start trying to deduplicate the integration loops more now. Nonbranched BMW is ~2% slower, while classroom is ~2% faster, other scenes could use more testing still. Reviewers: sergey, nirved Reviewed By: nirved Subscribers: Blendify, bliblubli Differential Revision: https://developer.blender.org/D2611	2017-05-02 14:26:46 -04:00
Sergey Sharybin	7f833c0da8	Merge branch 'master' into blender2.8	2017-05-02 15:29:00 +02:00
Sergey Sharybin	4384a7cf46	Cycles: Fix CUDA split kernel Global size y needs to be a multiple of 16.	2017-05-02 15:03:51 +02:00
Sergey Sharybin	4174e533c0	Cycles: Cache split kernels in CUDA device This way we don't re-load kernels for every sample in the viewport. Additionally, we don't risk global size changed inbetween of samples.	2017-05-02 15:03:12 +02:00
Dalai Felinto	b868f43fd3	Cycles support for preview on viewport with core profile This upgrade the drawing code to use latest opengl calls. Also, it adds a fallback shader for opencolorio. Reviewers: sergey, brecht Subscribers: merwin, fclem Differential Revision: https://developer.blender.org/D2652	2017-04-28 19:25:57 +02:00

1 2 3 4 5 ...

476 Commits