test2

Author	SHA1	Message	Date
Iliya Katueshenock	4060ba4024	Cleanup: reserve vector before an append loop Pull Request: https://projects.blender.org/blender/blender/pulls/109416	2023-06-28 08:48:00 +02:00
Jacques Lucke	201a442750	Functions: improve default debug names in lazy function graph executor	2023-06-16 10:53:11 +02:00
Hans Goudey	1e4b80fed9	Attributes: Add quaternion rotation type Add a quaternion attribute type that will be used in combination with rotation sockets for geometry nodes to give a more intuitive experience and better performance when using rotations. The most interesting part is probably the interpolation, the rest is the same as the last attribute type addition, `988f23cec3`. We need to interpolate multiple values with different weights. Based on Sybren's suggestion, this uses the `expmap` methods from `4805a54525` for that. This also refactors `SimpleMixerWithAccumulationType` to use a function rather than a cast to convert to the accumulation type. See #92967 Pull Request: https://projects.blender.org/blender/blender/pulls/108678	2023-06-12 15:49:50 +02:00
Campbell Barton	118a47b7f6	Cleanup: quiet warnings Resolve undeclared function & dangling reference warning.	2023-06-02 12:21:56 +10:00
Sergey Sharybin	c1bc70b711	Cleanup: Add a copyright notice to files and use SPDX format A lot of files were missing copyright field in the header and the Blender Foundation contributed to them in a sense of bug fixing and general maintenance. This change makes it explicit that those files are at least partially copyrighted by the Blender Foundation. Note that this does not make it so the Blender Foundation is the only holder of the copyright in those files, and developers who do not have a signed contract with the foundation still hold the copyright as well. Another aspect of this change is using SPDX format for the header. We already used it for the license specification, and now we state it for the copyright as well, following the FAQ: https://reuse.software/faq/	2023-05-31 16:19:06 +02:00
Jacques Lucke	2cfcb8b0b8	BLI: refactor IndexMask for better performance and memory usage Goals of this refactor: * Reduce memory consumption of `IndexMask`. The old `IndexMask` uses an `int64_t` for each index which is more than necessary in pretty much all practical cases currently. Using `int32_t` might still become limiting in the future in case we use this to index e.g. byte buffers larger than a few gigabytes. We also don't want to template `IndexMask`, because that would cause a split in the "ecosystem", or everything would have to be implemented twice or templated. * Allow for more multi-threading. The old `IndexMask` contains a single array. This is generally good but has the problem that it is hard to fill from multiple threads when the final size is not known from the beginning. This is commonly the case when e.g. converting an array of bool to an index mask. Currently, this kind of code only runs on a single thread. * Allow for efficient set operations like join, intersect and difference. It should be possible to multi-thread those operations. * It should be possible to iterate over an `IndexMask` very efficiently. The most important part of that is to avoid all memory access when iterating over continuous ranges. For some core nodes (e.g. math nodes), we generate optimized code for the cases of irregular index masks and simple index ranges. To achieve these goals, a few compromises had to be made: * Slicing of the mask (at specific indices) and random element access is `O(log #indices)` now, but with a low constant factor. It should be possible to split a mask into n approximately equally sized parts in `O(n)` though, making the time per split `O(1)`. * Using range-based for loops does not work well when iterating over a nested data structure like the new `IndexMask`. Therefore, `foreach_` functions with callbacks have to be used. To avoid extra code complexity at the call site, the `foreach_` methods support multi-threading out of the box.
The new data structure splits an `IndexMask` into an arbitrary number of ordered `IndexMaskSegment`s. Each segment can contain at most `2^14 = 16384` indices. The indices within a segment are stored as `int16_t`. Each segment has an additional `int64_t` offset which allows storing arbitrary `int64_t` indices. The main benefit of this approach is that segments can be processed/constructed individually on multiple threads without a serial bottleneck. It also reduces the memory requirements significantly. For more details see comments in `BLI_index_mask.hh`. I did a few tests to verify that the data structure generally improves performance and does not cause regressions: * Our field evaluation benchmarks take about as much time as before. This is to be expected because we already made sure that e.g. add node evaluation is vectorized. The important thing here is to check that changes to the way we iterate over the indices still allow for auto-vectorization. * Memory usage by a mask is about 1/4 of what it was before in the average case. That's mainly caused by the switch from `int64_t` to `int16_t` for indices. In the worst case, the memory requirements can be larger when there are many indices that are very far away. However, when they are far away from each other, that indicates that there aren't many indices in total. In common cases, memory usage can be way lower than 1/4 of before, because sub-ranges use static memory. * For some more specific numbers I benchmarked `IndexMask::from_bools` in `index_mask_from_selection` on 10.000.000 elements at various probabilities for `true` at every index:
```
Probability   Old       New
0             4.6 ms    0.8 ms
0.001         5.1 ms    1.3 ms
0.2           8.4 ms    1.8 ms
0.5           15.3 ms   3.0 ms
0.8           20.1 ms   3.0 ms
0.999         25.1 ms   1.7 ms
1             13.5 ms   1.1 ms
```
Pull Request: https://projects.blender.org/blender/blender/pulls/104629	2023-05-24 18:11:41 +02:00
Jacques Lucke	8ba9d7b67a	Functions: improve handling of thread-local data in lazy functions The main goal here is to reduce the number of times thread-local data has to be looked up using e.g. `EnumerableThreadSpecific.local()`. While this isn't a bottleneck in many cases, it is when the action performed on the local data is very short and that happens very often (e.g. logging used sockets during geometry nodes evaluation). The solution is to simply pass the thread-local data as a parameter to many functions that use it, instead of looking it up in those functions, which generally is more costly. The lazy-function graph executor now only looks up the local data if it knows that it might be on a new thread, otherwise it uses the local data retrieved earlier. Alongside `UserData` there is `LocalUserData` now. This allows users of the lazy-function evaluation (such as geometry nodes) to have custom thread-local data that is passed to all the lazy-functions automatically. This is used for logging now.	2023-05-09 13:13:52 +02:00
Campbell Barton	6859bb6e67	Cleanup: format (with BraceWrapping::AfterControlStatement "MultiLine")	2023-05-02 09:37:49 +10:00
Hans Goudey	2f581a779c	Cleanup: Use utility constructor to create field operations	2023-04-23 15:27:20 -04:00
Hans Goudey	988f23cec3	Attributes: Add 2D integer vector attribute type This type will be used to store mesh edges in #106638, but it could be used for anything else too. This commit adds support for: - The new type in the Python API - Editing the type in the edit mode "Attribute Set" operator - Rendering the type in EEVEE and Cycles for all geometry types - Geometry nodes attribute interpolation and mixing - Viewing the type in the spreadsheet and using row filters The attribute uses the `blender::int2` type in most code, and the `vec2i` DNA type in C code when necessary. The enum names are based on `INT32_2D` for consistency with `INT8` and `INT32`. Pull Request: https://projects.blender.org/blender/blender/pulls/106677	2023-04-14 16:08:05 +02:00
Sergey Sharybin	d32d787f5f	Clang-Format: Allow empty functions to be single-line For example ``` OIIOOutputDriver::~OIIOOutputDriver() { } ``` becomes ``` OIIOOutputDriver::~OIIOOutputDriver() {} ``` Saves quite some vertical space, which is especially handy for constructors. Pull Request: https://projects.blender.org/blender/blender/pulls/105594	2023-03-29 16:50:54 +02:00
Clément Foucault	b0b9e746fa	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the opportunity to remove some C vector functions (ex: copy_v2_v2). This makes some changes to DRWView to accommodate the alignment requirements of the float4x4 type.	2023-02-06 21:25:45 +01:00
Ray Molenkamp	b5e00a1482	Revert "BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh" This reverts commit `52de84b0db`. Had some build issues on Windows I can't quickly resolve; revert for now while we fix the problems.	2023-02-02 11:46:23 -07:00
Clément Foucault	52de84b0db	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the opportunity to remove some C vector functions (ex: `copy_v2_v2`). This makes some changes to DRWView to accommodate the alignment requirements of the float4x4 type.	2023-02-02 18:11:35 +01:00
Jacques Lucke	96dfa68e5f	Cleanup: extract function that slices parameters for multi-function call	2023-01-22 00:13:47 +01:00
Jacques Lucke	3f1886d0b7	Functions: align chunk sizes in multi-function evaluation This can improve performance in some circumstances when there are vectorized and/or unrolled loops. I especially noticed that this helps a lot while working on D16970 (got a 10-20% speedup there by avoiding running into the non-vectorized fallback loop too often).	2023-01-22 00:03:25 +01:00
Jacques Lucke	31a505d1a5	Functions: add debug utility for lazy function graphs This makes it easier to print information about a socket. Just the socket name is sometimes not enough information to know where it is in the graph.	2023-01-20 13:39:29 +01:00
Jacques Lucke	72cc68e299	Functions: only allocate resource scope when it is actually used In most cases it is currently not used, so always having it there causes unnecessary overhead. In my test file that causes a 2 % performance improvement.	2023-01-14 15:56:43 +01:00
Jacques Lucke	50980981e3	Cleanup: remove MF prefix from some classes in multi-function namespace This was missing in rBeedcf1876a6651c38d8f4daa2e65d1fb81f77c5d.	2023-01-14 15:42:52 +01:00
Jacques Lucke	8625495b1c	Functions: improve handling of unused multi-function outputs Previously, `ParamsBuilder` lazily allocated an array for an output when it was unused, but the called multi-function wanted to access it. Now, whether the multi-function supports an output to be unused is part of the signature. This way, the allocation can happen earlier when the parameters are built. The benefit is that this makes all methods of `MFParams` thread-safe again, removing the need for a mutex.	2023-01-14 15:35:44 +01:00
Jacques Lucke	73a2c79c07	Functions: free memory of unused sockets earlier During geometry nodes evaluation some sockets can be determined to be unused, for example based on the condition input in a switch node. Once a socket is determined to be unused, that information has to be propagated backwards through the tree to free any memory that may have been reserved for those sockets already. This is happening before this commit already, but in a less ideal way. Determining that sockets are unused early is good because it helps with memory reuse and avoids copy-on-write copies caused by shared data. Now, nodes that are scheduled because an output became unused have priority over nodes scheduled for other reasons.	2023-01-08 21:09:33 +01:00
Jacques Lucke	eedcf1876a	Functions: introduce multi-function namespace This moves all multi-function related code in the `functions` module into a new `multi_function` namespace. This is similar to how there is a `lazy_function` namespace. The main benefit of this is that many type names that were prefixed with `MF` (for "multi function") can be simplified. There is also a common shorthand for the `multi_function` namespace: `mf`. This is also similar to lazy-functions where the shortened namespace is called `lf`.	2023-01-07 17:32:28 +01:00
Jacques Lucke	a5b27f9858	Functions: simplify multi-function signature type * `depends_on_context` was not used for a long time already. * `param_data_indices` is not used since rB42b88c008861b6. * The remaining data is moved to a single `Vector` to avoid having to do two allocations when the size signature becomes larger than fits into the inline buffer.	2023-01-07 16:51:26 +01:00
Jacques Lucke	577442a26f	Functions: build multi-function signature in-place This avoids a move of the signature after building it. The value had to be moved out of `MFSignatureBuilder` in the `build` method. This also makes the naming a bit less confusing where sometimes both the `MFSignature` and `MFSignatureBuilder` were referred to as "signature".	2023-01-07 16:30:56 +01:00
Jacques Lucke	b3146200a8	Functions: refactor multi-function builder API * New `build_mf` namespace for the multi-function builders. * The type name of the created multi-functions is now "private", i.e. the caller has to use `auto`. This has the benefit that the implementation can change more freely without affecting the caller. * `CustomMF` does not use `std::function` internally anymore. This reduces some overhead during code generation and at run-time. * `CustomMF` now supports single-mutable parameters.	2023-01-07 16:19:59 +01:00
Clément Foucault	8f44c37f5c	Cleanup: Rename BLI_math_vec_types* files to BLI_math_vector_types This is for the sake of consistency and clarity.	2023-01-06 20:09:51 +01:00
Jacques Lucke	42b88c0088	Functions: simplify multi-function parameters The use of `std::variant` allows combining the four vectors into one which more closely matches the intent and avoids a workaround used before. Note that this uses `std::get_if` instead of `std::get` because `std::get` is only available since macOS 10.14.	2023-01-06 11:50:56 +01:00
Campbell Barton	14fc02f91d	Cleanup: spelling in comments	2023-01-06 14:00:36 +11:00
Jacques Lucke	3819a9b15a	Fix T103614: crash during geometry nodes evaluation with tbb disabled	2023-01-05 15:36:39 +01:00
Jacques Lucke	2ffd08e952	Geometry Nodes: deterministic anonymous attribute lifetimes Previously, the lifetimes of anonymous attributes were determined by reference counts which were non-deterministic when multiple threads are used. Now the lifetimes of anonymous attributes are handled more explicitly and deterministically. This is a prerequisite for any kind of caching, because caching the output of nodes that do things non-deterministically and have "invisible inputs" (reference counts) doesn't really work. For more details on how deterministic lifetimes are achieved, see D16858. No functional changes are expected. Small performance changes are expected as well (within a few percent; any larger regressions should be reported as bugs). Differential Revision: https://developer.blender.org/D16858	2023-01-05 14:05:30 +01:00
Jacques Lucke	83f519b7c1	Functions: initialize node storage and default values on first execution Previously, this happened when the "node task" first runs, which might not actually execute the node if there are missing inputs. Deferring the allocation of storage and default inputs allows for better memory reuse later (currently the memory is not reused).	2023-01-04 18:46:50 +01:00
Jacques Lucke	30753f7692	Functions: add method to iterate over all inputs of a field This is part of D16858. Iterating over all field inputs allows us to extract all anonymous attributes used by a field relatively easily which is necessary for D16858. This could potentially be used for better field tooltips for nested fields, but that needs further investigation.	2023-01-03 12:37:18 +01:00
Jacques Lucke	0bc0e3f9f7	Fix: geometry nodes crashes with large trees This was an oversight in rBdba2d828462ae22de5. The evaluator uses multiple threads to initialize node states but it is still in single threaded mode. `get_main_or_local_allocator` did not return the right allocator in this case.	2023-01-02 18:34:01 +01:00
Jacques Lucke	dba2d82846	Geometry Nodes: avoid using enumerable thread specific on single thread The geometry nodes evaluator supports "lazy threading", i.e. it starts out single-threaded. But when it determines that multi-threading can be beneficial, it switches to multi-threaded mode. Now it only creates an enumerable-thread-specific if it is actually using multiple threads. This results in a 6% speedup in my test file with many node groups and math nodes.	2022-12-29 21:05:58 +01:00
Jacques Lucke	b6ca942e47	Functions: support cycles in lazy-function graph Lazy-function graphs are now evaluated properly even if they contain cycles. Note that cycles are only ok if there is no data dependency cycle. For example, a node might output something that is fed back into itself. As long as the output can be computed without the input that it feeds into, everything is ok. The code that builds the graph is responsible for making sure that there are no actual data dependencies.	2022-12-29 16:39:40 +01:00
Jacques Lucke	7e4f988072	BLI: improve node graph export in dot format This makes it a bit easier to export node graphs and also allows for more customization of links and sockets.	2022-12-29 15:09:52 +01:00
Jacques Lucke	c37e07bc01	Geometry Nodes: improve dot graph export of lazy function graph * Dim default input values. * Print default input values instead of type name. * Add node/socket names to group input/output nodes.	2022-12-16 12:18:49 +01:00
Hans Goudey	2155bdd500	Cleanup: Remove "done" variable from node runtime The runtime storage is meant for more persistent things. These local states for an algorithm are much better handled by an array now.	2022-12-02 14:14:14 -06:00
Hans Goudey	14d0b57be7	Cleanup: Use array_utils to copy evaluated field array	2022-11-22 12:49:51 -06:00
Jacques Lucke	0ebb7ab41f	Geometry Nodes: disable unreachable nodes in evaluator Nodes that were not connected to any output could still impact performance. While they were never executed, sometimes their inputs could keep references to geometries that other nodes want to modify. That caused unnecessary geometry copies, because a geometry can only be modified if it is not shared. Now, inputs that will never be used are tagged accordingly and they will never have references to geometries that others might want to modify.	2022-11-16 14:26:11 +01:00
Jacques Lucke	edcce2c073	Cleanup: correct inverted variable name	2022-11-16 13:19:23 +01:00
Jacques Lucke	a6c822733a	BLI: improve CPPType system * Support bidirectional type lookups. E.g. finding the base type of a field was supported, but not the other way around. This also removes the todo in `get_vector_type`. To achieve this, types have to be registered up-front. * Separate `CPPType` from other "type traits". For example, previously `ValueOrFieldCPPType` adds additional behavior on top of `CPPType`. Previously, it was a subclass, now it just contains a reference to the `CPPType` it corresponds to. This follows the composition-over-inheritance idea. This makes it easier to have self-contained "type traits" without having to put everything into `CPPType`. Differential Revision: https://developer.blender.org/D16479	2022-11-12 18:33:31 +01:00
Hans Goudey	97746129d5	Cleanup: replace UNUSED macro with commented args in C++ code This is the conventional way of dealing with unused arguments in C++, since it works on all compilers. Regex find and replace: `UNUSED\((\w+)\)` -> `/*$1*/`	2022-10-03 17:38:16 -05:00
Jacques Lucke	c6e70e7bac	Cleanup: follow C++ type cast style guide in some files https://wiki.blender.org/wiki/Style_Guide/C_Cpp#C.2B.2B_Type_Cast This was discussed in https://devtalk.blender.org/t/rfc-style-guide-for-type-casts-in-c-code/25907.	2022-09-25 17:39:45 +02:00
Campbell Barton	f68cfd6bb0	Cleanup: replace C-style casts with functional casts for numeric types	2022-09-25 20:17:08 +10:00
Campbell Barton	5517c848bd	Cleanup: spelling in comments	2022-09-21 12:00:01 +10:00
Jacques Lucke	5c81d3bd46	Geometry Nodes: improve evaluator with lazy threading In large node setups the threading overhead was sometimes very significant. That's especially true when most nodes do very little work. This commit improves the scheduling by not using multi-threading in many cases unless it's likely that it will be worth it. For more details see the comments in `BLI_lazy_threading.hh`. Differential Revision: https://developer.blender.org/D15976	2022-09-20 11:08:05 +02:00
Campbell Barton	95f05a6a4b	Cleanup: spelling in comments	2022-09-16 18:14:33 +10:00
Campbell Barton	2c53970bbf	Cleanup: use doxy sections, remove outdated comment	2022-09-15 15:27:21 +10:00
Campbell Barton	f78219c9a8	Cleanup: spelling in comments	2022-09-13 18:03:09 +10:00
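
The segmented layout described in commit `2cfcb8b0b8` above (segments of at most `2^14` indices stored as `int16_t`, plus one `int64_t` offset per segment, iterated through a `foreach_`-style callback) can be sketched roughly as follows. This is only an illustration of the idea in the commit message, not Blender's implementation: `Segment`, `SegmentedMask`, `foreach_index` and `mask_from_bools` are hypothetical names, and the sketch is single-threaded and skips the range optimizations the commit mentions.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

/* Hypothetical names throughout; the real code lives in `BLI_index_mask.hh`. */
constexpr int64_t max_segment_size = int64_t(1) << 14; /* 16384, as in the commit message. */

struct Segment {
  int64_t offset;               /* Added to every stored index. */
  std::vector<int16_t> indices; /* Sorted, each in [0, max_segment_size). */
};

struct SegmentedMask {
  std::vector<Segment> segments; /* Ordered by offset. */

  int64_t size() const
  {
    int64_t total = 0;
    for (const Segment &segment : segments) {
      total += int64_t(segment.indices.size());
    }
    return total;
  }

  /* Callback-based iteration, since a nested structure does not map well
   * onto a plain range-based for loop. */
  template<typename Fn> void foreach_index(Fn &&fn) const
  {
    for (const Segment &segment : segments) {
      for (const int16_t index : segment.indices) {
        fn(segment.offset + index);
      }
    }
  }
};

/* Build a mask from booleans, one chunk of `max_segment_size` elements at a time.
 * Each chunk is independent, which is what would allow building segments on
 * separate threads (this sketch stays serial). */
inline SegmentedMask mask_from_bools(const std::vector<bool> &bools)
{
  SegmentedMask mask;
  const int64_t total = int64_t(bools.size());
  for (int64_t start = 0; start < total; start += max_segment_size) {
    const int64_t end = std::min(start + max_segment_size, total);
    Segment segment{start, {}};
    for (int64_t i = start; i < end; i++) {
      if (bools[size_t(i)]) {
        segment.indices.push_back(int16_t(i - start));
      }
    }
    if (!segment.indices.empty()) {
      mask.segments.push_back(std::move(segment));
    }
  }
  return mask;
}
```

With this layout each selected index costs two bytes plus a small per-segment overhead, which is consistent with the "about 1/4" memory figure quoted in the commit message when compared to one `int64_t` per index.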

174 Commits