griefith/test

Author	SHA1	Message	Date
Campbell Barton	137f8dd7bc	Cleanup: spelling in comments	2023-10-10 09:44:57 +11:00
Jacques Lucke	7bd509f73a	Functions: enable multi-threading when many nodes are scheduled at once Nodes that are scheduled can be executed in any order in theory. So when there are many scheduled nodes, it can be benefitial to start evaluating them in parallel. Note that it is not very common that many nodes are scheduled at the same time in typical setups because the evaluator uses a depth-first heuristic to decide in which order to evaluate nodes. It can happen more easily in generated node trees though. Also, this change only has an affect in practice if none of the scheduled nodes uses multi-threading internally, as this would also trigger the user of multiple threads in the graph executor.	2023-10-08 16:21:23 +02:00
Jacques Lucke	8822e4de73	Functions: add lazy-function graph input/output getter methods	2023-10-08 16:01:56 +02:00
Jacques Lucke	bef0d6c067	Functions: extract remapped-params to make it reusable This idea is of remapping parameters of lazy-functions is useful not only for the repeat zone. For example, it could be used for the for-each zone as well. Also, moving it to a more general place indicates that there is no repeat-zone specific stuff in it.	2023-10-06 22:32:51 +02:00
Campbell Barton	5fbcb4c27e	Cleanup: remove spaces from commented arguments Also use local enums for `MA_BM_*` in versioning code.	2023-09-22 12:21:18 +10:00
Jacques Lucke	62e2cc0ad0	Geometry Nodes: refactor geometry nodes execution interface The main goal of this refactor is to simplify how a geometry node group is executed. Previously, there was duplicated logic that turned the lazy-function graph of a node group into a single lazy-function. Now this is done only in one place and others can just execute the lazy-function directly, without having to worry about the underlying graph. Pull Request: https://projects.blender.org/blender/blender/pulls/112482	2023-09-17 19:09:45 +02:00
Jacques Lucke	4db6a22c72	Functions: use array indexing instead of VectorSet in graph executor This avoids the need to build the VectorSet and array indexing is generally faster than a hash table lookup.	2023-09-17 14:27:01 +02:00
Jacques Lucke	2a5f3bd1cc	Functions: refactor lazy-function graph interface Goals of the refactor: * Simplify adding (named) graph inputs and outputs. * Add ability to refer to a graph input or output with an index. * Get rid of the "dummy" terminology which doesn't really help. Previously, one would add "dummy nodes" which can then serve as input and output nodes of the graph. Now one directly adds input and outputs using `Graph.add_input` and `Graph.add_output`. There is one interface node that contains all inputs and another one that contains all outputs. Being able to refer to a graph input or output with an index makes it more efficient to implement some algorithms. E.g. one could have a bit span for a socket that contains all the information what graph inputs this socket depends on. Pull Request: https://projects.blender.org/blender/blender/pulls/112474	2023-09-17 13:54:09 +02:00
Jacques Lucke	54fd33d783	Functions: support wrapping lazy-function node execute function This is a light weight solution to passing in some extra context into a lazy-function that is invoked by the graph executor. The new functionality is used by #112421.	2023-09-16 18:50:54 +02:00
Jacques Lucke	93f8d55473	Function: add assert to detect invalid side effect nodes early	2023-09-16 18:44:58 +02:00
Jacques Lucke	bd414cdbda	Functions: reduce memory usage in node state By storing a raw pointer instead of a `Span`, we save 16 bytes per node state. I measured a ~5% speedup in my setup with a simple repeat zone. `5c450aea05` added some additional asserts to check for valid indices. Generally, index-errors in this area lead to wrong behaviors of geometry nodes very quickly.	2023-09-16 12:30:23 +02:00
Jacques Lucke	5c450aea05	Functions: add asserts to check indices	2023-09-16 12:24:54 +02:00
Jacques Lucke	60c65ab13b	Functions: better pack socket state structs This reduces the amount of used memory.	2023-09-16 12:11:08 +02:00
Jacques Lucke	c74a309209	Functions: combine allocations in lazy function graph executor There are many small allocations when the graph executor is initialized (e.g. all the node/sockets have to be allocated). Those were already combined into a few allocations by making use of `LinearAllocator`. However, even better performance can be achieved by making one larger allocation and then using preprocessed offsets into that buffer. I measured up to 20% speedup in geometry nodes with a simple repeat zone.	2023-09-16 11:38:40 +02:00
Iliya Katueshenock	e951924b33	Cleanup: forbid implicit copy and move for multi function This avoids accidental copies/moves which are never really intentional. Pull Request: https://projects.blender.org/blender/blender/pulls/110168	2023-08-31 16:01:29 +02:00
Aras Pranckevicius	acbd952abf	Cleanup: fewer iostreams related includes from BLI/BKE headers Including <iostream> or similar headers is quite expensive, since it also pulls in things like <locale> and so on. In many BLI headers, iostreams are only used to implement some sort of "debug print", or an operator<< for ostream. Change some of the commonly used places to instead include <iosfwd>, which is the standard way of forward-declaring iostreams related classes, and move the actual debug-print / operator<< implementations into .cc files. This is not done for templated classes though (it would be possible to provide explicit operator<< instantiations somewhere in the source file, but that would lead to hard-to-figure-out linker error whenever someone would add a different template type). There, where possible, I changed from full <iostream> include to only the needed <ostream> part. For Span<T>, I just removed print_as_lines since it's not used by anything. It could be moved into a .cc file using a similar approach as above if needed. Doing full blender build changes include counts this way: - <iostream> 1986 -> 978 - <sstream> 2880 -> 925 It does not affect the total build time much though, mostly because towards the end of it there's just several CPU cores finishing compiling OpenVDB related source files. Pull Request: https://projects.blender.org/blender/blender/pulls/111046	2023-08-16 09:51:37 +02:00
Campbell Barton	e955c94ed3	License Headers: Set copyright to "Blender Authors", add AUTHORS Listing the "Blender Foundation" as copyright holder implied the Blender Foundation holds copyright to files which may include work from many developers. While keeping copyright on headers makes sense for isolated libraries, Blender's own code may be refactored or moved between files in a way that makes the per file copyright holders less meaningful. Copyright references to the "Blender Foundation" have been replaced with "Blender Authors", with the exception of `./extern/` since these this contains libraries which are more isolated, any changed to license headers there can be handled on a case-by-case basis. Some directories in `./intern/` have also been excluded: - `./intern/cycles/` it's own `AUTHORS` file is planned. - `./intern/opensubdiv/`. An "AUTHORS" file has been added, using the chromium projects authors file as a template. Design task: #110784 Ref !110783.	2023-08-16 00:20:26 +10:00
Campbell Barton	ed01e16aa6	Cleanup: quiet uninitialized warnings	2023-07-29 13:47:57 +10:00
Ray Molenkamp	04235d0e55	Cleanup: CMake: Modernize bf_blenlib dependencies Pretty straightforward - Remove any blenlib paths from INC - Add a dependency though LIB Pull Request: https://projects.blender.org/blender/blender/pulls/109934	2023-07-10 22:04:18 +02:00
Ray Molenkamp	57ad866d81	Cleanup: CMake: Modernize bf_guardedalloc dependencies Pretty straightforward - Removes any guardedalloc paths from INC - Adds a dependency though LIB Pull Request: https://projects.blender.org/blender/blender/pulls/109925	2023-07-10 18:44:19 +02:00
Ray Molenkamp	7cebb61486	Cleanup: CMake: Modernize bf_dna dependencies There's quite a few libraries that depend on dna_type_offsets.h but had gotten to it by just adding the folder that contains it to their includes INC section without declaring a dependency to bf_dna in the LIB section. which occasionally lead to the lib building before bf_dna and the header being missing, while this generally gets fixed in CMake by adding bf_dna to the LIB section of the lib, however until last week all libraries in the LIB section were linked as INTERFACE so adding it in there did not resolve the build issue. To make things still build, we sprinkled add_dependencies wherever we needed it to force a build order. This diff : Declares public include folders for the bf_dna target so there's no more fudging the INC section required to get to them. Removes all dna related paths from the INC section for all libraries. Adds an alias target bf:dna to signify it has been updated to modern cmake Declares a dependency on bf::dna for all libraries that require it Removes (almost) all calls to add_dependencies for bf_dna Future work: Because of the manual dependency management that was done, there is now some "clutter" with libs depending on bf_dna that realistically don't. Example bf_intern_opencolorio itself has no dependency on bf_dna at all, doesn't need it, doesn't use it. However the dna include folder had been added to it in the past since bf_blenlib uses dna headers in some of its public headers and bf_intern_opencolorio does use those blenlib headers. Given bf_blenlib now correctly declares the dependency on bf_dna as public bf_intern_opencolorio will get the dna header directory automatically from CMake, hence some cleanup could be done for bf_intern_opencolorio Because 99% of the changes in this diff have been automated, this diff does not seek to address these issues as there is no easy way to determine why a certain dependency is in place. A developer will have to make a pass a this at some later point in time. As I'd rather not mix automated and manual labour. There are a few libraries that could not be automatically processed (ie bf_blendthumb) that also will need this manual look-over. Pull Request: https://projects.blender.org/blender/blender/pulls/109835	2023-07-10 15:07:37 +02:00
Iliya Katueshenock	4060ba4024	Cleanup: reserve vector before an append loop Pull Request: https://projects.blender.org/blender/blender/pulls/109416	2023-06-28 08:48:00 +02:00
Jacques Lucke	201a442750	Functions: improve default debug names in lazy function graph executor	2023-06-16 10:53:11 +02:00
Hans Goudey	1e4b80fed9	Attributes: Add quaternion rotation type Add a quaternion attribute type that will be used in combination with rotation sockets for geometry nodes to give a more intuitive experience and better performance when using rotations. The most interesting part is probably the interpolation, the rest is the same as the last attribute type addition, `988f23cec3`. We need to interpolate multiple values with different weights. Based on Sybren's suggestion, this uses the `expmap` methods from `4805a54525` for that. This also refactors `SimpleMixerWithAccumulationType` to use a function rather than a cast to convert to the accumulation type. See #92967 Pull Request: https://projects.blender.org/blender/blender/pulls/108678	2023-06-12 15:49:50 +02:00
Campbell Barton	118a47b7f6	Cleanup: quiet warnings Resolve undeclared function & dangling reference warning.	2023-06-02 12:21:56 +10:00
Sergey Sharybin	c1bc70b711	Cleanup: Add a copyright notice to files and use SPDX format A lot of files were missing copyright field in the header and the Blender Foundation contributed to them in a sense of bug fixing and general maintenance. This change makes it explicit that those files are at least partially copyrighted by the Blender Foundation. Note that this does not make it so the Blender Foundation is the only holder of the copyright in those files, and developers who do not have a signed contract with the foundation still hold the copyright as well. Another aspect of this change is using SPDX format for the header. We already used it for the license specification, and now we state it for the copyright as well, following the FAQ: https://reuse.software/faq/	2023-05-31 16:19:06 +02:00
Jacques Lucke	2cfcb8b0b8	BLI: refactor IndexMask for better performance and memory usage Goals of this refactor: * Reduce memory consumption of `IndexMask`. The old `IndexMask` uses an `int64_t` for each index which is more than necessary in pretty much all practical cases currently. Using `int32_t` might still become limiting in the future in case we use this to index e.g. byte buffers larger than a few gigabytes. We also don't want to template `IndexMask`, because that would cause a split in the "ecosystem", or everything would have to be implemented twice or templated. * Allow for more multi-threading. The old `IndexMask` contains a single array. This is generally good but has the problem that it is hard to fill from multiple-threads when the final size is not known from the beginning. This is commonly the case when e.g. converting an array of bool to an index mask. Currently, this kind of code only runs on a single thread. * Allow for efficient set operations like join, intersect and difference. It should be possible to multi-thread those operations. * It should be possible to iterate over an `IndexMask` very efficiently. The most important part of that is to avoid all memory access when iterating over continuous ranges. For some core nodes (e.g. math nodes), we generate optimized code for the cases of irregular index masks and simple index ranges. To achieve these goals, a few compromises had to made: * Slicing of the mask (at specific indices) and random element access is `O(log #indices)` now, but with a low constant factor. It should be possible to split a mask into n approximately equally sized parts in `O(n)` though, making the time per split `O(1)`. * Using range-based for loops does not work well when iterating over a nested data structure like the new `IndexMask`. Therefor, `foreach_` functions with callbacks have to be used. To avoid extra code complexity at the call site, the `foreach_` methods support multi-threading out of the box. The new data structure splits an `IndexMask` into an arbitrary number of ordered `IndexMaskSegment`. Each segment can contain at most `2^14 = 16384` indices. The indices within a segment are stored as `int16_t`. Each segment has an additional `int64_t` offset which allows storing arbitrary `int64_t` indices. This approach has the main benefits that segments can be processed/constructed individually on multiple threads without a serial bottleneck. Also it reduces the memory requirements significantly. For more details see comments in `BLI_index_mask.hh`. I did a few tests to verify that the data structure generally improves performance and does not cause regressions: * Our field evaluation benchmarks take about as much as before. This is to be expected because we already made sure that e.g. add node evaluation is vectorized. The important thing here is to check that changes to the way we iterate over the indices still allows for auto-vectorization. * Memory usage by a mask is about 1/4 of what it was before in the average case. That's mainly caused by the switch from `int64_t` to `int16_t` for indices. In the worst case, the memory requirements can be larger when there are many indices that are very far away. However, when they are far away from each other, that indicates that there aren't many indices in total. In common cases, memory usage can be way lower than 1/4 of before, because sub-ranges use static memory. * For some more specific numbers I benchmarked `IndexMask::from_bools` in `index_mask_from_selection` on 10.000.000 elements at various probabilities for `true` at every index: ``` Probability Old New 0 4.6 ms 0.8 ms 0.001 5.1 ms 1.3 ms 0.2 8.4 ms 1.8 ms 0.5 15.3 ms 3.0 ms 0.8 20.1 ms 3.0 ms 0.999 25.1 ms 1.7 ms 1 13.5 ms 1.1 ms ``` Pull Request: https://projects.blender.org/blender/blender/pulls/104629	2023-05-24 18:11:41 +02:00
Campbell Barton	46479f41e1	Cleanup: quiet dangling-reference warnings with GCC13	2023-05-10 12:06:27 +10:00
Jacques Lucke	8ba9d7b67a	Functions: improve handling of thread-local data in lazy functions The main goal here is to reduce the number of times thread-local data has to be looked up using e.g. `EnumerableThreadSpecific.local()`. While this isn't a bottleneck in many cases, it is when the action performed on the local data is very short and that happens very often (e.g. logging used sockets during geometry nodes evaluation). The solution is to simply pass the thread-local data as parameter to many functions that use it, instead of looking it up in those functions which generally is more costly. The lazy-function graph executor now only looks up the local data if it knows that it might be on a new thread, otherwise it uses the local data retrieved earlier. Alongside with `UserData` there is `LocalUserData` now. This allows users of the lazy-function evaluation (such as geometry nodes) to have custom thread-local data that is passed to all the lazy-functions automatically. This is used for logging now.	2023-05-09 13:13:52 +02:00
Campbell Barton	6859bb6e67	Cleanup: format (with BraceWrapping::AfterControlStatement "MultiLine")	2023-05-02 09:37:49 +10:00
Hans Goudey	a6baf7beae	BLI: Allow different integer types when filling span indices	2023-04-27 08:50:41 -04:00
Hans Goudey	2f581a779c	Cleanup: Use utility constructor to create field operations	2023-04-23 15:27:20 -04:00
Hans Goudey	988f23cec3	Attributes: Add 2D integer vector attribute type This type will be used to store mesh edges in #106638, but it could be used for anything else too. This commit adds support for: - The new type in the Python API - Editing the type in the edit mode "Attribute Set" operator - Rendering the type in EEVEE and Cycles for all geometry types - Geometry nodes attribute interpolation and mixing - Viewing the type in the spreadsheet and using row filters The attribute uses the `blender::int2` type in most code, and the `vec2i` DNA type in C code when necessary. The enum names are based on `INT32_2D` for consistency with `INT8` and `INT32`. Pull Request: https://projects.blender.org/blender/blender/pulls/106677	2023-04-14 16:08:05 +02:00
Jacques Lucke	55d473ee40	Cleanup: use better default name for unknown parameter The `<` and `>` don't work well when the name is inserted into a .dot graph.	2023-03-30 18:44:11 +02:00
Sergey Sharybin	d32d787f5f	Clang-Format: Allow empty functions to be single-line For example ``` OIIOOutputDriver::~OIIOOutputDriver() { } ``` becomes ``` OIIOOutputDriver::~OIIOOutputDriver() {} ``` Saves quite some vertical space, which is especially handy for constructors. Pull Request: https://projects.blender.org/blender/blender/pulls/105594	2023-03-29 16:50:54 +02:00
Campbell Barton	b3625e6bfd	Cleanup: comment blocks	2023-03-09 10:39:49 +11:00
Germano Cavalcante	7fcb262dfd	Cleanup: resolve some unreferenced parameter warnings in MSVC When the warning level is set to 4, some unreferenced parameter warnings can appear This commit resolves some of those warnings.	2023-03-07 21:39:44 -03:00
Clément Foucault	b0b9e746fa	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the oportunity to remove some C vector functions (ex: copy_v2_v2). This makes some changes to DRWView to accomodate the alignement requirements of the float4x4 type.	2023-02-06 21:25:45 +01:00
Ray Molenkamp	b5e00a1482	Revert "BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh" This reverts commit `52de84b0db`. had some build issues on windows i can't quickly resolve, revert for now while we fix the problems	2023-02-02 11:46:23 -07:00
Clément Foucault	52de84b0db	BLI: Use BLI_math_matrix_type.hh instead of BLI_math_float4x4.hh Straightforward port. I took the oportunity to remove some C vector functions (ex: `copy_v2_v2`). This makes some changes to DRWView to accomodate the alignement requirements of the float4x4 type.	2023-02-02 18:11:35 +01:00
Jacques Lucke	904357d67a	Fix: assert when converting between incompatible field types This results in a compile time error now which hopefully prevents this specific kind of mistake in the future.	2023-01-28 14:52:15 +01:00
Jacques Lucke	96dfa68e5f	Cleanup: extract function that slices parameters for multi-function call	2023-01-22 00:13:47 +01:00
Jacques Lucke	3f1886d0b7	Functions: align chunk sizes in multi-function evaluation This can improve performance in some circumstances when there are vectorized and/or unrolled loops. I especially noticed that this helps a lot while working on D16970 (got a 10-20% speedup there by avoiding running into the non-vectorized fallback loop too often).	2023-01-22 00:03:25 +01:00
Jacques Lucke	31a505d1a5	Functions: add debug utility for lazy function graphs This makes it easier to print information about a socket. Just the socket name is sometimes not enough information to know where it is in the graph.	2023-01-20 13:39:29 +01:00
Jacques Lucke	72cc68e299	Functions: only allocate resource scope when it is actually used In most cases it is currently not used, so always having it there causes unnecessary overhead. In my test file that causes a 2 % performance improvement.	2023-01-14 15:56:43 +01:00
Jacques Lucke	50980981e3	Cleanup: remove MF prefix from some classes in multi-function namespace This was missing in rBeedcf1876a6651c38d8f4daa2e65d1fb81f77c5d.	2023-01-14 15:42:52 +01:00
Jacques Lucke	8625495b1c	Functions: improve handling of unused multi-function outputs Previously, `ParamsBuilder` lazily allocated an array for an output when it was unused, but the called multi-function wanted to access it. Now, whether the multi-function supports an output to be unused is part of the signature. This way, the allocation can happen earlier when the parameters are build. The benefit is that this makes all methods of `MFParams` thread-safe again, removing the need for a mutex.	2023-01-14 15:35:44 +01:00
Jacques Lucke	aea26830dc	Cleanup: use std::get instead of std::get_if `std::get` could not be used due to restrictions on macos. However, the minimum requirement has been lifted in {rB597aecc01644f0063fa4545dabadc5f73387e3d3}.	2023-01-14 14:16:51 +01:00
Campbell Barton	02226e9069	Cleanup: spelling in comments	2023-01-09 17:41:08 +11:00
Jacques Lucke	73a2c79c07	Functions: free memory of unused sockets earlier During geometry nodes evaluation some sockets can be determined to be unused, for example based on the condition input in a switch node. Once a socket is determined to be unused, that information has to be propagated backwards through the tree to free any memory that may have been reserved for those sockets already. This is happening before this commit already, but in a less ideal way. Determining that sockets are unused early is good because it helps with memory reuse and avoids copy-on-write copies caused by shared data. Now, nodes that are scheduled because an output became unused have priority over nodes scheduled for other reasons.	2023-01-08 21:09:33 +01:00

1 2 3 4 5 ...

320 Commits