- 27 Feb, 2019 1 commit
-
Harris Hancock authored
InsertionOrderIndex manages memory, but it lacked an explicit move constructor and move assignment operator; moving one therefore copied its internal pointers, causing segmentation faults and double frees.
-
- 05 Aug, 2018 14 commits
-
Kenton Varda authored
-
Kenton Varda authored
-
Kenton Varda authored
-
Kenton Varda authored
-
Kenton Varda authored
See the map.h changes for why this is cleaner.
-
Kenton Varda authored
This is a very common pattern in practice -- and annoyingly difficult with STL maps. This required some refactoring so that index.insert() could be called before the row was actually constructed, based on the search parameters. It also required some awful hacks to support putting the creation function at the end of the argument list to findOrCreate(), with a variable-width argument list before it.
-
Kenton Varda authored
Integer division is really, really slow. The integer hash table benchmark spends most of its time in modulus operations! This change shaves 32% off the integer hash table benchmark runtime, and 8% off the string hash table benchmark runtime.
-
Kenton Varda authored
-
Kenton Varda authored
Admittedly, this is strictly a simplification of the code. VS 2017 is fine either way.
-
Kenton Varda authored
This is true even if the pointer-to-member is never actually used, but only has its type matched, which was what kj::size() was trying to do. Oh well, define some damned constants instead.
-
Kenton Varda authored
-
Kenton Varda authored
-
Kenton Varda authored
-
Kenton Varda authored
Hash-based (unordered) and tree-based (ordered) indexing are provided. kj::Table offers advantages over STL:

- A Table can have multiple indexes (allowing lookup by multiple keys). Different indexes can use different algorithms (e.g. hash vs. tree) and have different uniqueness constraints.
- The properties on which a Table is indexed need not be explicit fields; they can be computed from the table's row type.
- Tables use less memory and make fewer allocations than STL containers, because rows are stored in a contiguous array.
- The hash indexing implementation uses linear probing rather than chaining, which again means far fewer allocations and more cache-friendliness.
- The tree indexing implementation uses B-trees optimized for cache line size, whereas STL uses allocation-heavy red-black binary trees. (However, STL's trees still come out ahead overall; see the benchmark below.)
- Most of the B-tree implementation is not templated. This reduces code bloat, at the cost of some performance due to virtual calls.

On an ad hoc benchmark on large tables, the hash index implementation appears to outperform libc++'s `std::unordered_set` by ~60%. However, libc++'s `std::set` still outperforms the B-tree index by ~70%. It looks like the B-tree implementation suffers in part from the fact that keys are not stored inline in the tree nodes, forcing extra memory indirections. This is a price we pay for lower memory usage overall and for the ability to have multiple indexes on one table. The B-tree implementation also suffers somewhat from not being 100% templates, compared to STL, but I think this is a reasonable trade-off. The most performance-critical use cases will use hash indexes anyway.
-