Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project Bloodstone - sled 1.0 #1456

Draft
wants to merge 135 commits into
base: main
Choose a base branch
from
Draft
Changes from 1 commit
Commits
Show all changes
135 commits
Select commit Hold shift + click to select a range
3d4c6a7
Check-in basic project bloodstone implementation
spacejam Jul 28, 2023
cd4c58f
Sand-off a few edges, rename bloodstone->sled
spacejam Jul 28, 2023
b5d95d9
Add periodic flusher thread and some better in-memory size calculations
spacejam Jul 28, 2023
41a9829
Cut new version for periodic flush thread
spacejam Jul 28, 2023
3c19bfb
Move some dependencies into this crate for better customization. Cut …
spacejam Jul 29, 2023
43f52dc
Clean up a variety of codepaths after merging crates
spacejam Jul 29, 2023
c9a11a3
Fsync metadata log directory before returning from flush
spacejam Jul 29, 2023
6f46cde
Check-in simple arch description
spacejam Jul 29, 2023
9df7f3c
Merge the tests and most of the Tree methods from old sled into sled 1.0
spacejam Jul 30, 2023
87a26c4
Fix doctest compiler errors
spacejam Jul 30, 2023
1b7f8ab
Concise iteration
wackbyte Jul 29, 2023
819c2e0
Remove unnecessary `as_ref` calls
wackbyte Jul 29, 2023
e47bbcc
Only call `as_ref` once on keys
wackbyte Jul 29, 2023
c30b4aa
Remove unnecessary referencing
wackbyte Jul 29, 2023
dc1d2ab
Un-nest double `&mut` references
wackbyte Jul 29, 2023
3e6436d
Fix typo in ARCHITECTURE.md
wackbyte Jul 29, 2023
a6c065d
Use `u8::MAX` over the soon-to-be-deprecated `u8::max_value()`
wackbyte Jul 31, 2023
36fb34f
Merge pull request #1455 from wackbyte/fix-typo
spacejam Aug 5, 2023
cf627b4
Merge pull request #1454 from wackbyte/changes
spacejam Aug 5, 2023
d1b1a73
Fix temporary directory support
spacejam Aug 5, 2023
5ebd72d
follow SPDX 2.1 license expression standard
spacejam Aug 5, 2023
c64fbbe
Implement Iter::next
spacejam Aug 5, 2023
e4fd52c
Implement the skeleton for Iter::next_back
spacejam Aug 5, 2023
5e52a92
Fix a bug with pop_last_in_range
spacejam Aug 5, 2023
5dfe390
Check-in some correctness efforts
spacejam Aug 5, 2023
117ad01
Clear away other databases for comparative benchmarking for now
spacejam Aug 5, 2023
4ea84fb
Cut alpha.104
spacejam Aug 5, 2023
b8924ef
Smooth out a number of issues, mostly around testing
spacejam Aug 5, 2023
3016dde
Fix a few concurrency bugs with the flush epoch tracker
spacejam Aug 5, 2023
bb6d53a
Fix writebatch dirty page tracking issue that the concurrent crash te…
spacejam Aug 5, 2023
6538301
Fix a test and tune-down log verbosity
spacejam Aug 5, 2023
151feaa
Improve reverse iterator undershot detection to fix concurrent iterat…
spacejam Aug 6, 2023
eccaadc
Cut alpha.107
spacejam Aug 6, 2023
6940c71
Fix overshoot bound detection in Db::page_in
spacejam Aug 6, 2023
8012b12
Relax flush notification assertion. Avoid running transaction crash t…
spacejam Aug 6, 2023
d9179c8
Cut new release with bugfixes
spacejam Aug 6, 2023
b11ebea
Remove redundant open_default function
spacejam Aug 6, 2023
1b9c057
Fix a few bugs that the intensive tests discovered
spacejam Aug 6, 2023
5ee89a9
Fix an iterator bug, and begin testing with more interesting const ge…
spacejam Aug 6, 2023
20ee4f5
Improve fuzz test, implement IntoIterator for &Db
spacejam Aug 6, 2023
594bfd9
Avoid memory leak in FlushEpoch
spacejam Aug 6, 2023
47ce0c9
Cut alpha.113 with memory leak fix
spacejam Aug 6, 2023
6a1ff5e
Fix tests under linux
spacejam Aug 6, 2023
72cfb06
Fix caching behavior
spacejam Aug 7, 2023
6cdea88
Update gitignore to include fuzzer logs
spacejam Aug 7, 2023
65a8a1a
Remove INDEX_FANOUT and EBR const generics from Db. Fix flush safety bug
spacejam Aug 12, 2023
b13abe8
Handle cooperative flushes more concretely to avoid bugs
spacejam Aug 12, 2023
a161729
Check-in more work on empty tree leaf merges and refactoring the dirt…
spacejam Aug 26, 2023
efc4242
Fix a couple merge bugs
spacejam Aug 26, 2023
16dcc6b
Make deletion tracking account for specific flush epochs. Properly ma…
spacejam Aug 26, 2023
8e95ec8
Merge pull request #1459 from spacejam/tyler_bloodstone_merges_and_st…
spacejam Aug 26, 2023
26f1e74
Bump alpha version to 117
spacejam Aug 26, 2023
49f73d2
Alter the storage format to include collection ID information in anti…
spacejam Sep 2, 2023
4abb765
Avoid mapping from NodeId to InlineArray low key
spacejam Sep 2, 2023
331ce89
Split out flush responsibility from Db
spacejam Sep 3, 2023
1ec57a5
Move cache maintenance work to Io struct
spacejam Sep 3, 2023
53c4654
Move shared IO behavior to new shared PageCache struct
spacejam Sep 3, 2023
cc8fe0b
Avoid race-prone Arc::strong_count checks in flush-on-shutdown logic
spacejam Sep 3, 2023
365b60d
docs update
spacejam Sep 3, 2023
80dbc40
Move most of the interesting methods from Db to Tree in preparation f…
spacejam Sep 3, 2023
8eb2ea2
Fix performance regression for insertions
spacejam Sep 3, 2023
9743807
Bump concurrent-map to take advantage of massive optimization in get_…
spacejam Sep 3, 2023
f5a554d
Move some ID allocation logic into its own module
spacejam Sep 3, 2023
2f73989
Start threading CollectionId into the write and recovery paths to sup…
spacejam Sep 4, 2023
1d76590
Restructure tests
spacejam Sep 4, 2023
bcea2e4
Implement multiple collections and import/export
spacejam Sep 5, 2023
304c5d2
Improve flush epoch concurrent testing
spacejam Sep 11, 2023
1786bdf
Bump version
spacejam Sep 11, 2023
16b1d98
Extract PageTable into ObjectLocationMap
kolloch Sep 25, 2023
0a91e3a
Merge pull request #1469 from kolloch/project_bloodstone
spacejam Sep 25, 2023
9dded84
Restructure heap location tracking code slightly in preparation for h…
spacejam Oct 3, 2023
bc85cc9
Make assertions about expected location transitions, avoid todo panic…
spacejam Oct 3, 2023
0209baa
Fix-up assertions so that tests pass
spacejam Oct 3, 2023
4b1aea0
Improve naming
spacejam Oct 3, 2023
ce51eeb
Standardize naming on ObjectId. Include CollectionId and low key in O…
spacejam Oct 3, 2023
b765c2f
A collection of cleanups and the beginnings of heap defragmentation
spacejam Oct 19, 2023
af4ec89
Check-in initial GC object rewriting logic
spacejam Oct 28, 2023
8228aaa
Add allocation and GC counters to Stats
spacejam Oct 29, 2023
fe48530
Check in gc pest
spacejam Nov 5, 2023
cb44c1f
A large number of improvements towards on-disk file GC
spacejam Nov 11, 2023
86fe050
Re-add max_allocated as it will be used for file truncation
spacejam Nov 11, 2023
971b91b
Reduce quickcheck operation counts a bit
spacejam Nov 11, 2023
6e24fa3
Be more pedantic in deletion test
spacejam Dec 17, 2023
ed98f93
Have reads return optional values if the page was freed
spacejam Dec 17, 2023
cf118a8
Complete Merge
spacejam Dec 24, 2023
024edbe
Clarify minimum flush epoch and better test it
spacejam Dec 24, 2023
2647942
Thread significantly more event verification into the writepath
spacejam Dec 24, 2023
947131e
Improve verification subsystem
spacejam Dec 25, 2023
07fcb1d
Fix bug in paging out dirty pages
spacejam Dec 25, 2023
784e287
Refine testing
spacejam Dec 25, 2023
bae5da3
Refine testing assertions
spacejam Dec 25, 2023
48e6c7f
Remove unreachable wildcard match in history verifier
spacejam Dec 25, 2023
c9d66f6
Bump version to alpha.119
spacejam Dec 25, 2023
8d34f6b
Address some TODOs, clean up the system more
spacejam Dec 25, 2023
11b50b5
Use BTreeSet instead of BinaryHeap in the Allocator
spacejam Dec 25, 2023
ca270ab
prioritize TODOs
spacejam Dec 25, 2023
ecc717a
Perform file truncation when a slab is detected to be at 80% of its p…
spacejam Dec 25, 2023
aa1f899
Fix size calculation for file resizing
spacejam Dec 25, 2023
e6f509e
Bump version to 1.0.0-alpha.120
spacejam Dec 25, 2023
dd72722
Clear TODO related to file resiziing
spacejam Dec 25, 2023
896b698
Abstract low-level Leaf access methods to enable lower defect prefix-…
spacejam Dec 27, 2023
e2e75ed
Have testing par! macro provide an InlineArray instead of a Vec<u8>
spacejam Dec 27, 2023
ffbdade
Add significantly more stats to the read and write paths.
spacejam Dec 28, 2023
ff5dec5
Use small cache for the concurrent iterator test, and run many more t…
spacejam Dec 28, 2023
bc8b14b
Update TODOs
spacejam Dec 29, 2023
8c69736
Make concurrent iterator test more intense
spacejam Dec 29, 2023
d9533c3
Update ARCHITECTURE.md
spacejam Jan 4, 2024
c043a1f
Better abstract the leaf storage and make room for the soon-to-be-add…
spacejam Feb 10, 2024
83a7bff
Handle flusher thread panics in a way that causes tests to fail as ex…
spacejam Feb 11, 2024
433e9b0
Mark ObjectCache as RefUnwindSafe
spacejam Feb 11, 2024
bd65e8e
Increase the strictness of the event verifier and fix a variety of su…
spacejam Feb 11, 2024
d6ed26e
Update project TODOs and get things into place for proper fsync manag…
spacejam Feb 18, 2024
16c108d
Tighten up concurrent tests by adding concurrent flushers to get epoc…
spacejam Mar 9, 2024
6905c87
Add notion of max unflushed epoch to leaves
spacejam Mar 10, 2024
3a87137
Properly log cooperative serialization in batch processing as Coopera…
spacejam Mar 23, 2024
2a888db
Remove redundant and incorrect debug log event
spacejam Mar 24, 2024
4e4a3f9
Provide more information for debugging failed crash tests
spacejam Apr 1, 2024
1516077
Properly fsync slab files after write batches
spacejam Apr 7, 2024
534fbfb
Silence warnings
spacejam Apr 7, 2024
a2fb9aa
Sync slabs dir after potentially initializing new files
spacejam Apr 7, 2024
795a221
check-in this weekend's work before flying
spacejam Apr 9, 2024
cb14155
[new file format] use crc on frame lengths in metadata store
spacejam Apr 14, 2024
3b4c889
Fix bug with crash test assertions. Reduce visibility of platform-spe…
spacejam May 11, 2024
a153ead
Improve crash test structure, add skeletons for more crash tests, add…
spacejam Jun 15, 2024
355f4d3
Sync more thoroughly before recovering metadata store. Move crash dir…
spacejam Jun 15, 2024
1a29b92
Add a lot more assertions, rely on a full Mutex in a couple places fo…
spacejam Jun 15, 2024
83541fa
Explicitly bump ebr dependency to avoid bug
spacejam Sep 5, 2024
6fdd358
Make crash test concurrent flushing more frequent
spacejam Oct 11, 2024
1e32923
Add a SeqCst fence between Release writes and later Acquire reads
spacejam Oct 11, 2024
cd9d468
Retry page accesses on a rare unexpected state
spacejam Oct 11, 2024
41a3293
Bump version which includes some additional strictness and fixes
spacejam Oct 11, 2024
7ee0e1a
Check-in new configurable for blob inlining
spacejam Dec 27, 2024
13856e3
Merge branch 'main' into project_bloodstone
spacejam Dec 27, 2024
c5e0f55
Bump upload-artifact to v4
spacejam Dec 27, 2024
cf9f21c
Fix testing feature in GHA
spacejam Dec 27, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Implement the skeleton for Iter::next_back
spacejam committed Aug 5, 2023

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
commit e4fd52c6eba3c0ed55df6170b8d98ab7d7bf8a4f
51 changes: 37 additions & 14 deletions src/db.rs
Original file line number Diff line number Diff line change
@@ -1134,7 +1134,7 @@ impl<
prefetched: VecDeque::new(),
prefetched_back: VecDeque::new(),
next_fetch: Some(InlineArray::MIN),
next_back_fetch: None,
next_back_last_lo: None,
next_calls: 0,
next_back_calls: 0,
inner: self,
@@ -1160,16 +1160,11 @@ impl<
Bound::Unbounded => InlineArray::MIN,
});

let next_back_fetch = Some(match &end {
Bound::Included(b) | Bound::Excluded(b) => b.clone(),
Bound::Unbounded => InlineArray::MIN,
});

Iter {
prefetched: VecDeque::new(),
prefetched_back: VecDeque::new(),
next_fetch,
next_back_fetch,
next_back_last_lo: None,
next_calls: 0,
next_back_calls: 0,
inner: self,
@@ -1826,7 +1821,7 @@ pub struct Iter<
next_calls: usize,
next_back_calls: usize,
next_fetch: Option<InlineArray>,
next_back_fetch: Option<InlineArray>,
next_back_last_lo: Option<InlineArray>,
prefetched: VecDeque<(InlineArray, InlineArray)>,
prefetched_back: VecDeque<(InlineArray, InlineArray)>,
}
@@ -1879,10 +1874,20 @@ impl<
fn next_back(&mut self) -> Option<Self::Item> {
self.next_back_calls += 1;
while self.prefetched_back.is_empty() {
let search_key = if let Some(last) = &self.next_back_fetch {
last.clone()
let search_key: InlineArray = if let Some(last) =
&self.next_back_last_lo
{
if !self.bounds.contains(last) || last == &InlineArray::MIN {
return None;
}
self.inner
.index
.range::<InlineArray, _>(..last)
.next_back()
.unwrap()
.0
} else {
return None;
self.inner.index.last().unwrap().0
};

let node = match self.inner.leaf_for_key(&search_key) {
@@ -1891,12 +1896,30 @@ impl<
};

let leaf = node.leaf_read.as_ref().unwrap();

if let (Some(leaf_hi), Some(last_lo)) =
(&leaf.hi, &self.next_back_last_lo)
{
if leaf_hi < last_lo {
// concurrent predecessor split, retry
continue;
}
}

for (k, v) in leaf.data.iter() {
if self.bounds.contains(k) && &search_key <= k {
self.prefetched_back.push_back((k.clone(), v.clone()));
if self.bounds.contains(k) {
let beneath_last_lo =
if let Some(last_lo) = &self.next_back_last_lo {
k < last_lo
} else {
true
};
if beneath_last_lo {
self.prefetched_back.push_back((k.clone(), v.clone()));
}
}
}
self.next_back_fetch = leaf.hi.clone();
self.next_back_last_lo = Some(leaf.lo.clone());
}

self.prefetched_back.pop_back().map(Ok)