
bug(server): fix lsn mismatch in replication #4967

Merged

merged 8 commits into main from fix_lsn_bug on May 5, 2025

Conversation

adiholden
Collaborator

The bug:
Entries were expired at the time of the transition from full sync to stable sync, leading to journal changes not being propagated to the replica.
The fix:
Disable expiring items during the transition from full sync to stable sync using
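The guard in the Transaction::Guard diff reviewed further down flips SetExpireAllowed(false) on every shard's DbSlice for the duration of the cutover. A minimal sketch of that idea, assuming a hop-to-every-shard helper (RunOnAllShards and the guard class name are illustrative; the per-shard call mirrors the diff):

// Sketch only, not the final patch: pause expiry on all shards while the
// full-sync -> stable-sync cutover runs. RunOnAllShards is an illustrative
// placeholder for the project's real "run this on each shard thread" helper.
class PauseExpiryGuard {
 public:
  PauseExpiryGuard() { Toggle(false); }
  ~PauseExpiryGuard() { Toggle(true); }

 private:
  static void Toggle(bool allowed) {
    RunOnAllShards([allowed](EngineShard* shard) {
      namespaces->GetDefaultNamespace()
          .GetDbSlice(shard->shard_id())
          .SetExpireAllowed(allowed);
    });
  }
};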

@adiholden adiholden requested review from romange and kostasrim April 20, 2025 20:57
@adiholden adiholden changed the title from "bug server: fix lsn mismatch in replication" to "bug(server): fix lsn mismatch in replication" Apr 20, 2025
romange
romange previously approved these changes Apr 21, 2025
Collaborator

@romange romange left a comment

LGTM!

@adiholden
Collaborator Author

@romange I still see a failure of the lsn check in CI with this fix. I will continue investigating this.

Signed-off-by: adi_holden <adi@dragonflydb.io>
Signed-off-by: adi_holden <adi@dragonflydb.io>
Signed-off-by: adi_holden <adi@dragonflydb.io>
@adiholden
Collaborator Author

I suspect the problem lies in the callback that JournalStreamer registers.
We have this condition:
if (allow_await) {
  ThrottleIfNeeded();
  ...
}

meaning that in a transaction flow, if we need to throttle, we preempt before writing the data to the socket.
But if, by the time we preempt, a heartbeat flow executes (which does not allow await), it will skip this throttle and write the next journal entry to the socket before the entry that ran earlier with a lower lsn.
So by skipping the await here in the heartbeat flow (or any flow that sets allow_await to false), we do not ensure sequential order.
@romange
To fix this, we can call ThrottleIfNeeded after we write to the buffer, wdyt?
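To make the suspected interleaving concrete, an illustrative timeline (comments only, not actual Dragonfly code):

// Illustrative timeline of the suspected reorder:
//
//   Transaction flow, entry lsn = N, allow_await == true:
//     ThrottleIfNeeded() finds the buffer full and preempts the fiber,
//     suspending it before lsn = N is written to the socket.
//
//   Heartbeat flow, entry lsn = N + 1, allow_await == false:
//     skips ThrottleIfNeeded() and writes lsn = N + 1 to the socket right away.
//
//   Transaction flow resumes and writes lsn = N.
//   The replica now observes lsn = N + 1 before lsn = N.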

@romange
Collaborator

romange commented Apr 21, 2025

Let's take it offline - I need to understand more.

Signed-off-by: adi_holden <adi@dragonflydb.io>
@adiholden
Collaborator Author

I got the check failure on master for the lsn order validation. This proves the theory, fyi @romange

Signed-off-by: adi_holden <adi@dragonflydb.io>
@adiholden
Collaborator Author

The test failed again. I suspect that moving the throttle to after the write did not fix the issue, because we can have a few registered callbacks, and a throttle on the first one can still result in journal reorder.
Will need to think of another way to fix this.

@romange
Collaborator

romange commented Apr 23, 2025

What if we do not call ThrottleIfNeeded there? What if, instead of storing callbacks in change_cb_arr_, we store pointers to JournalConsumer interfaces that have:

void Handle(const JournalItem& item);
void ThrottleIfNeeded();

Then JournalSlice::CallOnChange will first go over all the instances and call Handle, and then, in a separate loop, call ThrottleIfNeeded().
In other words, the first loop will be fully atomic and non-preemptable, and the second one is about throttling/flushing.
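A minimal sketch of that shape (JournalConsumer, Handle, ThrottleIfNeeded, CallOnChange, cb_mu_, enable_journal_flush_ and journal_consumers_arr_ come from this thread; the map-of-pointers layout and everything else is assumed, not the merged code):

// Sketch of the proposed two-phase dispatch.
struct JournalConsumer {
  virtual ~JournalConsumer() = default;
  virtual void Handle(const JournalItem& item) = 0;  // must not preempt
  virtual void ThrottleIfNeeded() = 0;               // may preempt / block
};

void JournalSlice::CallOnChange(const JournalItem& item) {
  std::shared_lock lk(cb_mu_);

  // Phase 1: fully atomic and non-preemptable; every consumer records the
  // item, so all of them observe entries in lsn order.
  for (auto& [id, consumer] : journal_consumers_arr_)
    consumer->Handle(item);

  // Phase 2: throttling/flushing, which may preempt; ordering was already
  // fixed in phase 1, so a suspension here can no longer reorder entries.
  if (enable_journal_flush_) {
    for (auto& [id, consumer] : journal_consumers_arr_)
      consumer->ThrottleIfNeeded();
  }
}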

@@ -149,11 +149,19 @@ cv_status Transaction::BatonBarrier::Wait(time_point tp) {

Transaction::Guard::Guard(Transaction* tx) : tx(tx) {
  DCHECK(tx->cid_->opt_mask() & CO::GLOBAL_TRANS);
  tx->Execute([](auto*, auto*) { return OpStatus::OK; }, false);
  auto cb = [&](Transaction* t, EngineShard* shard) {
    namespaces->GetDefaultNamespace().GetDbSlice(shard->shard_id()).SetExpireAllowed(false);
Contributor

What about eviction?

Collaborator Author

We check the same flag in the eviction flow: FreeMemWithEvictionStep checks expire_allowed_.
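Roughly, as an illustrative sketch (DbSliceLike and the free function are hypothetical stand-ins; only the expire_allowed_ flag comes from this thread):

// Sketch only: an eviction step that honors the same flag the guard clears,
// so pausing expiry also pauses eviction during the cutover.
struct DbSliceLike {
  bool expire_allowed_ = true;  // cleared by SetExpireAllowed(false)
};

size_t FreeMemWithEvictionStepSketch(const DbSliceLike& slice, size_t goal_bytes) {
  if (!slice.expire_allowed_)
    return 0;  // nothing evicted while the transition is in progress
  // ... otherwise evict entries until roughly goal_bytes are freed ...
  return goal_bytes;
}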

Signed-off-by: adi_holden <adi@dragonflydb.io>
@@ -157,28 +157,35 @@ void JournalSlice::CallOnChange(const JournalItem& item) {
  // CallOnChange is atomic iff JournalSlice::SetFlushMode(false) is called before.
  std::shared_lock lk(cb_mu_);

  const size_t size = change_cb_arr_.size();
  auto k_v = change_cb_arr_.begin();
  const size_t size = journal_consumers_arr_.size();
Collaborator

The first loop is atomic - let's just iterate over the map.

    ++k_v;
  }
  k_v = journal_consumers_arr_.begin();
  if (enable_journal_flush_) {
    for (size_t i = 0; i < size; ++i) {
Collaborator

Move the size initialization here and add a comment that you capture size because ThrottleIfNeeded can preempt, and meanwhile journal_consumers_arr_ can grow. And btw, calling ThrottleIfNeeded on a new consumer is harmless, so now you can remove the size workaround altogether, imho.

Collaborator

@romange romange left a comment

lgtm!

adiholden added 2 commits May 5, 2025 12:00
Signed-off-by: adi_holden <adi@dragonflydb.io>
@adiholden adiholden requested a review from romange May 5, 2025 09:56
@adiholden adiholden enabled auto-merge (squash) May 5, 2025 10:44
@adiholden adiholden merged commit 6a84ad0 into main May 5, 2025
10 checks passed
@adiholden adiholden deleted the fix_lsn_bug branch May 5, 2025 10:50