- Type: Bug
- Resolution: Works as Designed
- Priority: Major - P3
- None
- Affects Version/s: None
- Component/s: Replication
- None
- ALL
- Execution Team 2020-07-27, Execution Team 2021-02-08, Execution Team 2021-02-22
Flow Control uses the lastApplied wall clock time minus the lastCommitted wall clock time as a proxy for replication lag. This measure can overstate the lag when there are oplog holes: lastApplied can include operations that come after an oplog hole, and secondaries cannot replicate those operations while the hole is open.
One proposed fix is to replace the lastApplied wall clock time with the wall clock time associated with the all_durable timestamp or the oplog visibility point, since neither of these points includes operations after oplog holes.
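To make the difference concrete, here is a minimal sketch of the two lag measures on a toy oplog. It is not server code: `OplogEntry`, `last_applied`, `no_holes_point`, and the integer timestamps are hypothetical simplifications, and `no_holes_point` only stands in for the all_durable timestamp or the oplog visibility point.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class OplogEntry:
    ts: int          # logical oplog timestamp, simplified to an integer
    wall: datetime   # wall clock time recorded for the entry
    written: bool    # False while the write is still in flight (an oplog hole)


def last_applied(entries: List[OplogEntry]) -> OplogEntry:
    """Newest written entry, even if it sits after a hole."""
    return max((e for e in entries if e.written), key=lambda e: e.ts)


def no_holes_point(entries: List[OplogEntry]) -> Optional[OplogEntry]:
    """Newest entry with no holes at or before it (stand-in for all_durable)."""
    point = None
    for e in sorted(entries, key=lambda e: e.ts):
        if not e.written:
            break
        point = e
    return point


def lag(newer_wall: datetime, committed_wall: datetime) -> timedelta:
    """Lag proxy: difference between two wall clock times."""
    return newer_wall - committed_wall


T0 = datetime(2021, 1, 1)
ENTRIES = [
    OplogEntry(1, T0, True),
    OplogEntry(2, T0 + timedelta(seconds=1), True),
    OplogEntry(3, T0 + timedelta(seconds=2), False),  # hole: still in flight
    OplogEntry(4, T0 + timedelta(seconds=10), True),  # written after the hole
]
# Assume secondaries have replicated everything before the hole (ts=2).
COMMITTED_WALL = ENTRIES[1].wall

current_proxy = lag(last_applied(ENTRIES).wall, COMMITTED_WALL)
proposed_proxy = lag(no_holes_point(ENTRIES).wall, COMMITTED_WALL)
```

In this toy case the current proxy reports 9 seconds of lag even though secondaries already hold every operation they are able to read, while the proxy based on the no-holes point reports zero.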
Any solution to this issue that changes the components of the lag detection mechanism should ensure that (1) a wall clock time is available for the proposed timestamp, and (2) the proposed timestamp is accessible in memory and kept up to date.
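One way to picture these two requirements is a small in-memory index from timestamps to wall clock times that is updated on every write. The sketch below is purely illustrative: `WallClockIndex` and its methods are hypothetical names, timestamps are simplified to integers, and nothing here reflects how the server actually tracks this state.

```python
import bisect
from datetime import datetime, timedelta
from typing import List, Optional


class WallClockIndex:
    """Hypothetical in-memory map from oplog timestamp to wall clock time."""

    def __init__(self) -> None:
        self._ts: List[int] = []         # timestamps, kept sorted
        self._wall: List[datetime] = []  # wall times, parallel to _ts

    def record(self, ts: int, wall: datetime) -> None:
        # Writes may complete out of timestamp order, so insert in sorted position.
        i = bisect.bisect_left(self._ts, ts)
        self._ts.insert(i, ts)
        self._wall.insert(i, wall)

    def wall_for(self, ts: int) -> Optional[datetime]:
        # Wall time of the newest recorded entry at or before ts, if any.
        i = bisect.bisect_right(self._ts, ts)
        return self._wall[i - 1] if i else None

    def prune_before(self, ts: int) -> None:
        # Drop older entries but keep the newest one at or before ts,
        # so wall_for(ts) can still be answered after pruning.
        i = bisect.bisect_right(self._ts, ts)
        keep_from = max(i - 1, 0)
        del self._ts[:keep_from]
        del self._wall[:keep_from]
```

A caller would record each write as it happens, look up the wall time for whichever timestamp the lag calculation uses, and prune entries behind the commit point to bound memory.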
SERVER-46114 represents another case for reconsidering whether lastApplied minus lastCommitted is the best measure for lag.
- is related to
-
SERVER-46114 Flow-control engages on a single-node replica set
- Closed
-
SERVER-54576 Add invariants that no network calls are made nor locks acquired after taking a flow control ticket
- Closed
-
SERVER-54581 Report the WT all_durable timestamp in serverStatus
- Closed
- related to
-
SERVER-45881 Investigate and implement desired Flow Control throttling for multi-document transactions
- Closed