-
Type:
Improvement
-
Resolution: Unresolved
-
Priority:
Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
None
-
Storage Execution
-
Storage Execution 2026-04-13
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Once validation of metadata and graceful stepdown have been implemented, the following sequence of events becomes invalid:
- Node is primary.
- Concurrent conflicting createCollection calls happen, resulting in one of them creating a table and then rolling back. Table is added to the reaper.
- Reaper gets EBUSY when it tries to drop the table.
- Node steps down.
- Node installs a checkpoint from the new primary
Here the validation will fail because the node has a table that it shouldn't, so we need to either disable validation or skip installing the checkpoint.
In normal operation this should skip at most one checkpoint before all tables are cleaned up, but we've had bugs that result in forever getting EBUSY before. To avoid having a standby that never installs checkpoints, we should put a limit on how many are skipped and fassert if it exceeded.