[SERVER-49075] Add replica_sets suite to resumable index build variant Created: 24/Jun/20  Updated: 29/Oct/23  Resolved: 02/Sep/20

Status: Closed
Project: Core Server
Component/s: Index Maintenance, Testing Infrastructure
Affects Version/s: None
Fix Version/s: 4.7.0

Type: Task
Priority: Major - P3
Reporter: Samyukta Lanka
Assignee: Gregory Noma
Resolution: Fixed
Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
depends on SERVER-50446 make index builds non-resumable when ... Closed
depends on SERVER-50168 Transition to rollback doesn't need t... Closed
Problem/Incident
Related
is related to SERVER-48476 resumable index build should use majo... Closed
is related to SERVER-49774 Enable rollback testing for resumable... Closed
Backwards Compatibility: Fully Compatible
Sprint: Execution Team 2020-08-24, Execution Team 2020-09-07
Participants:
Linked BF Score: 37

 Description   

SERVER-48476 is adding a wait during index builds that can cause a hang in some replica_sets tests when resumable index builds are enabled. As a part of this ticket, we should resolve these hangs.



 Comments   
Comment by Githook User [ 03/Sep/20 ]

Author: Gregory Noma <gregory.noma@gmail.com> (username: gregorynoma)

Message: SERVER-49075 Add requires_fcv_47 tag to resumable index build rollback tests
Branch: master
https://github.com/mongodb/mongo/commit/58bc5d187eb30270353f68eef3977c5be57e4eb8

Comment by Githook User [ 02/Sep/20 ]

Author: Gregory Noma <gregory.noma@gmail.com> (username: gregorynoma)

Message: SERVER-49075 Add replica_sets suite to resumable index builds variant
Branch: master
https://github.com/mongodb/mongo/commit/fc7a537848482592d40ece7b84f1fda0fa7f725e

Comment by Samyukta Lanka [ 24/Jun/20 ]

This depends on SERVER-49076 since that ticket will address hang issues in the rollback test fixture, which is used by multiple tests in replica_sets suites.

Comment by Samyukta Lanka [ 24/Jun/20 ]

One cause of hangs is tests that stop replication on secondaries. One example is read_committed_with_catalog_changes.js. The test uses a PSA (primary-secondary-arbiter) architecture and tries to create an index while replication is stopped on the secondary, which holds back the majority commit point. Since the test disables the commit quorum, this is normally not a problem. With the changes from SERVER-48476, however, the primary stalls waiting for the last optime before the collection scan to be majority committed, so the test eventually times out. See these logs for more details.
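
To illustrate why a PSA set stalls here, the following is a minimal Python sketch of the majority commit point calculation. It is a simplified model and not the server's implementation: optimes are reduced to plain integers, and the names Member, majority_commit_point, and index_build_can_scan are hypothetical. It assumes that the arbiter counts toward the majority size but never acknowledges writes, so both data-bearing nodes must apply an optime before it becomes majority committed.

```python
from dataclasses import dataclass


@dataclass
class Member:
    name: str
    data_bearing: bool      # arbiters vote but hold no data
    last_applied: int = 0   # simplified optime: a single integer


def majority_commit_point(members):
    """Return the highest optime applied by a majority of voting members."""
    majority = len(members) // 2 + 1
    # Only data-bearing members can acknowledge that an optime was applied.
    applied = sorted(
        (m.last_applied for m in members if m.data_bearing), reverse=True
    )
    if len(applied) < majority:
        return 0
    # The majority-th highest applied optime is covered by a majority.
    return applied[majority - 1]


def index_build_can_scan(members, optime_before_scan):
    """Model of the wait: proceed only once the optime is majority committed."""
    return majority_commit_point(members) >= optime_before_scan


primary = Member("primary", data_bearing=True)
secondary = Member("secondary", data_bearing=True)
arbiter = Member("arbiter", data_bearing=False)
replica_set = [primary, secondary, arbiter]

# Both data-bearing nodes have applied optime 5.
primary.last_applied = 5
secondary.last_applied = 5

# Replication is stopped on the secondary; the primary keeps writing.
primary.last_applied = 8
stalled = not index_build_can_scan(replica_set, optime_before_scan=8)

# Once replication restarts, the secondary catches up and the wait ends.
secondary.last_applied = 8
resumed = index_build_can_scan(replica_set, optime_before_scan=8)
```

In this model, stalled is true while the secondary lags, because the commit point stays at 5, and resumed becomes true only after the secondary catches up. In the real test nothing restarts replication during the index build, which is consistent with the timeout described above.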
