primary fails to rejoin set on restart


    • Type: Bug
    • Resolution: Done
    • Priority: Major - P3
    • Affects Version/s: 3.0.0-rc6
    • Component/s: Replication
    • Environment:
      RHEL 5.5 32-bit
    • Fully Compatible
    • Linux

      Start a 3-node replica set: 2 mongods and one arbiter.

      One mongod has priority 99; the other has priority 1.1.

      Start the replica set; the high-priority node becomes primary, as expected.

      Restart the primary after doing a few ops. The primary fails to rejoin the set and logs:

      2015-01-28T21:32:54.167+0000 W REPL     [ReplicationExecutor] Locally stored replica set configuration does not have a valid entry for the current node; waiting for reconfig or remote heartbeat; Got "NodeNotFound No host described in new configuration 1 for replica set 3fe0bcef-8fbd-425a-a6cf-06a3a098cc70 maps to this node" while validating { _id: "3fe0bcef-8fbd-425a-a6cf-06a3a098cc70", version: 1, members: [ { _id: 0, host: "127.0.0.1:1053", arbiterOnly: false, buildIndexes: true, hidden: false, priority: 99.0, tags: { ordinal: "one", dc: "ny" }, slaveDelay: 0, votes: 1 }, { _id: 1, host: "127.0.0.1:1054", arbiterOnly: false, buildIndexes: true, hidden: false, priority: 1.1, tags: { ordinal: "two", dc: "pa" }, slaveDelay: 0, votes: 1 }, { _id: 2, host: "127.0.0.1:1055", arbiterOnly: true, buildIndexes: true, hidden: false, priority: 1.0, tags: {}, slaveDelay: 0, votes: 1 } ], settings: { chainingAllowed: true, heartbeatTimeoutSecs: 10, getLastErrorModes: {}, getLastErrorDefaults: { w: 1, wtimeout: 0 } } }
      2015-01-28T21:32:54.167+0000 I REPL     [ReplicationExecutor] new replica set config in use: { _id: "3fe0bcef-8fbd-425a-a6cf-06a3a098cc70", version: 1, members: [ { _id: 0, host: "127.0.0.1:1053", arbiterOnly: false, buildIndexes: true, hidden: false, priority: 99.0, tags: { ordinal: "one", dc: "ny" }, slaveDelay: 0, votes: 1 }, { _id: 1, host: "127.0.0.1:1054", arbiterOnly: false, buildIndexes: true, hidden: false, priority: 1.1, tags: { ordinal: "two", dc: "pa" }, slaveDelay: 0, votes: 1 }, { _id: 2, host: "127.0.0.1:1055", arbiterOnly: true, buildIndexes: true, hidden: false, priority: 1.0, tags: {}, slaveDelay: 0, votes: 1 } ], settings: { chainingAllowed: true, heartbeatTimeoutSecs: 10, getLastErrorModes: {}, getLastErrorDefaults: { w: 1, wtimeout: 0 } } }
      2015-01-28T21:32:54.167+0000 I REPL     [ReplicationExecutor] transition to REMOVED
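
The log above suggests that, on startup, the node looks for an entry in its locally stored config that maps to itself and moves to REMOVED when it finds none. A minimal Python sketch of that kind of lookup follows. It is not the server's implementation: `find_self_index`, `state_after_load`, the `self_addresses` input, and the `"IN_CONFIG"` label are hypothetical names for illustration, and how mongod actually decides that a host "maps to this node" is not shown in this ticket.

```python
from typing import Iterable, Optional

# Members copied from the config printed in the log (tags and defaults omitted).
CONFIG = {
    "_id": "3fe0bcef-8fbd-425a-a6cf-06a3a098cc70",
    "version": 1,
    "members": [
        {"_id": 0, "host": "127.0.0.1:1053", "arbiterOnly": False, "priority": 99.0},
        {"_id": 1, "host": "127.0.0.1:1054", "arbiterOnly": False, "priority": 1.1},
        {"_id": 2, "host": "127.0.0.1:1055", "arbiterOnly": True, "priority": 1.0},
    ],
}


def find_self_index(config: dict, self_addresses: Iterable[str]) -> Optional[int]:
    """Return the index of the first member whose host is in self_addresses.

    self_addresses is a hypothetical input: the host:port strings this
    process believes refer to itself. Returns None when nothing matches.
    """
    addresses = set(self_addresses)
    for index, member in enumerate(config["members"]):
        if member["host"] in addresses:
            return index
    return None


def state_after_load(config: dict, self_addresses: Iterable[str]) -> str:
    """Return "REMOVED" when no member maps to this node, as in the log.

    "IN_CONFIG" is a placeholder label for the normal case, not a real
    replica set member state.
    """
    if find_self_index(config, self_addresses) is None:
        return "REMOVED"
    return "IN_CONFIG"
```

With this sketch, a node that recognizes `127.0.0.1:1053` as itself resolves to member index 0, while a node whose self-detection yields no matching address ends up in `"REMOVED"` even though its host string is present in the config, which matches the symptom reported here.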
      

      Found in the C++ (cxx) driver test suite on a RHEL 5.5 32-bit host.

      Server git version: ac9ee2fb80f2afc2737a0d9f346cff8117a82af2

        1. mongo1055.log
          111 kB
        2. mongo1054.log
          130 kB
        3. mongo1053.log
          28 kB

            Assignee:
            Scott Hernandez (Inactive)
            Reporter:
            Adam Midvidy (Inactive)
            Votes:
            0
            Watchers:
            4
