<!-- 
RSS generated by JIRA (9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66) at Thu Feb 08 03:27:47 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>MongoDB Jira</title>
    <link>https://jira.mongodb.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.7.1</version>
        <build-number>970001</build-number>
        <build-date>13-04-2023</build-date>
    </build-info>


<item>
            <title>[SERVER-12163] Replica Set failover time is more than 10 sec</title>
                <link>https://jira.mongodb.org/browse/SERVER-12163</link>
                <project id="10000" key="SERVER">Core Server</project>
                    <description>&lt;p&gt;We have three member replica set primary+secondary+arbiter across different virtual machines. &lt;/p&gt;

&lt;p&gt;To reproduced the issue we power the primary DB. We have observed that failover time is more than 10 secs.&lt;br/&gt;
What is optimal failover time. Is there any tuning parameter to reduce the failover time?&lt;/p&gt;

&lt;p&gt;Thu Dec 19 02:15:55.994 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsSyncNotifier&amp;#93;&lt;/span&gt; replset setting oplog notifier to sessionmgr01:27717&lt;br/&gt;
Thu Dec 19 02:19:09.121 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsHealthPoll&amp;#93;&lt;/span&gt; DBClientCursor::init call() failed&lt;br/&gt;
Thu Dec 19 02:19:09.121 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsHealthPoll&amp;#93;&lt;/span&gt; replSet info sessionmgr01:27717 is down (or slow to respond):&lt;br/&gt;
Thu Dec 19 02:19:09.121 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsHealthPoll&amp;#93;&lt;/span&gt; replSet member sessionmgr01:27717 is now in state DOWN&lt;br/&gt;
Thu Dec 19 02:19:09.122 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsMgr&amp;#93;&lt;/span&gt; replSet info electSelf 2&lt;br/&gt;
Thu Dec 19 02:19:17.124 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsHealthPoll&amp;#93;&lt;/span&gt; replset info sessionmgr01:27717 heartbeat failed, retrying&lt;br/&gt;
Thu Dec 19 02:19:28.071 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsBackgroundSync&amp;#93;&lt;/span&gt; Socket recv() timeout  192.168.92.59:27717&lt;br/&gt;
Thu Dec 19 02:19:28.071 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsBackgroundSync&amp;#93;&lt;/span&gt; SocketException: remote: 192.168.92.59:27717 error: 9001 socket exception &lt;span class=&quot;error&quot;&gt;&amp;#91;RECV_TIMEOUT&amp;#93;&lt;/span&gt; server &lt;span class=&quot;error&quot;&gt;&amp;#91;192.168.92.59:27717&amp;#93;&lt;/span&gt;&lt;br/&gt;
Thu Dec 19 02:19:28.072 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsBackgroundSync&amp;#93;&lt;/span&gt; replSet sync source problem: 10278 dbclient error communicating with server: sessionmgr01:27717&lt;br/&gt;
Thu Dec 19 02:19:28.072 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsSyncNotifier&amp;#93;&lt;/span&gt; Socket recv() timeout  192.168.92.59:27717&lt;br/&gt;
Thu Dec 19 02:19:28.072 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsSyncNotifier&amp;#93;&lt;/span&gt; SocketException: remote: 192.168.92.59:27717 error: 9001 socket exception &lt;span class=&quot;error&quot;&gt;&amp;#91;RECV_TIMEOUT&amp;#93;&lt;/span&gt; server &lt;span class=&quot;error&quot;&gt;&amp;#91;192.168.92.59:27717&amp;#93;&lt;/span&gt;&lt;br/&gt;
Thu Dec 19 02:19:28.072 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsSyncNotifier&amp;#93;&lt;/span&gt; replset tracking exception: exception: 10278 dbclient error communicating with server: sessionmgr01:27717&lt;br/&gt;
Thu Dec 19 02:19:28.072 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsMgr&amp;#93;&lt;/span&gt; replSet PRIMARY&lt;br/&gt;
Thu Dec 19 02:19:29.127 &lt;span class=&quot;error&quot;&gt;&amp;#91;rsHealthPoll&amp;#93;&lt;/span&gt; replset info sessionmgr01:27717 heartbeat failed, retrying&lt;/p&gt;
</description>
                <environment>CentOS</environment>
        <key id="103214">SERVER-12163</key>
            <summary>Replica Set failover time is more than 10 sec</summary>
                <type id="1" iconUrl="https://jira.mongodb.org/secure/viewavatar?size=xsmall&amp;avatarId=14703&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.mongodb.org/images/icons/priorities/critical.svg">Critical - P2</priority>
                        <status id="6" iconUrl="https://jira.mongodb.org/images/icons/statuses/closed.png" description="The issue is considered finished, the resolution is correct. Issues which are closed can be reopened.">Closed</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="9">Done</resolution>
                                        <assignee username="matt.dannenberg">Matt Dannenberg</assignee>
                                    <reporter username="amwankhe@cisco.com">Amit Wankhede</reporter>
                        <labels>
                    </labels>
                <created>Thu, 19 Dec 2013 09:31:35 +0000</created>
                <updated>Thu, 6 Apr 2023 15:20:09 +0000</updated>
                            <resolved>Thu, 19 Dec 2013 19:43:27 +0000</resolved>
                                    <version>2.4.6</version>
                                                    <component>Replication</component>
                                        <votes>0</votes>
                                    <watches>8</watches>
                                                                                                                <comments>
                            <comment id="474293" author="gianfranco" created="Mon, 23 Dec 2013 12:17:40 +0000"  >&lt;p&gt;The ticket Matt is referring is &lt;a href=&quot;https://jira.mongodb.org/browse/SERVER-10225&quot; title=&quot;Replica set failover speed improvement&quot; class=&quot;issue-link&quot; data-issue-key=&quot;SERVER-10225&quot;&gt;&lt;del&gt;SERVER-10225&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This is currently scheduled for 2.7.x (the development version of 2.8)&lt;/p&gt;

&lt;p&gt;Please click on &quot;Start watching this issue&quot; to receive email updates on the status of the feature request.&lt;/p&gt;</comment>
                            <comment id="473846" author="amwankhe@cisco.com" created="Fri, 20 Dec 2013 19:11:05 +0000"  >&lt;p&gt;Hi Matt,&lt;/p&gt;

&lt;p&gt;Can you please provide more insights on this.&lt;/p&gt;</comment>
                            <comment id="473829" author="mattd@10gen.com" created="Fri, 20 Dec 2013 18:40:31 +0000"  >&lt;p&gt;Turns out there is a &quot;speed up failovers&quot; ticket. It is now linked.&lt;/p&gt;</comment>
                            <comment id="473721" author="mattd@10gen.com" created="Fri, 20 Dec 2013 15:30:01 +0000"  >&lt;p&gt;At this time, there is no concrete plan to improve replica set failover time specifically. We are planning to rework replica set internals in the near future. We anticipate that this will have a positive effect on failover time.&lt;/p&gt;</comment>
                            <comment id="473641" author="amwankhe@cisco.com" created="Fri, 20 Dec 2013 09:24:04 +0000"  >&lt;p&gt;Hi Matt,&lt;/p&gt;

&lt;p&gt;Thanks for your comments.&lt;/p&gt;

&lt;p&gt;In which future release we will get the fix for failover time?&lt;/p&gt;

&lt;p&gt;regards,&lt;br/&gt;
Amit&lt;/p&gt;</comment>
                            <comment id="473293" author="mattd@10gen.com" created="Thu, 19 Dec 2013 19:19:53 +0000"  >&lt;p&gt;Failover time is not tunable.&lt;/p&gt;

&lt;p&gt;The way we detect a downed node is by a loss of heartbeats and heartbeat responses. Heartbeat responses time out after 10 seconds and then if we have not received a heartbeat from them in the past two seconds (they are sent every two seconds), we mark them as down. So it is common for the election process to take 10 seconds before it starts.&lt;/p&gt;

&lt;p&gt;There are many other variables such as the latency between nodes and where or not the first election is successful. As a result, we make no guarantees with regard to how long a failover will take.&lt;/p&gt;

&lt;p&gt;In a future release, we will be heavily reworking the internals of replication and a side effect of that should be reduced fail over time.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="82461">SERVER-10225</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10012">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                <customfield id="customfield_10050" key="com.atlassian.jira.toolkit:comments">
                        <customfieldname># Replies</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10055" key="com.atlassian.jira.ext.charting:firstresponsedate">
                        <customfieldname>Date of 1st Reply</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>Thu, 19 Dec 2013 18:38:07 +0000</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10052" key="com.atlassian.jira.toolkit:dayslastcommented">
                        <customfieldname>Days since reply</customfieldname>
                        <customfieldvalues>
                                        10 years, 8 weeks, 2 days ago
    
                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_18254" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Dependencies</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[]]></customfieldvalue>


                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_15850" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    <customfield id="customfield_10057" key="com.atlassian.jira.toolkit:lastusercommented">
                        <customfieldname>Last comment by Customer</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>true</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10056" key="com.atlassian.jira.toolkit:lastupdaterorcommenter">
                        <customfieldname>Last commenter</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>luke.bonanomi@mongodb.com</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_11151" key="com.atlassian.jira.toolkit:LastCommentDate">
                        <customfieldname>Last public comment date</customfieldname>
                        <customfieldvalues>
                            10 years, 8 weeks, 2 days ago
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10000" key="com.atlassian.jira.plugin.system.customfieldtypes:radiobuttons">
                        <customfieldname>Old_Backport</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10000"><![CDATA[No]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10032" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Operating System</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10020"><![CDATA[Linux]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_10051" key="com.atlassian.jira.toolkit:participants">
                        <customfieldname>Participants</customfieldname>
                        <customfieldvalues>
                                        <customfieldvalue>amwankhe@cisco.com</customfieldvalue>
            <customfieldvalue>gianfranco</customfieldvalue>
            <customfieldvalue>matt.dannenberg</customfieldvalue>
    
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        <customfield id="customfield_14254" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Product Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hrm527:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_12550" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>2|hruxsv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10558" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>94467</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_23361" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Requested By</customfieldname>
                        <customfieldvalues>
                                

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                            <customfield id="customfield_10750" key="com.atlassian.jira.plugin.system.customfieldtypes:textarea">
                        <customfieldname>Steps To Reproduce</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>&lt;p&gt;Power the primary DB VM and verify the secondary DB logs.&lt;/p&gt;</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                    <customfield id="customfield_10053" key="com.atlassian.jira.ext.charting:timeinstatus">
                        <customfieldname>Time In Status</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_22870" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Triagers</customfieldname>
                        <customfieldvalues>
                                

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                    <customfield id="customfield_14350" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>serverRank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hs7adr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                    </customfields>
    </item>
</channel>
</rss>