Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.13.2
Affects Version/s: 4.11.1
Component/s: async, Connection Mgmt
Labels:
- greenerbuild
- python

Case:
Confidence Status:
None

Assigned Teams:

Python Drivers

Documentation Changes:
Not Needed
Documentation Changes Summary:

Hide

1. What would you like to communicate to the user about this feature?
2. Would you like the user to see examples of the syntax and/or executable code and its output?
3. Which versions of the driver/connector does this apply to?

Show
1. What would you like to communicate to the user about this feature? 2. Would you like the user to see examples of the syntax and/or executable code and its output? 3. Which versions of the driver/connector does this apply to?

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Link:
None
Goal Name(s):
None

Detailed steps to reproduce the problem?

During routine mongdob cluster mainentance on atlas, our fast api async apps running on top of mongodb just freeze, resulting in a complete platform outage.
No trace is present, the server process/threads/coroutines just block.

We are monitoring overall request latency and it seems that for a short period, requests just return super slowly (requests that usually rake 100ms to 400ms now take 20-35 seconds (!!!)). No Pymongo exceptions are thrown whatsoever

Here's our client settings:

"minPoolSize": 32,
"maxPoolSize": 128,
"socketTimeoutMS": 5000

This is not the first time we've seen this behavior. Believing that socketTimeoutMS would be the key to trigger node failure detection, but that never happens.

What are we doing wrong? Which setting do we need to tweak in order to overcome these outages?

Definition of done: what must be done to consider the task complete?

See that pymongo async client properly handles topology changes during server maintenance routines

The exact Python version used, with patch level:

3.12.9

The exact version of PyMongo used, with patch level:

4.11.1

Describe how MongoDB is set up. Local vs Hosted, version, topology, load balanced, etc.

Atlas , M60 NVMe SSD Cluster residing in AWS Frankfurt Region

The operating system and version (e.g. Windows 7, OSX 10.8, ...)

Debian bullseye (12)

Web framework or asynchronous network library used, if any, with version (e.g. Django 1.7, mod_wsgi 4.3.0, gevent 1.0.1, Tornado 4.0.2, ...)

FastAPI 0.115.11

is related to

PYTHON-5271 Investigate slow task warnings for pymongo_server_monitor_task and pymongo_kill_cursors_thread

Backlog

related to

PYTHON-5219 Avoid awaiting coroutines while holding pool locks

Closed

Assignee:: Noah Stapp
Reporter:: Idan Sheinberg
Votes:: 1 Vote for this issue
Watchers:: 10 Start watching this issue

Created:: Mar 13 2025 09:55:56 PM UTC
Updated:: Jun 26 2025 11:49:54 AM UTC
Resolved:: Jun 16 2025 11:30:42 AM UTC

Details

Description

Detailed steps to reproduce the problem?

The exact Python version used, with patch level:

The exact version of PyMongo used, with patch level:

Describe how MongoDB is set up. Local vs Hosted, version, topology, load balanced, etc.

The operating system and version (e.g. Windows 7, OSX 10.8, ...)

Web framework or asynchronous network library used, if any, with version (e.g. Django 1.7, mod_wsgi 4.3.0, gevent 1.0.1, Tornado 4.0.2, ...)

Attachments

Issue Links

Forms

Activity

People

Dates