[SERVER-39355] Collection drops can block the server for long periods Created: 01/Feb/19 Updated: 28/Feb/19 Resolved: 14/Feb/19 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Storage |
| Affects Version/s: | 3.4.14 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Eric Milkie | Assignee: | Donald Anderson |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Attachments: |
|
||||||||||||||||||||||||
| Issue Links: |
|
||||||||||||||||||||||||
| Operating System: | ALL | ||||||||||||||||||||||||
| Sprint: | Storage Engines 2019-02-25 | ||||||||||||||||||||||||
| Participants: | |||||||||||||||||||||||||
| Story Points: | 1 | ||||||||||||||||||||||||
| Description |
|
Hi, sorry but we've just had another occurrence today (still running 3.4.13) so there's still an issue here. We've modified our code to drop collection to sleep 10 sec between each deletion (to give mongo some time to recover after the "short" global lock and not kill the platform) but unfortunately this wasn't enough and it killed the global performance:
After investigation I found that this was cause by some collection deletion. I tried to upload the diagnostic.data but the portal specified earlier doesn't accept files any more. I can upload it if you give another portal. Here is the log from the drop queries: mongo_drop_log.txt And before you say this is probably fixed in a more recent version, we'll need better proof than last time considering the high risk of upgrading... |
| Comments |
| Comment by Donald Anderson [ 14/Feb/19 ] |
|
bigbourin@gmail.com, I understand. I'm going to close this ticket, please reopen if you need any more help on this. |
| Comment by Adrien Jarthon [ 13/Feb/19 ] |
|
I see, thanks for the details. Looks like a possible cause indeed, we'll try to let you know after we update to 3.6 but we're kind of skeptic because of all the trouble we had with mongo upgrades in the past and all the regressions there were in 3.6 so far. |
| Comment by Donald Anderson [ 12/Feb/19 ] |
|
bigbourin@gmail.com, The switch to use WT cursor caching that is enabled by
Both |
| Comment by Adrien Jarthon [ 04/Feb/19 ] |
|
Thanks, the file is uploaded. |
| Comment by Kelsey Schubert [ 01/Feb/19 ] |
|
Secure upload portal for this issue. |
| Comment by Eric Milkie [ 01/Feb/19 ] |
|
kelsey.schubert can you set up a new portal for Adrien to upload the diagnostic data? |