[SERVER-72146] Make chunk migrations metrics more accessible from Atlas Created: 15/Dec/22  Updated: 29/Oct/23  Resolved: 18/Apr/23

Status: Closed
Project: Core Server
Component/s: Sharding
Affects Version/s: None
Fix Version/s: 7.0.0-rc0, 7.1.0-rc0, 6.0.6, 5.0.18, 6.3.2

Type: Improvement Priority: Major - P3
Reporter: Jordi Serra Torrens Assignee: Marcos José Grillo Ramirez
Resolution: Fixed Votes: 0
Labels: shardingemea-qw
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Backports
Depends
Documented
is documented by DOCS-16159 Investigate changes in SERVER-72146: ... Closed
Related
is related to SERVER-76278 Log extra migration information Closed
is related to SERVER-76341 Add range deletion throughput statistic Closed
Assigned Teams:
Sharding EMEA
Backwards Compatibility: Fully Compatible
Backport Requested:
v7.0, v6.3, v6.0, v5.0
Sprint: Sharding EMEA 2023-01-09, Sharding EMEA 2023-01-23, Sharding EMEA 2023-02-06, Sharding EMEA 2023-02-20, Sharding EMEA 2023-03-06, Sharding EMEA 2023-03-20, Sharding EMEA 2023-04-03, Sharding EMEA 2023-04-17
Participants:
Story Points: 4.5

 Description   

Often, when investigating HELP tickets related to balancing, we need to access and combine data from FTDC, logs and configdump to figure some basic metrics such as:

  • Migration throughput (how fast is this shard cloning data)
  • Range deleter throughout (how fast is this shard executing its range deletions)
  • Number of orphans documents (how many orphans documents are waiting to be deleted)

The following statistics should be available on serverStatus under the shardingStatistics group (besides the currently already available):

  • countDocsClonedOnCatchUpOnRecipient: the number of documents cloned during the catch up phase of the migration
  • countBytesClonedOnCatchUpOnRecipient: the number of bytes cloned during the catch up phase of the migration
  • countDonorMoveChunkCommitted: the total number of migrations committed by the node
  • countDonorMoveChunkAborted: the number of migrations aborted in the node
  • totalDonorMoveChunkTimeMillis: the total amount of time a migration took from beginning to end
  • totalRecipientCriticalSectionTimeMillis: the amount of time in milliseconds the recipient of a migration spent holding the critical section


 Comments   
Comment by Githook User [ 20/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add statistic to count the number of bytes cloned from the donor of a migration

(cherry picked from commit a61cb340c33e123b273071488658811be5361d60)
Branch: v6.0
https://github.com/mongodb/mongo/commit/6320d8fc17d90d563bdd8c879c080a81b083fd55

Comment by Githook User [ 20/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add statistic to count the number of bytes cloned from the donor of a migration

(cherry picked from commit a61cb340c33e123b273071488658811be5361d60)
Branch: v6.3
https://github.com/mongodb/mongo/commit/1d511ce7b15a160e3312b75b16a3c160b2e05438

Comment by Githook User [ 20/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add statistic to count the number of bytes cloned from the donor of a migration

(cherry picked from commit a61cb340c33e123b273071488658811be5361d60)
Branch: v7.0
https://github.com/mongodb/mongo/commit/e3f788a7d7c32fbaf41ef8a8e0e51d109b2247ec

Comment by Githook User [ 20/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add statistic to count the number of bytes cloned from the donor of a migration

(cherry picked from commit a61cb340c33e123b273071488658811be5361d60)
Branch: v5.0
https://github.com/mongodb/mongo/commit/e89d32846e69d4afb4ae4e240538d7215b9cc739

Comment by Githook User [ 20/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add statistic to count the number of bytes cloned from the donor of a migration
Branch: master
https://github.com/mongodb/mongo/commit/a61cb340c33e123b273071488658811be5361d60

Comment by Githook User [ 19/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add extra migration statistics including the total bytes cloned before and during catch up, committed and aborted migrations and total time

(cherry picked from commit 9fae6f63e917c33d6b4bee6d57446a3b6111b96a)
(cherry picked from commit c7b8de3848fa6703599085541f9c7af6cf959288)
Branch: v5.0
https://github.com/mongodb/mongo/commit/98fc8a3bb804407cf61b4ee82b0c3bd55a16b950

Comment by Githook User [ 19/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add extra migration statistics including the total bytes cloned before and during catch up, committed and aborted migrations and total time

(cherry picked from commit 9fae6f63e917c33d6b4bee6d57446a3b6111b96a)
Branch: v6.0
https://github.com/mongodb/mongo/commit/c7b8de3848fa6703599085541f9c7af6cf959288

Comment by Githook User [ 19/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add extra migration statistics including the total bytes cloned before and during catch up, committed and aborted migrations and total time

(cherry picked from commit 9fae6f63e917c33d6b4bee6d57446a3b6111b96a)
Branch: v6.3
https://github.com/mongodb/mongo/commit/90b7607c7a076311704a040fb25f7e5f5079ec15

Comment by Githook User [ 19/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add extra migration statistics including the total bytes cloned before and during catch up, committed and aborted migrations and total time

(cherry picked from commit 9fae6f63e917c33d6b4bee6d57446a3b6111b96a)
Branch: v7.0
https://github.com/mongodb/mongo/commit/e7002a7a01ce3993fb96e33dfff44dffc86faaca

Comment by Githook User [ 17/Apr/23 ]

Author:

{'name': 'Marcos José Grillo Ramirez', 'email': 'marcos.grillo@mongodb.com', 'username': 'm4nti5'}

Message: SERVER-72146 Add extra migration statistics including the total bytes cloned before and during catch up, committed and aborted migrations and total time
Branch: master
https://github.com/mongodb/mongo/commit/9fae6f63e917c33d6b4bee6d57446a3b6111b96a

Generated at Thu Feb 08 06:20:56 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.