Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Won't Fix
Priority: Minor - P4
Fix Version/s: None
Affects Version/s: None
Component/s: Diagnostics, MMAPv1, Storage
Labels:
None

Assigned Teams:

Storage Execution
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Right now we ignore records headers when computing collection data size (mongo/db/namespace_details.h):

DiskLoc deletedList[Buckets];
// ofs 168 (8 byte aligned)
struct Stats {
    // datasize and nrecords MUST Be adjacent code assumes!
    long long datasize; // this includes padding, but not record headers
    long long nrecords;
} stats;

However, the value includes padding and also includes the alignment that we use in our storage engine.

When the number of records hits 500,000,000 the headers overhead is 7+ GB, which is perceived as a lost disk space. I suggest to include record headers size into the coll.stats().size metric, or to introduce an additional sizeWithHeaders metric to avoid confusion.

Assignee:: [DO NOT USE] Backlog - Storage Execution Team
Reporter:: Alexander Komyagin (Inactive)
Participants:: [DO NOT USE] Backlog - Storage Execution Team, Alexander Komyagin
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Feb 27 2014 09:44:17 PM UTC
Updated:: Dec 06 2022 05:10:23 AM UTC
Resolved:: Sep 14 2018 08:03:55 PM UTC

Details

Description

Attachments

Activity

People

Dates