Investigate WiredTiger metadata corruption detected error while recovering from logs.

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None
    • Storage Engines, Storage Engines - Persistence
    • StorEng - Defined Pipeline, SE Persistence - 2025-05-09
    • 8
    • 0

      I couldn't pinpoint the problematic commit in Evergreen. Also, the "antithesis_replica_sets_lag_oplog_application_jscore_passthrough" task in Evergreen didn't fail on May 4th as mentioned, but the error message seems concerning. I've created this ticket to reproduce and investigate this suspicious failure.

      Error Message - "WiredTiger metadata corruption detected"

      Here is the stack trace of the failure.

      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":-31802,"message":{"ts_sec":1746343282,"ts_usec":880472,"thread":"24:0x7f068a9f4e00","session_name":"txn-recover","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __posix_file_read(WT_FILE_HANDLE *, WT_SESSION *, wt_off_t, size_t, void *):535:/data/db/journal/WiredTigerLog.0000000003: handle-read: pread: failed to read 128 bytes at offset 128","error_str":"WT_ERROR: non-specific WiredTiger error","error_code":-31802}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":-31802,"message":{"ts_sec":1746343282,"ts_usec":891064,"thread":"24:0x7f068a9f4e00","session_name":"txn-recover","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __log_fs_read(WT_SESSION_IMPL *, WT_FH *, wt_off_t, size_t, void *):189:journal/WiredTigerLog.0000000003: log read failure","error_str":"WT_ERROR: non-specific WiredTiger error","error_code":-31802}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":-31802,"message":{"ts_sec":1746343282,"ts_usec":930445,"thread":"24:0x7f068a9f4e00","session_name":"txn-recover","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __wt_txn_recover(WT_SESSION_IMPL *, const char **):1269:Recovery failed","error_str":"WT_ERROR: non-specific WiredTiger error","error_code":-31802}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":0,"message":{"ts_sec":1746343282,"ts_usec":968441,"thread":"24:0x7f068a9f4e00","session_name":"connection","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __wti_cache_destroy(WT_SESSION_IMPL *):151:cache server: exiting with 7 pages in memory and 0 pages evicted"}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":0,"message":{"ts_sec":1746343282,"ts_usec":970152,"thread":"24:0x7f068a9f4e00","session_name":"connection","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __wti_cache_destroy(WT_SESSION_IMPL *):156:cache server: exiting with 165230 image bytes in memory"}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":0,"message":{"ts_sec":1746343282,"ts_usec":970585,"thread":"24:0x7f068a9f4e00","session_name":"connection","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __wti_cache_destroy(WT_SESSION_IMPL *):159:cache server: exiting with 199182 bytes in memory"}}}
      "c":"WT",       "id":22435,   "ctx":"initandlisten","msg":"WiredTiger error message","attr":{"error":0,"message":{"ts_sec":1746343282,"ts_usec":970586,"thread":"24:0x7f068a9f4e00","session_name":"connection","category":"WT_VERB_DEFAULT","category_id":12,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"int __wti_cache_destroy(WT_SESSION_IMPL *):167:cache server: exiting with 198498 bytes dirty and 6 pages dirty"}}}
      "c":"STORAGE",  "id":22347,   "ctx":"initandlisten","msg":"Failed to start up WiredTiger under any compatibility version. This may be due to an unsupported upgrade or downgrade."}
      "c":"STORAGE",  "id":22348,   "ctx":"initandlisten","msg":"WiredTiger metadata corruption detected"}
      "c":"STORAGE",  "id":50944,   "ctx":"initandlisten","msg":"Please read the documentation for starting MongoDB with --repair here: http://dochub.mongodb.org/core/repair"}
      "c":"ASSERT",   "id":23091,   "ctx":"initandlisten","msg":"Fatal assertion","attr":{"msgid":50944,"location":"src/mongo/db/storage/wiredtiger/wiredtiger_kv_engine.cpp:815:52:void mongo::WiredTigerKVEngine::_openWiredTiger(const std::string &, const std::string &)"}}
      "c":"ASSERT",   "id":23092,   "ctx":"initandlisten","msg":"\n\n***aborting after fassert() failure\n\n"}
      
      

            Assignee:
            Unassigned
            Reporter:
            Ravi Giri
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

              Created:
              Updated: