[SERVER-31638] WiredTiger has failed to open its metadata Created: 19/Oct/17  Updated: 14/Aug/18  Resolved: 20/Oct/17

Status: Closed
Project: Core Server
Component/s: WiredTiger
Affects Version/s: 3.2.17
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Brian Rumburg Assignee: Mark Agarunov
Resolution: Done Votes: 0
Labels: docker, envc, rpo, rps, trcf, wtc
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

docker mongo:3.2


Attachments: File WiredTiger.turtle     File WiredTiger.wt     File repair-SERVER-31638.tar.gz    
Operating System: Linux
Participants:

 Description   

After an unclean shutdown of a docker container running the mongo:3.2 tag, mongod fails to start. I have tried running --repair with both the :3.2 and :latest tags, with no success.

I've attached my WiredTiger.wt and WiredTiger.turtle files for your review.

Any ideas? Thanks so much!

Here is the console log:

2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] MongoDB starting : pid=11 port=27017 dbpath=/data/db 64-bit host=87a2c0329e9e
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] db version v3.4.6
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] git version: c55eb86ef46ee7aede3b1e2a5d184a7df4bfb5b5
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] OpenSSL version: OpenSSL 1.0.1t 3 May 2016
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] allocator: tcmalloc
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] modules: none
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] build environment:
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] distmod: debian81
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] distarch: x86_64
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] target_arch: x86_64
2017-10-19T08:01:24.423+0000 I CONTROL [initandlisten] options: { storage: { dbPath: "/data/db" } }
2017-10-19T08:01:24.429+0000 I - [initandlisten] Detected data files in /data/db created by the 'wiredTiger' storage engine, so setting the active storage engine to 'wiredTiger'.
2017-10-19T08:01:24.432+0000 I STORAGE [initandlisten] wiredtiger_open config: create,cache_size=487M,session_max=20000,eviction=(threads_min=4,threads_max=4),config_base=false,statistics=(fast),log=(enabled=true,archive=true,path=journal,compressor=snappy),file_manager=(close_idle_time=100000),checkpoint=(wait=60,log_size=2GB),statistics_log=(wait=0),
2017-10-19T08:01:24.445+0000 E STORAGE [initandlisten] WiredTiger error (-31802) [1508400084:445032][11:0x7f21bad9ad00], file:WiredTiger.wt, connection: unable to read root page from file:WiredTiger.wt: WT_ERROR: non-specific WiredTiger error
2017-10-19T08:01:24.445+0000 E STORAGE [initandlisten] WiredTiger error (0) [1508400084:445083][11:0x7f21bad9ad00], file:WiredTiger.wt, connection: WiredTiger has failed to open its metadata
2017-10-19T08:01:24.445+0000 E STORAGE [initandlisten] WiredTiger error (0) [1508400084:445111][11:0x7f21bad9ad00], file:WiredTiger.wt, connection: This may be due to the database files being encrypted, being from an older version or due to corruption on disk
2017-10-19T08:01:24.445+0000 E STORAGE [initandlisten] WiredTiger error (0) [1508400084:445186][11:0x7f21bad9ad00], file:WiredTiger.wt, connection: You should confirm that you have opened the database with the correct options including all encryption and compression options
2017-10-19T08:01:24.446+0000 I - [initandlisten] Assertion: 28595:-31802: WT_ERROR: non-specific WiredTiger error src/mongo/db/storage/wiredtiger/wiredtiger_kv_engine.cpp 269
2017-10-19T08:01:24.446+0000 I STORAGE [initandlisten] exception in initAndListen: 28595 -31802: WT_ERROR: non-specific WiredTiger error, terminating
2017-10-19T08:01:24.447+0000 I NETWORK [initandlisten] shutdown: going to close listening sockets...
2017-10-19T08:01:24.447+0000 I NETWORK [initandlisten] removing socket file: /tmp/mongodb-27017.sock
2017-10-19T08:01:24.447+0000 I NETWORK [initandlisten] shutdown: going to flush diaglog...
2017-10-19T08:01:24.447+0000 I CONTROL [initandlisten] now exiting
2017-10-19T08:01:24.447+0000 I CONTROL [initandlisten] shutting down with code:100
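
When a log like the one above is long, it can help to pull out only the WiredTiger error lines and their numeric codes. The following is a minimal Python sketch, assuming the plain-text line layout seen in this log (timestamp, severity letter, component, bracketed context, message); the function name wiredtiger_errors and both regular expressions are illustrative and not part of MongoDB.

```python
import re

# Assumed layout of a mongod 3.x plain-text log line, as seen in the log above:
# <timestamp> <severity> <component> [<context>] <message>
LOG_LINE = re.compile(
    r"^(?P<ts>\S+)\s+(?P<severity>[A-Z])\s+(?P<component>\S+)\s+"
    r"\[(?P<context>[^\]]+)\]\s+(?P<message>.*)$"
)

# Matches the "WiredTiger error (<code>)" prefix inside a message.
WT_ERROR = re.compile(r"WiredTiger error \((?P<code>-?\d+)\)")


def wiredtiger_errors(lines):
    """Return (timestamp, code, message) for each error-severity WiredTiger line."""
    found = []
    for line in lines:
        parsed = LOG_LINE.match(line.strip())
        # Skip lines that do not fit the layout or are not severity "E".
        if not parsed or parsed.group("severity") != "E":
            continue
        wt = WT_ERROR.search(parsed.group("message"))
        if wt:
            found.append(
                (parsed.group("ts"), int(wt.group("code")), parsed.group("message"))
            )
    return found
```

Passing the lines of the log above to wiredtiger_errors would surface the -31802 entry alongside the follow-up messages that carry code 0.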



 Comments   
Comment by Mark Agarunov [ 20/Oct/17 ]

Hello brianrumburg,

Thanks for your response. I'm glad to hear that this fixed the issue and everything is working again. To prevent this type of problem in the future, we recommend implementing regular backups and/or replication to mitigate any issues related to unreliable storage layers or server failures.

Thanks,
Mark

Comment by Brian Rumburg [ 20/Oct/17 ]

These files fixed it right up. Thanks! -Brian

Comment by Mark Agarunov [ 19/Oct/17 ]

Hello brianrumburg,

Thank you for the report. I've attached a repair attempt of the files you've provided. Would you please extract these files and replace them in your $dbpath and let us know if it resolves the issue? If you are still seeing errors after replacing these files, please provide the complete logs from mongod so that we can further investigate. Additionally, if this issue persists, please provide the following information:

  1. What kind of underlying storage mechanism are you using? Are the storage devices attached locally or over the network? Are the disks SSDs or HDDs? What kind of RAID and/or volume management system are you using?
  2. Would you please check the integrity of your disks?
  3. Has the database always been running this version of MongoDB? If not, please describe the upgrade/downgrade cycles the database has been through.
  4. Have you manipulated (copied or moved) the underlying database files? If so, was mongod running?
  5. Have you ever restored this instance from backups?
  6. What method do you use to create backups?
  7. When was the underlying filesystem last checked and is it currently marked clean?
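
The file replacement requested above could be done with a backup step first, so the original files remain available if the repaired copies do not help. This is a minimal Python sketch under several assumptions: mongod is stopped, the archive holds regular files that belong at the top level of the dbpath (the actual layout of the attached archive is not described in this ticket), and the function name replace_with_repaired and the backup directory naming are hypothetical.

```python
import shutil
import tarfile
import time
from pathlib import Path


def replace_with_repaired(archive, dbpath, backup_root):
    """Back up dbpath files the archive would overwrite, then write the repaired copies.

    Assumes mongod is NOT running and that every regular file in the archive
    belongs directly under dbpath (directory prefixes in the archive are dropped).
    """
    dbpath = Path(dbpath)
    # Hypothetical naming scheme: one timestamped backup directory per run.
    backup_dir = Path(backup_root) / f"pre-repair-{int(time.time())}"
    backup_dir.mkdir(parents=True)

    replaced = []
    with tarfile.open(archive) as tar:  # compression is auto-detected
        for member in tar.getmembers():
            if not member.isfile():
                continue
            name = Path(member.name).name
            target = dbpath / name
            # Keep a copy of the existing file before overwriting it.
            if target.exists():
                shutil.copy2(target, backup_dir / name)
            with tar.extractfile(member) as src, open(target, "wb") as dst:
                shutil.copyfileobj(src, dst)
            replaced.append(name)
    return backup_dir, replaced
```

Writing each file explicitly, rather than extracting the whole archive in place, keeps the originals recoverable from the backup directory and avoids creating unexpected subdirectories inside the dbpath.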

Thanks,
Mark
