[SERVER-5049] Unexpected restart on machine with disabled swap file Created: 23/Feb/12  Updated: 07/Apr/23  Resolved: 09/Jul/12

Status: Closed
Project: Core Server
Component/s: Stability
Affects Version/s: 2.0.2
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: chris rozacki Assignee: Scott Hernandez (Inactive)
Resolution: Incomplete Votes: 0
Labels: windows
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

ESXi2 virtual server,windows 2003 r2 x64, 3GB, 20 GB HD, no swap file, oplog collection set to 128 MB, journal enabled.
Memory Res ~ 500M


Backwards Compatibility: Fully Compatible
Operating System: Windows
Participants:

 Description   

Mongo crashed and restarted.
Os still had over 1 GB HD free, threw warning that disk is at or near capacity.

mongo logs:

Wed Feb 22 20:16:26 [conn215] query paymentshield.metrics.campaign.hour ntoreturn:1 nreturned:1 reslen:873 0ms
Wed Feb 22 20:16:26 [conn215] running multiple plans
Wed Feb 22 20:16:26 [conn215] update paymentshield.metrics.campaign.hour query: { _id:

{ campaign: "paymentshield_data_sauce", order: 982218, period: 0.04166666666666666, start: 40925.75, stop: 40925.79166666666, tenant: "paymentshield" }

, _id.order: 982218, _id.start: 40925.75, _id.stop: 40925.79166666666, _id.period: 0.04166666666666666, _id.tenant: "paymentshield", _id.campaign: "paymentshield_data_sauce" } update: { $inc:

{ value.sr: 0.0, value.talk_time: 0.0, value.wrap_time: 0.0, value.wait_time: 0.0, value.switch_outcomes.not_connects: 1.0, value.not_connects: 1.0, value.switch_outcomes.o2: 1.0, value.total_calls: 1.0 }

, $set: { value.rt.csid: "

{f82ccfbc-5264-4720-a22f-3a062fb2d9a7}" } } 0ms
Wed Feb 22 20:16:26 [conn215] run command admin.$cmd { getlasterror: 1 }
Wed Feb 22 20:16:26 [conn215] command admin.$cmd command: { getlasterror: 1 } ntoreturn:1 reslen:101 0ms
Wed Feb 22 20:16:26 [conn215] query paymentshield.metrics.campaign.hour ntoreturn:1 idhack:1 reslen:873 0ms
Wed Feb 22 20:16:26 [conn215] command paymentshield.$cmd command: { findAndModify: "metrics.campaign.hour", query: { _id.order: 982218, _id.start: 40925.75, _id.stop: 40925.79166666666, _id.period: 0.04166666666666666, _id.tenant: "paymentshield", _id.campaign: "paymentshield_data_sauce" }, update: { $inc: { value.sr: 0.0, value.talk_time: 0.0, value.wrap_time: 0.0, value.wait_time: 0.0, value.switch_outcomes.not_connects: 1.0, value.not_connects: 1.0, value.switch_outcomes.o2: 1.0, value.total_calls: 1.0 }, $set: { value.rt.csid: "{f82ccfbc-5264-4720-a22f-3a062fb2d9a7}

" } }, new: true, upsert: true } ntoreturn:1 reslen:995 0ms
Wed Feb 22 20:16:26 [conn215] running multiple plans
Wed Feb 22 20:16:26 [conn215] update paymentshield.metrics.campaign.hour query:

{ _id.order: 982218, _id.start: 40925.75, _id.stop: 40925.79166666666, _id.period: 0.04166666666666666, _id.tenant: "paymentshield", _id.campaign: "paymentshield_data_sauce" }

update: { $set:

{ value.talk_time_avg: 0.0001821921395232462, value.wrap_time_avg: 1.201746721856711e-005, value.wait_time_avg: 6.562445437795926e-005, value.connect_rate: 0.0, value.not_connect_rate: 89.08500270709259, value.abandon_rate: 100.0 }

} 0ms
Wed Feb 22 20:16:26 [conn215] run command admin.$cmd

{ getlasterror: 1 }

Wed Feb 22 20:16:26 [conn215] command admin.$cmd command:

{ getlasterror: 1 }

ntoreturn:1 reslen:101 0ms
Wed Feb 22 20:16:26 [conn215] query paymentshield.paymentshield_campaign_N9sxyPqSH2MJdwA3oNdl6Zg9wjz2xxJW ntoreturn:1 nreturned:1 reslen:1071 0ms
Wed Feb 22 20:16:26 [conn215] running multiple plans
Wed Feb 22 20:16:26 [conn215] run command admin.$cmd

{ getlasterror: 1 }

Wed Feb 22 20:16:26 dur1
Wed Feb 22 20:17:24 [initandlisten] removeJournalFiles end
Wed Feb 22 20:17:24 [initandlisten] recover done
Wed Feb 22 20:17:24 [initandlisten] Accessing: local for the first time
Wed Feb 22 20:17:24 [initandlisten] query local.system.namespaces reslen:20 0ms
Wed Feb 22 20:17:24 [initandlisten] enter repairDatabases (to check pdfile version #)
Wed Feb 22 20:17:24 [initandlisten] default
Wed Feb 22 20:17:24 [initandlisten] Accessing: default for the first time
Wed Feb 22 20:17:24 [initandlisten] local
Wed Feb 22 20:17:24 [initandlisten] paymentshield
Wed Feb 22 20:17:24 [initandlisten] Accessing: paymentshield for the first time
Wed Feb 22 20:17:24 [initandlisten] done repairDatabases
Wed Feb 22 20:17:24 BackgroundJob starting: snapshot
Wed Feb 22 20:17:24 BackgroundJob starting: ClientCursorMonitor
Wed Feb 22 20:17:24 BackgroundJob starting: PeriodicTask::Runner
Wed Feb 22 20:17:24 [rsStart] replSet beginning startup...
Wed Feb 22 20:17:24 BackgroundJob starting: ConnectBG



 Comments   
Comment by Ian Whalen (Inactive) [ 09/Apr/12 ]

Chris, just checking in to see if you can let us know whether there was more of the mongo logs or anything in the system eventlog?

Comment by Scott Hernandez (Inactive) [ 24/Feb/12 ]

Is there anything in windows system event logs at the time when the process restarts?

Comment by Eliot Horowitz (Inactive) [ 24/Feb/12 ]

Not seeing a crash or anything about out of memory.
Was there more of the log?

Generated at Thu Feb 08 03:07:45 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.