[SERVER-9546] failure in zbigMapReduce.js on windows 64 debug Created: 02/May/13  Updated: 11/Jul/16  Resolved: 10/May/13

Status: Closed
Project: Core Server
Component/s: MapReduce
Affects Version/s: None
Fix Version/s: 2.5.0

Type: Bug Priority: Major - P3
Reporter: Ian Whalen (Inactive) Assignee: Ben Becker
Resolution: Done Votes: 0
Labels: buildbot
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Operating System: ALL
Participants:

 Description   

http://buildlogs.mongodb.org/build/51824031d2a60f31d2000107/test/5182e052d2a60f407500080c/

Thu May 02 18:42:27.713 [conn13] KERNELBASE.dll
                                           DebugBreak+0x2
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\util\debug_util.cpp(46)
mongo::mongo_breakpoint+0x22
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\util\debug_util.h(65)
mongo::breakpoint+0x25
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\util\assert_util.cpp(121)
mongo::verifyFailed+0x2b0
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\bson\bson-inl.h(183)
mongo::BSONElement::embeddedObject+0x6a
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\commands\mr.cpp(845)
mongo::mr::State::finalReduce+0x622
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\commands\mr.cpp(1266)
mongo::mr::MapReduceCommand::run+0x151b
 m31100| Thu May 02 18:42:27.713 [conn124] warning: we think data is
in ram but system says no
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\dbcommands.cpp(2015)
mongo::_execCommand+0xb0
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\dbcommands.cpp(2126)
mongo::Command::execCommand+0xb19
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\dbcommands.cpp(2218)
mongo::_runCommands+0x3f6
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\ops\query.cpp(50)
mongo::runCommands+0x7b
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\ops\query.cpp(955)
mongo::runQuery+0x48c
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\instance.cpp(235)
mongo::receivedQuery+0x2f0
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\instance.cpp(374)
mongo::assembleResponse+0x353
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\db\db.cpp(194)
mongo::MyMessageHandler::process+0x133
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\mongo\util\net\message_server_port.cpp(206)
mongo::PortMessageServer::handleIncomingMsg+0x676
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\third_party\boost\boost\bind\bind.hpp(244)
boost::_bi::list1<boost::_bi::value<mongo::PortMessageServer::HandleIncomingMsgParam
* __ptr64> >::operator()<void * __ptr64,void * __ptr64 (__cdecl*)(void
* __ptr64),boost::_bi::list0>+0x56
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\third_party\boost\boost\bind\bind_template.hpp(21)
boost::_bi::bind_t<void * __ptr64,void * __ptr64 (__cdecl*)(void *
__ptr64),boost::_bi::list1<boost::_bi::value<mongo::PortMessageServer::HandleIncomingMsgParam
* __ptr64> > >::operator()+0x61
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\third_party\boost\boost\thread\detail\thread.hpp(63)
boost::detail::thread_data<boost::_bi::bind_t<void * __ptr64,void *
__ptr64 (__cdecl*)(void *
__ptr64),boost::_bi::list1<boost::_bi::value<mongo::PortMessageServer::HandleIncomingMsgParam
* __ptr64> > > >::run+0x2f
 m31100| Thu May 02 18:42:27.713 [conn13] mongod.exe
...\src\third_party\boost\libs\thread\src\win32\thread.cpp(185)
boost::`anonymous namespace'::thread_start_function+0x4c
 m31100| Thu May 02 18:42:27.713 [conn13] writing minidump diagnostic
file mongo.dmp

according to mathias, probably this link: https://github.com/mongodb/mongo/blob/master/src/mongo/db/commands/mr.cpp#L845



 Comments   
Comment by auto [ 10/May/13 ]

Author:

{u'date': u'2013-05-10T20:11:01Z', u'name': u'Eric Milkie', u'email': u'milkie@10gen.com'}

Message: SERVER-9546 larger oplog for zBigMapReduce
Branch: master
https://github.com/mongodb/mongo/commit/bfede8d086719ace81c92bfc1560b170c81637b0

Comment by auto [ 08/May/13 ]

Author:

{u'date': u'2013-05-08T17:38:43Z', u'name': u'Ben Becker', u'email': u'ben.becker@10gen.com'}

Message: SERVER-9546: abort M/R if DBClientCursor returns a $err
Branch: master
https://github.com/mongodb/mongo/commit/0ecc01dfd716d73e47e1af7290496a60db1373c0

Comment by Ben Becker [ 03/May/13 ]

One issue is that finalReduce() calls DBClientWithCommands::getIndexes(), but the expected index document is not returned. If the query code hits a uassert (in this case, 13453 'not master and slaveOk=false'), the result document may not be valid. See L842 for where the query is interrupted by the following uassert:

 m31100| Thu May 02 18:42:24.375 [conn13] assertion 13435 not master and slaveOk=false ns:test.system.indexes query:{ ns: "test.tmp.mr.foo_29_inc" }
 m31100| Thu May 02 18:42:24.375 [conn13]   Assertion failure isABSONObj() D:\slave\Windows_64bit_DEBUG\mongo\src\mongo/bson/bson-inl.h 183

The call to DBClientWithCommands::getIndexes() from finalReduce() needs to ensure the command completed successfully, and if not, either retry or fail the whole operation.

Generated at Thu Feb 08 03:20:45 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.