[SERVER-10116] zBigMapReduce.js failure on Linux 64-bit debug dur off Created: 05/Jul/13  Updated: 10/Dec/14  Resolved: 11/Jul/13

Status: Closed
Project: Core Server
Component/s: Testing Infrastructure
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Matt Kangas Assignee: Eric Milkie
Resolution: Duplicate Votes: 0
Labels: buildbot
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

buildbot: Linux 64-bit debug dur off


Issue Links:
Duplicate
duplicates SERVER-10098 zbigMapReduce.js failure on Windows 6... Closed
Operating System: ALL
Participants:

 Description   

This appears to be a different failure than SERVER-10098 occurring on Windows

Linux 64-bit debug dur off Build #2467 on July 05

http://buildlogs.mongodb.org/Linux%2064-bit%20debug%20dur%20off/builds/2467/test/sharding/zbigMapReduce.js

Fri Jul  5 12:12:37.807 map reduce failed:{
	"code" : 10276,
	"ok" : 0,
	"errmsg" : "exception: DBClientBase::findN: transport error: tp1.10gen.cc:31100 ns: test.$cmd query: { mapreduce.shardedfinish: { mapreduce: \"foo\", map: function () { emit(this.val, 1); }, reduce: function (key, values) { return Array.sum(values); }, query: { i: { $gte: 51200.0 } }, out: { reduce: \"big_out_reduce\", nonAtomic: true } }, inputDB: \"test\", shardedOutputCollection: \"tmp.mrs.foo_1373044229_16\", shards: { test-rs0/tp1.10gen.cc:31100,tp1.10gen.cc:31101: { result: \"tmp.mrs.foo_1373044229_16\", timeMillis: 10685, counts: { input: 5521, emit: 5521, reduce: 0, output: 5521 }, ok: 1.0 }, test-rs1/tp1.10gen.cc:31200,tp1.10gen.cc:31201: { result: \"tmp.mrs.foo_1373044229_16\", timeMillis: 57897, counts: { input: 45679, emit: 45679, reduce: 0, output: 45679 }, ok: 1.0 } }, shardCounts: { test-rs0/tp1.10gen.cc:31100,tp1.10gen.cc:31101: { input: 5521, emit: 5521, reduce: 0, output: 5521 }, test-rs1/tp1.10gen.cc:31200,tp1.10gen.cc:31201: { input: 45679, emit: 45679, reduce: 0, output: 45679 } }, counts: { emit: 51200, input: 51200, output: 51200, reduce: 0 } }"
} at src/mongo/shell/collection.js:975
failed to load: /home/yellow/buildslave/Linux_64bit_debug_dur_off/mongo/jstests/sharding/zbigMapReduce.js

Digging through recent build history I see several other occurrences of this

http://buildbot.mongodb.org/builders/Linux%2064-bit%20debug%20dur%20off?numbuilds=50

Jul 05 17:38	074054ecb223...	failure	#2468	Failed test
Jul 05 10:41	1961a5d66cee...	failure	#2467	Failed test_11
Jul 03 23:26	5f949c19a260...	success	#2466	Build successful
Jul 03 16:40	c11ea25c522c...	failure	#2465	Failed test_11
Jul 03 15:38	de8ae2e2d4fa...	failure	#2464	Failed test_7
Jul 03 14:34	cd3101ca2f8e...	failure	#2463	Failed test_7
Jul 03 14:10	00fb45cd97d2...	failure	#2462	Failed compile
Jul 03 13:09	3457954bb082...	failure	#2461	Failed test_7
Jul 03 11:40	7dff601fa540...	failure	#2460	Failed test_7
Jul 03 11:04	f397dbcf4378...	failure	#2459	Failed test
Jul 03 10:14	665e6ec217b1...	failure	#2458	Failed test
Jul 03 09:21	b2b5e6663136...	failure	#2457	Failed test
Jul 03 06:39	2b400fabe2fb...	failure	#2456	Failed test
Jul 02 19:27	5741e9a35cfd...	failure	#2455	Failed test
Jul 02 12:23	a9226d98c4eb...	failure	#2454	Failed test_11
Jul 02 09:58	91235163ae72...	failure	#2453	Failed test
Jul 02 09:02	91235163ae72...	failure	#2452	Failed test_1
Jul 01 21:59	718012dbfc53...	failure	#2451	Failed test
Jul 01 19:44	2051cfd7b9e2...	failure	#2450	Failed test
Jul 01 18:11	90f1d8947a26...	failure	#2449	Failed test
Jul 01 17:44	d15b27260f62...	failure	#2448	Failed test
Jul 01 17:15	01f1dc100623...	failure	#2447	Failed test
Jul 01 17:01	f45a24632fa4...	failure	#2446	Failed compile
Jul 01 16:38	50962f5188cc...	failure	#2445	Failed test
Jul 01 15:28	50962f5188cc...	failure	#2444	Failed test
Jul 01 14:24	1ef1d5118231...	failure	#2443	Failed test
Jul 01 12:26	2267744af0e9...	failure	#2442	Failed test_1
Jul 01 11:52	59878c039978...	failure	#2441	Failed test
Jun 29 23:06	85ca38c33e4a...	failure	#2440	Failed test_11

I have confirmed that these failures occurred on zBigMapReduce.js with "exception: DBClientBase::findN: transport error: tp1.10gen.cc:31100"

  • #2440 Jun 29 (OLDEST FAILURE)
  • #2441 July 1
  • (Builds 2442-2453: unit test & compile failures)
  • #2454 July 2
  • and pretty much constantly until latest build
    • EXCLUDING #2466 which was oddly successful

I find no evidence of this occurring prior to these builds:

Jun 29 00:51	babd275f8818...	success	#2438	Build successful
Jun 28 17:26	0a4b206a2938...	success	#2437	Build successful
Jun 28 10:05	a3f89bd33d23...	success	#2436	Build successful
Jun 27 22:59	86e76e34e88c...	success	#2435	Build successful


Generated at Thu Feb 08 03:22:19 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.