Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Done
Priority: Major - P3
Fix Version/s: 3.5.9
Affects Version/s: 3.0.5
Component/s: Performance, Querying
Labels:
- bi-performance
- optimization

Backwards Compatibility:
Fully Compatible
Sprint:
Query 2017-05-08, Query 2017-05-29, Query 2017-06-19, Query 2017-07-10
Case:
Linked BF Score:
0
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Allow the planner to generate whole covered index scans when possible, rather than defaulting to a collection scan. This feature is disabled by default, as there is a chance that this will cause a performance regression for certain workloads and deployments, especially when all the data in a collection fits in memory.

To enable this feature, set the internalQueryPlannerGenerateCoveredWholeIndexScans query knob. For example, you can start mongod like

$ mongod --setParameter internalQueryPlannerGenerateCoveredWholeIndexScans=1

Original Description

In cases where the query predicate is empty, the selected plan will always involve a collection scan:

> db.foo.explain().find({}, {a: 1, b: 1, _id: 0})
{
	"waitedMS" : NumberLong(0),
	"queryPlanner" : {
		"plannerVersion" : 1,
		"namespace" : "test.foo",
		"indexFilterSet" : false,
		"parsedQuery" : {
			"$and" : [ ]
		},
		"winningPlan" : {
			"stage" : "PROJECTION",
			"transformBy" : {
				"a" : 1,
				"b" : 1
			},
			"inputStage" : {
				"stage" : "COLLSCAN",
				"filter" : {
					"$and" : [ ]
				},
				"direction" : "forward"
			}
		},
		"rejectedPlans" : [ ]
	},
	"serverInfo" : {
		"host" : "franklinia",
		"port" : 27017,
		"version" : "3.1.7-pre-",
		"gitVersion" : "none"
	},
	"ok" : 1
}

This is likely to be the most efficient plan if you want to read all of the documents in a collection, but in some cases it may not be the most efficient plan. e.g. In cases where the projection is a small subset of the fields, and it can be covered by an index scan, an index scan may be preferred to avoid fetching documents. The query system should consider such plans.

> db.foo.getIndices()
[
	{
		"v" : 1,
		"key" : {
			"_id" : 1
		},
		"name" : "_id_",
		"ns" : "test.foo"
	},
	{
		"v" : 1,
		"key" : {
			"a" : 1,
			"b" : 1,
			"c" : 1
		},
		"name" : "a_1_b_1_c_1",
		"ns" : "test.foo"
	}
]
> db.foo.find({a: {$lt: 0}}).count() // All documents have 'a' >= 0
0
> db.foo.explain().find({a: {$gte: 0}}, {a: 1, b: 1, _id: 0})
{
	"waitedMS" : NumberLong(0),
	"queryPlanner" : {
		"plannerVersion" : 1,
		"namespace" : "test.foo",
		"indexFilterSet" : false,
		"parsedQuery" : {
			"f0" : {
				"$gte" : 0
			}
		},
		"winningPlan" : {
			"stage" : "PROJECTION",
			"transformBy" : {
				"a" : 1,
				"b" : 1,
				"_id" : 0
			},
			"inputStage" : {
				"stage" : "IXSCAN",
				"keyPattern" : {
					"a" : 1,
					"b" : 1,
					"c" : 1
				},
				"indexName" : "a_1_b_1_c_1",
				"isMultiKey" : false,
				"isUnique" : false,
				"isSparse" : false,
				"isPartial" : false,
				"indexVersion" : 1,
				"direction" : "forward",
				"indexBounds" : {
					"a" : [
						"[0.0, inf.0]"
					],
					"b" : [
						"[MinKey, MaxKey]"
					],
					"c" : [
						"[MinKey, MaxKey]"
					]
				}
			}
		},
		"rejectedPlans" : [ ]
	},
	"serverInfo" : {
		"host" : "franklinia",
		"port" : 27017,
		"version" : "3.1.7-pre-",
		"gitVersion" : "none"
	},
	"ok" : 1
}

is duplicated by

SERVER-3342 utilize index for full collection scan in some cases

Closed

SERVER-30130 Covered query with empty find() without specifying hint

Closed

is related to

SERVER-59341 Query planner choses collscan over covered index scan for huge collection

Closed

SERVER-4507 aggregation: optimize $group to take advantage of sorted sequences

Backlog

SERVER-98479 Revisit full covered index scans as collection scan alternative

Backlog

related to

SERVER-29444 Use a covered and streaming plan for $group, $sum:1 queries

Backlog

SERVER-23406 index scan is slower than full collection scan in some scenarios

Closed

(2 related to)

Assignee:: Kyle Suarez (Inactive)
Reporter:: Charlie Swanson
Participants:: Charlie Swanson, Chris Harris, Githook User, Kyle Suarez
Votes:: 1 Vote for this issue
Watchers:: 20 Start watching this issue

Created:: Aug 20 2015 06:32:18 PM UTC
Updated:: Dec 19 2024 08:14:20 PM UTC
Resolved:: Jun 20 2017 01:31:57 PM UTC
Confidence Status Last Update:: 01/May/17 7:46 PM

Details

Description

Original Description

Attachments

Issue Links

Forms

Activity

People

Dates