Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.0.0-rc0, 5.0.27, 7.0.9, 7.3.2, 6.0.16
Affects Version/s: None
Component/s: None
Labels:
- qi-timeseries

Assigned Teams:

Query Integration
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v7.3, v7.0, v6.0, v5.0
Steps To Reproduce:

Hide

> db.createCollection("ts", {timeseries: {timeField: "time", metaField: "tag"}});
> db.ts.insert({_id: 0, time: ISODate("2024-01-01T00:00:00.000Z"), tag: "A"});
> db.ts.aggregate([{$addFields: {time: {$dateFromParts: {year: "$tag.none"}}}}, {$project: {tag: 1}}]);
{ "tag" : "A", "_id" : 0, "time" : null }

Show
> db.createCollection("ts", {timeseries: {timeField: "time", metaField: "tag"}}); > db.ts.insert({_id: 0, time: ISODate("2024-01-01T00:00:00.000Z"), tag: "A"}); > db.ts.aggregate( [{$addFields: {time: {$dateFromParts: {year: "$tag.none"}}}}, {$project: {tag: 1}}] ); { "tag" : "A", "_id" : 0, "time" : null }
Linked BF Score:
13
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

As can be seen in the repro script, the special timeseries field time is actually ~~renamed~~ replaced by the $addFields stage by computing the value referencing another special field tag's any subfield and then the next $project stage excludes it but the timeseries optimization does not exclude it as follows:

> db.ts.explain().aggregate([{$addFields: {time: {$dateFromParts: {year: "$tag.none"}}}}, {$project: {tag: 1}}]);
{
	"explainVersion" : "1",
	"stages" : [
		{
			"$cursor" : {
				"queryPlanner" : {
					"namespace" : "test.system.buckets.ts",
					"indexFilterSet" : false,
					"parsedQuery" : {

					},
					"queryHash" : "FCBE9F38",
					"planCacheKey" : "64E90EFC",
					"optimizationTimeMillis" : 1,
					"maxIndexedOrSolutionsReached" : false,
					"maxIndexedAndSolutionsReached" : false,
					"maxScansToExplodeReached" : false,
					"prunedSimilarIndexes" : false,
					"winningPlan" : {
						"isCached" : false,
						"stage" : "COLLSCAN",
						"direction" : "forward"
					},
					"rejectedPlans" : [ ]
				}
			}
		},
		{
			"$addFields" : {
				"time" : {
					"$dateFromParts" : {
						"year" : "$meta.none"
					}
				}
			}
		},
		{
			"$_internalUnpackBucket" : {
				"include" : [
					"_id",
					"tag"
				],
				"timeField" : "time",
				"metaField" : "tag",
				"bucketMaxSpanSeconds" : 3600,
				"assumeNoMixedSchemaData" : true,
				"computedMetaProjFields" : [
					"time"
				]
			}
		}
	],

It does not seems that dependency tracking or inclusion/exclusion tracking for special timeseries fields work correctly and it has been introduced around 7.3 timeframe, seeing it's failing on v7.3 branch as well.

Ideally, we would want to totally remove $addFields as it's excluded by the subsequent $project while optimizaing the pipeline

The simplest fix would be to not push down $addFields when it actually renames the timeseries special field timeField.

related to

SERVER-87961 Time series $group rewrite may produce incorrect results when a preceding $project stage projects out accessed fields

Closed

Assignee:: Erin Zhu
Reporter:: Yoon Soo Kim (Inactive)
Participants:: Erin Zhu, Githook User, Yoon Soo Kim
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Mar 14 2024 05:39:35 PM UTC
Updated:: Apr 16 2024 08:48:15 PM UTC
Resolved:: Mar 25 2024 01:38:34 PM UTC
Confidence Status Last Update:: 14/Mar/24 7:47 PM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates