[SERVER-78790] Investigate getMore command executor error in fcv_upgrade_downgrade_sharded_collections_jscore_passthrough suite Created: 07/Jul/23  Updated: 18/Jul/23

Status: Backlog
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Adi Zaimi Assignee: Backlog - Query Optimization
Resolution: Unresolved Votes: 0
Labels: sharding-nyc-subteam3
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Assigned Teams:
Query Optimization
Operating System: ALL
Participants:

 Description   

Test server6570.js failed in a patch build with the following:

[js_test:server6570] 2023-07-07T14:07:17.384Z assert: command failed: {
[js_test:server6570]    "ok" : 0,
[js_test:server6570]    "errmsg" : "Error on remote shard localhost:20500 :: caused by :: Executor error during getMore :: caused by :: only numbers and dates are allowed in an $add expression",
[js_test:server6570]    "code" : 7157723,
[js_test:server6570]    "codeName" : "Location7157723",
[js_test:server6570]    "$clusterTime" : {
[js_test:server6570]            "clusterTime" : Timestamp(1688738837, 1),
[js_test:server6570]            "signature" : { 
[js_test:server6570]                    "hash" : BinData(0,"AAAAAAAAAAAAAAAAAAAAAAAAAAA="),
[js_test:server6570]                    "keyId" : NumberLong(0)
[js_test:server6570]            }
[js_test:server6570]    },
[js_test:server6570]    "operationTime" : Timestamp(1688738837, 1)
[js_test:server6570] } with original command request: {
[js_test:server6570]    "getMore" : NumberLong("4604890520509920003"),
[js_test:server6570]    "collection" : "s6570",
[js_test:server6570]    "lsid" : {
[js_test:server6570]            "id" : UUID("a1e16d3c-c8ec-4b40-a4e4-afa095fb8c8c")
[js_test:server6570]    },      
[js_test:server6570]    "$clusterTime" : {
[js_test:server6570]            "clusterTime" : Timestamp(1688738837, 1),
[js_test:server6570]            "signature" : { 
[js_test:server6570]                    "hash" : BinData(0,"AAAAAAAAAAAAAAAAAAAAAAAAAAA="),
[js_test:server6570]                    "keyId" : NumberLong(0)
[js_test:server6570]            }
[js_test:server6570]    }

This is the Evergreen (evg) log link:
https://parsley.mongodb.com/resmoke/fcea0f4752cb3669ea8263f018655d3b/all?bookmarks=0,444125,444135,444160,444171,444191,444192,444195,444612,444615,444952,1279401&shareLine=444872



 Comments   
Comment by Adi Zaimi [ 18/Jul/23 ]

For more background on the suite, please look at https://jira.mongodb.org/browse/PM-3087 or ping me.

Comment by Adi Zaimi [ 18/Jul/23 ]

As far as we can tell, the test itself behaves correctly: the error code should be converted as per jstests/libs/sbe_assert_error_override.js, but somehow the failure is not being caught. We think the query does use SBE (queryFramework: "sbe" in the log).
For background, this suite was added recently. It runs a background hook that continuously sends FCV upgrade/downgrade commands to the server. FCV upgrade/downgrade aborts all transactions, so that might be what triggers the failure here.
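
For readers unfamiliar with the error-code conversion mentioned above, here is a minimal sketch of the general idea, not the actual contents of jstests/libs/sbe_assert_error_override.js. The function names, the table layout, and the pairing of 16554 with 7157723 are assumptions made for illustration. Only 7157723 appears in the log above.

```javascript
// Hypothetical equivalence groups. The real list lives in
// jstests/libs/sbe_assert_error_override.js and may differ from this.
const equivalentErrorCodes = [
    [16554, 7157723],  // assumed pairing for the $add type error
];

// Returns every code treated as equivalent to `code`, including `code` itself.
function lookupEquivalentErrorCodes(code) {
    const group = equivalentErrorCodes.find((g) => g.includes(code));
    return group ? group : [code];
}

// Returns true when the response failed with a code equivalent to the expected one.
function failedWithEquivalentCode(res, expectedCode) {
    return res.ok === 0 && lookupEquivalentErrorCodes(expectedCode).includes(res.code);
}
```

Under these assumptions, a response carrying code 7157723 would satisfy a test that expects 16554. Such a check only helps if the failing response actually passes through the overridden assertion. In the log above the error surfaced from a getMore, which may be a path that bypasses the override.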

Generated at Thu Feb 08 06:39:15 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.