Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Critical - P2
Fix Version/s: None
Affects Version/s: 2.5.4
Component/s: Text Search
Labels:
None

Operating System:
ALL
Confidence Status:
None
Work Order:
0
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

textIndexVersion:2 indexes observe subdocument language annotations (and correctly index nested arrays, if not directly nested). The text search matcher needs to invoke the correct language stemmer on text contained in subdocuments (and examine fields in nested arrays for determining a match), but doesn't.

Reproduce with:

> db.foo.ensureIndex({"a.b":"text"})
> db.foo.insert({a:[{b:["example content"]}]}) // note indexed nested arrays
Insert WriteResult({ "ok" : 1, "n" : 1 })
> db.foo.find({$text:{$search:"example content"}}) // correct
{ "_id" : ObjectId("52aa57a0ae39c4212eb00625"), "a" : [ { "b" : [ "example content" ] } ] }
> db.foo.find({$text:{$search:"example -content"}}) // incorrect: should return empty result set
{ "_id" : ObjectId("52aa57a0ae39c4212eb00625"), "a" : [ { "b" : [ "example content" ] } ] }
> db.foo.find({$text:{$search:"example \"content\""}}) // incorrect: should not return empty result set
>

is duplicated by

SERVER-12162 Update fts_matcher to use sub-document language-aware tokenization

Closed

Assignee:: J Rassi (Inactive)
Reporter:: J Rassi (Inactive)
Participants:: Githook User, J Rassi
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Dec 13 2013 01:05:29 AM UTC
Updated:: Jul 11 2016 05:38:55 PM UTC
Resolved:: Jan 30 2014 03:58:09 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates