[SERVER-18514] Improve Multi-Word Term Search performance with RLP Created: 18/May/15  Updated: 19/Sep/15  Resolved: 04/Jun/15

Status: Closed
Project: Core Server
Component/s: Text Search
Affects Version/s: 3.1.2
Fix Version/s: 3.1.4

Type: Bug Priority: Major - P3
Reporter: Mark Benvenuto Assignee: Mark Benvenuto
Resolution: Done Votes: 0
Labels: FT
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
Backwards Compatibility: Fully Compatible
Operating System: ALL
Sprint: Platform 4 06/05/15
Participants:

 Description   

During text query analysis, we call into RLP for each term. Since RLP is more expensive then our Basic FTS algorithm, we should amortize the cost of this analysis by only calling into RLP once per set of positive, and negative terms.

The best approach is pass a space (U+0020) delimited set of terms into RLP. This will work since space is a word boundary. See http://www.unicode.org/reports/tr29/#Word_Boundaries.



 Comments   
Comment by Githook User [ 02/Jun/15 ]

Author:

{u'username': u'markbenvenuto', u'name': u'Mark Benvenuto', u'email': u'mark.benvenuto@mongodb.com'}

Message: SERVER-18514: Improve Multi-Word Term Search performance with RLP
Branch: master
https://github.com/mongodb/mongo/commit/2c1e081ac6fa825b0499198799b1345c56329b6d

Generated at Thu Feb 08 03:47:55 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.