[SERVER-18514] Improve Multi-Word Term Search performance with RLP Created: 18/May/15 Updated: 19/Sep/15 Resolved: 04/Jun/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Text Search |
| Affects Version/s: | 3.1.2 |
| Fix Version/s: | 3.1.4 |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Mark Benvenuto | Assignee: | Mark Benvenuto |
| Resolution: | Done | Votes: | 0 |
| Labels: | FT | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Issue Links: |
|
||||
| Backwards Compatibility: | Fully Compatible | ||||
| Operating System: | ALL | ||||
| Sprint: | Platform 4 06/05/15 | ||||
| Participants: | |||||
| Description |
|
During text query analysis, we call into RLP for each term. Since RLP is more expensive than our Basic FTS algorithm, we should amortize the cost of this analysis by calling into RLP only once per set of positive and negative terms. The best approach is to pass a space (U+0020) delimited set of terms into RLP. This works because space is a word boundary. See http://www.unicode.org/reports/tr29/#Word_Boundaries. |
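The batching idea described above can be sketched as follows. This is a minimal illustration, not the server's code: `CountingAnalyzer`, `analyze_per_term`, and `analyze_batched` are hypothetical names, and the analyzer is a stand-in that only lowercases and splits on U+0020. It does not call RLP or mirror its API; it only shows how joining terms with a space reduces the number of analyzer calls.

```python
from typing import List


class CountingAnalyzer:
    """Hypothetical stand-in for an expensive analyzer (NOT the RLP API)."""

    def __init__(self) -> None:
        self.calls = 0  # number of times analyze() was invoked

    def analyze(self, text: str) -> List[str]:
        self.calls += 1
        # Placeholder analysis: treat U+0020 as the word boundary and lowercase.
        return [token.lower() for token in text.split(" ") if token]


def analyze_per_term(analyzer: CountingAnalyzer, terms: List[str]) -> List[str]:
    """One analyzer call per term (the costly pattern)."""
    tokens: List[str] = []
    for term in terms:
        tokens.extend(analyzer.analyze(term))
    return tokens


def analyze_batched(analyzer: CountingAnalyzer, terms: List[str]) -> List[str]:
    """One analyzer call for the whole set, terms joined by U+0020."""
    if not terms:
        return []
    return analyzer.analyze(" ".join(terms))


if __name__ == "__main__":
    terms = ["Quick", "Brown", "Fox"]
    per_term, batched = CountingAnalyzer(), CountingAnalyzer()
    print(analyze_per_term(per_term, terms), per_term.calls)
    print(analyze_batched(batched, terms), batched.calls)
```

Under these assumptions both functions return the same tokens, but the batched variant makes a single analyzer call regardless of how many terms are in the set. The same helper could be invoked separately for a list of positive terms and a list of negative terms.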
| Comments |
| Comment by Githook User [ 02/Jun/15 ] |
|
Author: Mark Benvenuto (markbenvenuto, mark.benvenuto@mongodb.com) Message: |