[SERVER-20123] Full text search with "english" indexing does not allow searching dropped words even in quotes Created: 26/Aug/15  Updated: 26/Aug/15  Resolved: 26/Aug/15

Status: Closed
Project: Core Server
Component/s: Index Maintenance, Text Search
Affects Version/s: 3.0.6
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Brendan Molloy Assignee: Unassigned
Resolution: Duplicate Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Duplicate
duplicates SERVER-10062 Add user configurable stop word lists... Backlog
Operating System: ALL
Steps To Reproduce:
  1. Create a collection foo with a record: {{ { line: "what the hell" }

    }}.

  2. Create full text index for line
  3. Do the queries as listed in description.
Participants:

 Description   

Real world example of issue: searching the phrase what the renders no results. Searching "what the" also renders no results. However, searching "what the hell" finds results.

It seems that quite simply, dropped words are not indexed at all, so even when you do an explicit search (ie, quoted phrase) you will always receive 0 results.

I am reporting this as a bug as I'm unsure if this was intentional. If it was, then this is an enhancement request to either:

  1. Let us disable or otherwise configure the drop words for a specific language
  2. Enhance indexing to handle this.


 Comments   
Comment by J Rassi [ 26/Aug/15 ]

Hi,

This behavior is in fact intentional (MongoDB drops stop words during the indexing process for text indexes, see the $text documentation for more information).

I'm closing this ticket as a dup of feature request SERVER-10062 ("Add user configurable stop word lists for text search"). Feel free to add yourself as a watcher to that ticket for updates on its progress.

~ Jason Rassi

Generated at Thu Feb 08 03:53:13 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.