[SERVER-17084] Full text search does not find text in typographic quotes Created: 28/Jan/15  Updated: 02/Feb/15  Resolved: 02/Feb/15

Status: Closed
Project: Core Server
Component/s: Index Maintenance
Affects Version/s: 2.4.10
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Tobias Pfeiffer Assignee: Ramon Fernandez Marina
Resolution: Duplicate Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
related to SERVER-13099 Expand set of delimiters recognized b... Closed
Operating System: ALL
Steps To Reproduce:

> db.createCollection("test")
{ "ok" : 1 }
> db.test.insert({"title": "hallo"})
> db.test.insert({"title": "„hallo“"})
> db.test.find()
{ "_id" : ObjectId("54c8737202907f6af18809a2"), "title" : "hallo" }
{ "_id" : ObjectId("54c8737202907f6af18809a3"), "title" : "„hallo“" }
> db.test.ensureIndex({"title" : "text"})
> db.test.runCommand("text", {"search": "hallo"})
{
	"queryDebugString" : "hallo||||||",
	"language" : "english",
	"results" : [
		{
			"score" : 1.1,
			"obj" : {
				"_id" : ObjectId("54c8737202907f6af18809a2"),
				"title" : "hallo"
			}
		}
	],
	"stats" : {
		"nscanned" : 1,
		"nscannedObjects" : 0,
		"n" : 1,
		"nfound" : 1,
		"timeMicros" : 97
	},
	"ok" : 1
}

Participants:

 Description   

In my application I tend to use proper typographic quotes for names etc., like „hallo“ (to take a German example). If I do that, the text in quotes is not found by fulltext search. However, it should be found.



 Comments   
Comment by J Rassi [ 02/Feb/15 ]

Resolving this ticket as a dup of SERVER-13099.

Comment by Ramon Fernandez Marina [ 01/Feb/15 ]

tgpfeiffer, this is currently expected behavior: typographic quotes are not considered text delimiters (find the code here).

Comment by Tobias Pfeiffer [ 01/Feb/15 ]

Is there anything I can do to provide more information?

Generated at Thu Feb 08 03:43:15 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.