Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.0.0-rc7
Component/s: Querying
Labels: None
Operating System: ALL
Confidence Status: None
Work Order: 3
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

When text search lowercases Turkish words, it applies English rules instead of Turkish rules. In Turkish, the lowercase form of "I" is "ı", not "i" (note the missing dot on the glyph); the dotted lowercase "i" pairs with the dotted uppercase "İ".

Load Script:

db.turk.drop()

db.turk.insert({ _id: "small_dotless", t1 : "quıt" })
db.turk.insert({ _id: "small_dot", t1 : "quit" })
db.turk.insert({ _id: "big_dotless", t1 : "QUIT" })
db.turk.insert({ _id: "big_dot", t1 : "QUİT" })

db.turk.ensureIndex( { t1 : "text"} , {default_language : "turkish" })

Actual Results:

> db.turk.find( {$text: {$search: "quit" }})
{ "_id" : "small_dot", "t1" : "quit" }
{ "_id" : "big_dotless", "t1" : "QUIT" }

Expected Results:

> db.turk.find( {$text: {$search: "quit" }})
{ "_id" : "small_dot", "t1" : "quit" }
{ "_id" : "big_dot", "t1" : "QUİT" }
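
The gap between the actual and expected results can be reproduced outside the server with a small sketch. It assumes the current behavior amounts to ASCII-only lowercasing and that matching is a plain comparison of folded strings; stemming and the real tokenizer are ignored. The helpers asciiLower, turkishLower, and matches are hypothetical names for illustration and are not MongoDB functions.

```javascript
// Assumption: the reported behavior is equivalent to ASCII-only lowercasing.
function asciiLower(s) {
  return s.replace(/[A-Z]/g, c => c.toLowerCase());
}

// Hypothetical Turkish-aware lowercasing: map the two Turkish-specific
// capitals first, then fall back to the default lowercasing.
function turkishLower(s) {
  return s
    .replace(/\u0130/g, "i")  // "İ" (dotted capital) -> "i"
    .replace(/I/g, "\u0131")  // "I" (dotless capital) -> "ı"
    .toLowerCase();
}

// Same four documents as the load script, with the Turkish letters escaped.
const docs = [
  { _id: "small_dotless", t1: "qu\u0131t" }, // "quıt"
  { _id: "small_dot", t1: "quit" },
  { _id: "big_dotless", t1: "QUIT" },
  { _id: "big_dot", t1: "QU\u0130T" },       // "QUİT"
];

// Return the _id of every document whose folded t1 equals the folded term.
function matches(fold, term) {
  return docs.filter(d => fold(d.t1) === fold(term)).map(d => d._id);
}

const actualIds = matches(asciiLower, "quit");
const expectedIds = matches(turkishLower, "quit");

console.log(actualIds);   // [ 'small_dot', 'big_dotless' ]
console.log(expectedIds); // [ 'small_dot', 'big_dot' ]
```

Under these assumptions, the ASCII fold reproduces the Actual Results and the Turkish fold reproduces the Expected Results.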

Duplicates: SERVER-8423 Text search case folding needs utf-8 support (Closed)

Assignee: Matt Kangas (Inactive)
Reporter: Mark Benvenuto
Participants: Mark Benvenuto, Matt Kangas
Votes: 0
Watchers: 4

Created: Feb 03 2015 07:32:49 PM UTC
Updated: Feb 03 2015 09:02:39 PM UTC
Resolved: Feb 03 2015 09:02:39 PM UTC
