  1. Documentation
  2. DOCS-9550

Docs for SERVER-8423: Text search case folding needs utf-8 support

Details

    • Type: Task
    • Resolution: Duplicate
    • Priority: Major - P3

    Description

      Engineering Ticket Description:

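      For example, in a Russian-language query, "Как" currently lowercases to itself, whereas it should lowercase to "как".

      Correct case folding is needed for stopword removal, matching, etc. In the session below, the lowercase phrase query returns no results, while the identically capitalized one matches the document.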

      > db.foo.insert({content:"Как дела?"})
      > db.foo.ensureIndex({content:"text"},{default_language:"russian"})
      > db.foo.runCommand("text",{search:"\"как дела\""})
      {
      	"queryDebugString" : "дел||||как дела||",
      	"language" : "russian",
      	"results" : [ ],
      	"stats" : {
      		"nscanned" : 0,
      		"nscannedObjects" : 0,
      		"n" : 0,
      		"nfound" : 0,
      		"timeMicros" : 104
      	},
      	"ok" : 1
      }
      > db.foo.runCommand("text",{search:"\"Как дела\""})
      {
      	"queryDebugString" : "Как|дел||||Как дела||",
      	"language" : "russian",
      	"results" : [
      		{
      			"score" : 1,
      			"obj" : {
      				"_id" : ObjectId("510aa82ddb47733460b47eff"),
      				"content" : "Как дела?"
      			}
      		}
      	],
      	"stats" : {
      		"nscanned" : 1,
      		"nscannedObjects" : 0,
      		"n" : 1,
      		"nfound" : 1,
      		"timeMicros" : 118
      	},
      	"ok" : 1
      }
      > 
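
The symptom above is what one would expect if case folding only handled ASCII letters. The ticket does not say how the server lowercases terms, so the following is only a sketch under that assumption: `asciiOnlyLower` is a hypothetical helper written for illustration, not MongoDB code, and it is compared with JavaScript's built-in, Unicode-aware `String.prototype.toLowerCase`.

```javascript
// Hypothetical helper (an assumption, not the server's code):
// lowercases only ASCII A-Z and leaves every other character untouched.
function asciiOnlyLower(s) {
  return s.replace(/[A-Z]/g, function (c) {
    return String.fromCharCode(c.charCodeAt(0) + 32);
  });
}

const term = "Как";

// ASCII-only folding leaves the Cyrillic capital "К" as it is.
const asciiFolded = asciiOnlyLower(term);

// Unicode-aware folding produces the form the ticket asks for.
const unicodeFolded = term.toLowerCase();

console.log(asciiFolded === term);    // ASCII-only folding changed nothing
console.log(unicodeFolded === "как"); // Unicode-aware folding lowercased "К"
```

With ASCII-only folding, the indexed term and a lowercase query term never compare equal, which is consistent with the empty result for "как дела" in the session above.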

          People

            steve.renaker@mongodb.com Steve Renaker (Inactive)
            emily.hall Emily Hall

            Dates

              Created:
              Updated:
              Resolved:
              5 years, 50 weeks, 6 days ago