Loading...

XML

Word

Printable

JSON

Type: New Feature
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.0.11
Component/s: Querying
Labels:
None

Assigned Teams:

Query Optimization
Case:
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Provide a way to use regular expressions in MongoDB where the word character (\w) and word boundary (\b) escapes work for code points greater than or equal to 256.

Original description

$regex word boundary fails by treating Danish ø character as a non-character

db.collection.find({ "name" : { "$regex" : ".*\\bden\\b.*" , "$options" : "i"} })

returns a document:

{  "name": "Death Is A Caress(Døden Er Et Kjærtegn).sub" }

related to

SERVER-7218 Turn on PCRE_UCP config option to pcre build to enable some regex characters (\b \B \d etc) to work with UTF8 characters

Backlog

Assignee:: [DO NOT USE] Backlog - Query Optimization
Reporter:: Nic Cottrell (Personal) (Inactive)
Participants:: [DO NOT USE] Backlog - Query Optimization, Asya Kamsky, David Storch, Nic Cottrell (Personal)
Votes:: 2 Vote for this issue
Watchers:: 15 Start watching this issue

Created:: Apr 22 2016 05:06:28 PM UTC
Updated:: Dec 06 2022 04:27:12 AM UTC

Details

Description

Original description

Attachments

Issue Links

Activity

People

Dates