[SERVER-19961] Text search case folding needs support for 'Full' case folding Created: 14/Aug/15  Updated: 28/Dec/23

Status: Backlog
Project: Core Server
Component/s: Text Search
Affects Version/s: 3.1.6
Fix Version/s: None

Type: Improvement Priority: Major - P3
Reporter: Adam Chelminski (Inactive) Assignee: Backlog - Query Integration
Resolution: Unresolved Votes: 0
Labels: qi-text-search, query-44-grooming
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Assigned Teams:
Query Integration
Participants:

 Description   

Text index v3 supports case folding for Unicode's 'C', 'S', and 'T' mappings (see ftp://ftp.unicode.org/Public/UNIDATA/CaseFolding.txt), which are all one character to one character.

There are about 100 'F' mappings where a character will case fold to more than one character. This includes the latin sharp s (ß) which case folds to "ss".

To support these there needs to be an additional pass when lowercasing a unicode::String that supports expanding the resulting string.


Generated at Thu Feb 08 03:52:42 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.