[SERVER-19961] Text search case folding needs support for 'Full' case folding Created: 14/Aug/15 Updated: 28/Dec/23 |
|
| Status: | Backlog |
| Project: | Core Server |
| Component/s: | Text Search |
| Affects Version/s: | 3.1.6 |
| Fix Version/s: | None |
| Type: | Improvement | Priority: | Major - P3 |
| Reporter: | Adam Chelminski (Inactive) | Assignee: | Backlog - Query Integration |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | qi-text-search, query-44-grooming | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Assigned Teams: |
Query Integration
|
| Participants: |
| Description |
|
Text index v3 supports case folding for Unicode's 'C', 'S', and 'T' mappings (see ftp://ftp.unicode.org/Public/UNIDATA/CaseFolding.txt), which are all one character to one character. There are about 100 'F' mappings where a character will case fold to more than one character. This includes the latin sharp s (ß) which case folds to "ss". To support these there needs to be an additional pass when lowercasing a unicode::String that supports expanding the resulting string. |