Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-325

Correctly handle \u escapes using UTF16 surrogate pairs for chars outside of BMP

    XMLWordPrintableJSON

Details

    • Icon: Bug Bug
    • Resolution: Won't Do
    • Icon: Minor - P4 Minor - P4
    • None
    • None
    • Tools
    • Storage Execution

    Description

      http://en.wikipedia.org/wiki/UTF-16/UCS-2#Encoding_of_characters_outside_the_BMP

      From rfc 4627:
      To escape an extended character that is not in the Basic Multilingual
      Plane, the character is represented as a twelve-character sequence,
      encoding the UTF-16 surrogate pair. So, for example, a string
      containing only the G clef character (U+1D11E) may be represented as
      "\uD834\uDD1E".

      Attachments

        Activity

          People

            backlog-server-execution Backlog - Storage Execution Team
            mathias@mongodb.com Mathias Stearn
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: