Correctly handle \u escapes using UTF-16 surrogate pairs for characters outside the BMP


    • Type: Bug
    • Resolution: Won't Do
    • Priority: Minor - P4
    • Affects Version/s: None
    • Component/s: Tools
    • Storage Execution

      http://en.wikipedia.org/wiki/UTF-16/UCS-2#Encoding_of_characters_outside_the_BMP

      From RFC 4627:
      To escape an extended character that is not in the Basic Multilingual
      Plane, the character is represented as a twelve-character sequence,
      encoding the UTF-16 surrogate pair. So, for example, a string
      containing only the G clef character (U+1D11E) may be represented as
      "\uD834\uDD1E".
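The pairing rule quoted above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the tool's actual parser: the helper names `combine_surrogates` and `decode_u_escapes` are hypothetical, and the decoder deliberately ignores other JSON escapes (including an escaped backslash before `u`) to stay short.

```python
import re

# Matches one JSON-style \uXXXX escape (exactly four hex digits).
_U_ESCAPE = re.compile(r"\\u([0-9a-fA-F]{4})")


def combine_surrogates(high: int, low: int) -> int:
    """Combine a UTF-16 high/low surrogate pair into one code point."""
    if not 0xD800 <= high <= 0xDBFF:
        raise ValueError(f"not a high surrogate: {high:#06x}")
    if not 0xDC00 <= low <= 0xDFFF:
        raise ValueError(f"not a low surrogate: {low:#06x}")
    # Each surrogate carries 10 bits; the result is offset by 0x10000.
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def decode_u_escapes(text: str) -> str:
    """Decode \\uXXXX escapes, joining surrogate pairs (hypothetical helper).

    Simplification: only \\u escapes are interpreted; every other
    character is copied through unchanged.
    """
    out = []
    pos = 0
    while pos < len(text):
        match = _U_ESCAPE.match(text, pos)
        if match is None:
            out.append(text[pos])
            pos += 1
            continue
        unit = int(match.group(1), 16)
        pos = match.end()
        if 0xD800 <= unit <= 0xDBFF:
            # A high surrogate must be followed directly by a low surrogate.
            follow = _U_ESCAPE.match(text, pos)
            if follow is None:
                raise ValueError("high surrogate without a following \\u escape")
            out.append(chr(combine_surrogates(unit, int(follow.group(1), 16))))
            pos = follow.end()
        elif 0xDC00 <= unit <= 0xDFFF:
            raise ValueError("low surrogate without a preceding high surrogate")
        else:
            out.append(chr(unit))
    return "".join(out)
```

For the RFC's example, `decode_u_escapes(r"\uD834\uDD1E")` yields the single character U+1D11E, which matches what Python's standard `json.loads` returns for the same escaped string.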

            Assignee:
            [DO NOT USE] Backlog - Storage Execution Team
            Reporter:
            Mathias Stearn
            Votes:
            0
            Watchers:
            9
