Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Gone away
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 5.3.1
Component/s: None
Labels:
None

Operating System:
ALL
Steps To Reproduce:

Hide

See SERVER-11873

Show
See SERVER-11873
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In our logs long strings are being truncated to 150 bytes irrespective of UTF-8 character boundaries. While this is not an issue for ASCII characters any multi-byte character that is on the truncation boundary gets cut at exactly 150 bytes and produces an invalid UTF-8 byte sequence contaminating our log file with an improper encoding.

For example 'あ' is "\xE3\x81\x82" in UTF-8 bytes but if the last byte is cut it produces the invalid UTF-8 byte sequence of "\xE3\x81".

The code causing this issue can be found here. It should be updated to be aware of character boundaries in its trimming logic.

Assignee:: Chris Kelly
Reporter:: Justin Casali
Participants:: Chris Kelly, Justin Casali
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Apr 25 2022 05:25:28 PM UTC
Updated:: Oct 27 2023 08:45:29 PM UTC
Resolved:: May 18 2022 06:56:29 PM UTC

Details

Description

Attachments

Activity

People

Dates