[SERVER-46873] Invalid utf-8 in json logs Created: 14/Mar/20 Updated: 29/Oct/23 Resolved: 25/Jan/21 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Logging |
| Affects Version/s: | 4.3.4 |
| Fix Version/s: | 4.4.0 |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Oleg Pudeyev (Inactive) | Assignee: | Mark Benvenuto |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Issue Links: |
|
||||||||
| Backwards Compatibility: | Fully Compatible | ||||||||
| Operating System: | ALL | ||||||||
| Sprint: | Service arch 2020-05-04, Service arch 2020-05-18, Service arch 2020-06-01, Service arch 2020-06-15, Service arch 2020-06-29, Service arch 2020-07-13, Service Arch 2020-07-27, Service Arch 2020-08-10, Service Arch 2020-08-24, Security 2021-02-08 | ||||||||
| Participants: | |||||||||
| Description |
|
In this file: https://s3.amazonaws.com/mciuploads/mongo-ruby-driver/mongo-latest__mongodb-version~latest_topology~replica-set_auth-and-ssl~noauth-and-nossl_ruby~ruby-2.7_os~ubuntu1604/3da63736cf6fd65edb655fe21aa94b970796b802/5e6b12882a60ed16d60d365b/mongo_ruby_driver_mongo_latest__mongodb_version~latest_topology~replica_set_auth_and_ssl~noauth_and_nossl_ruby~ruby_2.7_os~ubuntu1604_patch_3da63736cf6fd65edb655fe21aa94b970796b802_5e6b12882a60ed16d60d365b_20_03_13_04_56_41/logs/mongo_ruby_driver_mongo_latest__mongodb_version~latest_topology~replica_set_auth_and_ssl~noauth_and_nossl_ruby~ruby_2.7_os~ubuntu1604_test_mlaunch_patch_3da63736cf6fd65edb655fe21aa94b970796b802_5e6b12882a60ed16d60d365b_20_03_13_04_56_41-0-mongodb-logs.tar.gz, in drivers-tools/.evergreen/orchestration/db/ruby-driver-rs/rs2/mongod.log , the server produced 4 lines that contain invalid utf-8 as reported by Ruby. The line numbers are 21111, 21962, 22041, 22187. These lines all look like this:
terminationCause appears to look like this:
Ruby has this to say about it:
According to https://en.wikipedia.org/wiki/JSON#Data_portability_issues json should be encoded in utf-8. |
| Comments |
| Comment by Mark Benvenuto [ 25/Jan/21 ] |
|
As part as |
| Comment by Bruce Lucas (Inactive) [ 16/Mar/20 ] |
|
It looks to me that this is worse than simply logging invalid utf8 - it looks like terminationCause is garbage, indicating possibly a more serious problem like use after free, wild pointer, uninitialized storage. |