Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-54485

FTS indexes pass c-strings to tokenizer so they ignore data in strings past the first nul byte

    XMLWordPrintable

Details

    • Bug
    • Status: Closed
    • Major - P3
    • Resolution: Won't Fix
    • None
    • None
    • Index Maintenance, Querying
    • None
    • ALL
    • Query Execution 2021-03-08, Query Execution 2021-03-22, Query Execution 2021-04-05, Query Execution 2021-04-19, Query Execution 2021-05-03, Query Execution 2021-05-17, Query Execution 2021-06-14

    Description

      "Luckily" the same bug exists in both indexing and querying. Unfortunately, because it is in indexing, it will require a version bump to fix and we would need to keep the old code around.

       

      This also adds a relatively slight overhead because it is computing strlen rather than just using the size stored in the BSONElement. But that cost is trivial relative to the actual tokenization, so I don't think it is a good motivation alone to fix this.

      Attachments

        Activity

          People

            mickey.winters@mongodb.com Mickey Winters
            mathias@mongodb.com Mathias Stearn
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: