Add support for pluggable FTS tokenizers


    • Type: Task
    • Resolution: Done
    • Priority: Major - P3
    • 3.1.1
    • Affects Version/s: None
    • Component/s: Text Search
    • None
    • Fully Compatible
    • Platform 1 04/03/15
    • None
    • 3

      To support third-party tokenizers, MongoDB needs an abstract interface for document tokenization.

      1. Create an abstract interface
      2. Move all code to use the new interface except V1 legacy
      3. Create an implementation for our
      class FtsTokenizer {
      public:
          virtual ~FtsTokenizer() = default;
          virtual void reset(const char* document) = 0;  // Start processing a new document
          virtual bool moveNext() = 0;                   // Advance to the next token
          virtual StringData& getStem() = 0;             // Return the stemmed form of the current token
      };
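
      To illustrate how a pluggable tokenizer might implement this interface, here is a minimal, self-contained sketch. It is not the implementation from this ticket. `BasicAsciiTokenizer` and `collectStems` are hypothetical names, `StringData` is aliased to `std::string` as a stand-in for MongoDB's own type so the example compiles on its own, and "stemming" is reduced to lowercasing ASCII letters, which is an assumption made purely for brevity.

```cpp
#include <cassert>
#include <cctype>
#include <string>
#include <vector>

// Stand-in for MongoDB's StringData (assumption, so the sketch is self-contained).
using StringData = std::string;

// The abstract interface proposed in the ticket.
class FtsTokenizer {
public:
    virtual ~FtsTokenizer() = default;
    virtual void reset(const char* document) = 0;  // Start processing a new document
    virtual bool moveNext() = 0;                   // Advance to the next token
    virtual StringData& getStem() = 0;             // Return the stemmed form of the current token
};

// Hypothetical implementation: splits on non-alphanumeric ASCII characters
// and lowercases each token instead of applying a real stemmer.
class BasicAsciiTokenizer final : public FtsTokenizer {
public:
    void reset(const char* document) override {
        assert(document != nullptr);
        _cursor = document;
        _stem.clear();
    }

    bool moveNext() override {
        // Skip delimiters preceding the next token.
        while (*_cursor != '\0' && !isTokenChar(*_cursor)) {
            ++_cursor;
        }
        if (*_cursor == '\0') {
            return false;  // No tokens remain in this document.
        }
        // Consume one token, lowercasing as we go.
        _stem.clear();
        while (*_cursor != '\0' && isTokenChar(*_cursor)) {
            _stem.push_back(static_cast<char>(
                std::tolower(static_cast<unsigned char>(*_cursor))));
            ++_cursor;
        }
        return true;
    }

    StringData& getStem() override {
        return _stem;
    }

private:
    static bool isTokenChar(char c) {
        return std::isalnum(static_cast<unsigned char>(c)) != 0;
    }

    const char* _cursor = "";  // Empty document until reset() is called.
    StringData _stem;
};

// Hypothetical helper: drives any FtsTokenizer through the interface only.
std::vector<std::string> collectStems(FtsTokenizer& tokenizer, const char* document) {
    std::vector<std::string> stems;
    tokenizer.reset(document);
    while (tokenizer.moveNext()) {
        stems.emplace_back(tokenizer.getStem());
    }
    return stems;
}
```

      Because `collectStems` depends only on `FtsTokenizer`, a third-party tokenizer could be substituted without changing the calling code, which is the point of the abstraction described above.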
      

              Assignee:
              Mark Benvenuto
              Reporter:
              Mark Benvenuto
              Votes:
              0
              Watchers:
              4
