Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-9537

Full text search in Dutch does incorrect stemming for words that end with "sen"

    • Type: Icon: Bug Bug
    • Resolution: Duplicate
    • Priority: Icon: Minor - P4 Minor - P4
    • None
    • Affects Version/s: 2.4.3
    • Component/s: Text Search
    • Labels:
      None
    • ALL
    • Hide
      • create a Dutch-language full text index
      • put a document with the word "dansen" in the full text index
      • do a full text search for "dans"
        => no results are returned
      Show
      create a Dutch-language full text index put a document with the word "dansen" in the full text index do a full text search for "dans" => no results are returned

      Words in Dutch that end with "sen" are correctly stemmed to the same word without "en". So "dansen" becomes "dans". However, if you then search for "dans", this will incorrectly be stemmed to "dan", and the full text search returns no matches.

      A possible solution would be to recursively stem words during the indexing and search phase, so that both "dansen" and "dans" would stem to "dan".

            Assignee:
            paul.pedersen Paul Pedersen
            Reporter:
            mroloux Matti Roloux
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Created:
              Updated:
              Resolved: