Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-9537

Full text search in Dutch does incorrect stemming for words that end with "sen"

    XMLWordPrintableJSON

Details

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Minor - P4 Minor - P4
    • None
    • 2.4.3
    • Text Search
    • None
    • ALL
    • Hide
      • create a Dutch-language full text index
      • put a document with the word "dansen" in the full text index
      • do a full text search for "dans"
        => no results are returned
      Show
      create a Dutch-language full text index put a document with the word "dansen" in the full text index do a full text search for "dans" => no results are returned

    Description

      Words in Dutch that end with "sen" are correctly stemmed to the same word without "en". So "dansen" becomes "dans". However, if you then search for "dans", this will incorrectly be stemmed to "dan", and the full text search returns no matches.

      A possible solution would be to recursively stem words during the indexing and search phase, so that both "dansen" and "dans" would stem to "dan".

      Attachments

        Activity

          People

            paul.pedersen Paul Pedersen
            mroloux Matti Roloux
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: