[SERVER-45617] Support for Serbian language in FTS Created: 16/Jan/20  Updated: 27/Dec/23

Status: Backlog
Project: Core Server
Component/s: Text Search
Affects Version/s: None
Fix Version/s: None

Type: New Feature Priority: Major - P3
Reporter: Stefan Petkovic Assignee: Backlog - Query Integration
Resolution: Unresolved Votes: 0
Labels: qi-text-search, serbian
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Assigned Teams:
Query Integration
Participants:

 Description   

We would like to contribute by adding a support for Serbian language to FTS component.

This would require:

  1. Update of the libstemmer_c library - Serbian stemmer is available under the Snowball project,
  2. Change in the /src/mongo/db/fts/fts_language.cpp - in order to register a new language,
  3. New list of stopwords (stop_words_serbian.txt) under the /src/mongo/db/fts folder and a Change to the SConscript file (to register this new list),
  4. Change to the /src/mongo/db/fts/unicode/codepoints_diacritic_map.cpp - two new cases inside codepointRemoveDiacritics function for letters "Đ" and "đ".

 



 Comments   
Comment by Stefan Petkovic [ 29/Aug/21 ]

Hello,

I just want to check if this is something that you will take into consideration?

Best regards,
Stefan

Comment by Carl Champain (Inactive) [ 17/Jan/20 ]

Hi petkovic8@gmail.com,

Thanks for the report.
I'm passing this ticket along to the appropriate team for further investigation. Updates will be posted on this ticket as they happen.

Kind regards,
Carl
 

Generated at Thu Feb 08 05:09:17 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.