[SERVER-16803] Text search support for Hebrew language Created: 12/Jan/15  Updated: 28/Dec/23

Status: Backlog
Project: Core Server
Component/s: Text Search
Affects Version/s: None
Fix Version/s: None

Type: New Feature Priority: Major - P3
Reporter: Michael Elkin Assignee: Backlog - Query Integration
Resolution: Unresolved Votes: 5
Labels: qi-text-search
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
Assigned Teams:
Query Integration
Participants:

 Description   

Does text search support Hebrew language ?

Thank you

Michael



 Comments   
Comment by Yonatan Novetsky [ 25/Mar/20 ]

Hello,

I did a search in the MongoDB codebase, and it seems that the $text language support comes from an external library called "libstemmer_c". This library seems to be the C implementation of the Snowball stemmers, current website: https://snowballstem.org/

If this is true, then:

A) MongoDB should update it's copy of libstemmer, since it now contains (in addition to Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish, Turkish) also the following new languages: Arabic, Basque, Catalan, Greek, Hindi, Indonesian, Irish, Lithuanian, Nepali, Tamil

B) Can someone explain how to make a Hebrew stemmer?

Comment by Doron Sinai [ 29/Dec/15 ]

Do you know when its scheduled for?

Comment by Ofer Groner [ 07/Jul/15 ]

Hello,

What is the status of this request?

Comment by Matt Kangas [ 12/Jan/15 ]

Hi melkin@dbs-h.com,

As of MongoDB 2.6 we support 15 text search languages. Hebrew is not currently supported.
http://docs.mongodb.org/manual/reference/text-search-languages/#text-search-languages

I will update this ticket to make it a feature request for Hebrew support.

Generated at Thu Feb 08 03:42:19 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.