[SERVER-13709] Add text index support for arabic Created: 24/Apr/14  Updated: 25/Jun/15  Resolved: 29/Apr/15

Status: Closed
Project: Core Server
Component/s: Text Search
Affects Version/s: None
Fix Version/s: None

Type: New Feature Priority: Major - P3
Reporter: Ali Hmer Assignee: Unassigned
Resolution: Duplicate Votes: 2
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
depends on SERVER-17620 RLP Tokenizer (includes C++ unit tests) Closed
Duplicate
duplicates SERVER-17620 RLP Tokenizer (includes C++ unit tests) Closed
Backwards Compatibility: Fully Compatible
Participants:

 Description   

Add support for Arabic-language text indexing and search.

original description

From a technical point of view, is it feasible to add a stop_words_arabic file to handle the Arabic language? In other words, would having such a file help when using text search in that language?



 Comments   
Comment by Ali Hmer [ 24/Apr/14 ]

That was my concern. Anyhow, thanks for escalating the issue. I can still help by providing an Arabic version of the stop_words text file.

Comment by Daniel Pasette (Inactive) [ 24/Apr/14 ]

Adding support for Arabic requires more than simply adding a stop words list. There is also work needed to support a tokenizer and a stemmer. I'm going to change this ticket to be a request for Arabic language support in text search.

Generated at Thu Feb 08 03:32:37 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.