[SERVER-17595] Add support for Persian language in text search Created: 15/Mar/15  Updated: 25/Jun/15  Resolved: 29/Apr/15

Status: Closed
Project: Core Server
Component/s: Index Maintenance, Text Search
Affects Version/s: None
Fix Version/s: None

Type: New Feature Priority: Major - P3
Reporter: behnamy Assignee: Unassigned
Resolution: Duplicate Votes: 7
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: Text File stop_words_persian.txt    
Issue Links:
Depends
depends on SERVER-17620 RLP Tokenizer (includes C++ unit tests) Closed
Duplicate
duplicates SERVER-17620 RLP Tokenizer (includes C++ unit tests) Closed
Backwards Compatibility: Fully Compatible
Participants:

 Description   

If possible, please add support for the Persian language. I have attached a Persian stop word list.



 Comments   
Comment by Ramon Fernandez Marina [ 29/Apr/15 ]

The necessary pieces to support Persian and other languages via an external library have been put in place by SERVER-17620, and will be part of the upcoming 3.1.2 development release.

Comment by Miliad Ebadi [ 25/Apr/15 ]

Here are some additional Persian stop word lists:
http://www.ranks.nl/stopwords/persian
https://code.google.com/p/stop-words/
http://www.mojiry.ir/text_tools/StopWords/PersianStopWords.txt

I am ready to help in any way I can.

Comment by Hadi Farnoud [ 17/Mar/15 ]

Frankly, I'm surprised it's not supported already. I'd help however I can.

Comment by behnamy [ 17/Mar/15 ]

Hi Mark, thanks for your attention to our request.
We mean the Iranian Persian (pes) language. You can read more here: http://en.wikipedia.org/wiki/Persian_language

As @mobinranjbar mentioned, we are ready to help with adding Persian to full-text search, or even with translating the docs into Persian.

Comment by Mark Benvenuto [ 16/Mar/15 ]

Based on the ISO-639-3 specification, Persian is considered a macrolanguage. Are you referring to the macrolanguage or to one of the individual languages within it: Dari (prs) or Iranian Persian (pes)?

See this ISO-639-3 entry for more information:
http://www-01.sil.org/iso639-3/documentation.asp?id=fas

Comment by Mobin Ranjbar [ 16/Mar/15 ]

I absolutely support it.

Comment by MOHAMMAD [X] [ 15/Mar/15 ]

Hi dear friend!
Very good for the run
This is important and necessary!
