[SERVER-8398] Review Russian stop word list Created: 30/Jan/13 Updated: 11/Jul/16 Resolved: 26/Feb/13 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Text Search |
| Affects Version/s: | 2.3.2 |
| Fix Version/s: | 2.4.0-rc2 |
| Type: | Task | Priority: | Major - P3 |
| Reporter: | Daniel Pasette (Inactive) | Assignee: | Asya Kamsky |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Backwards Compatibility: | Fully Compatible |
| Participants: |
| Description |
|
https://github.com/mongodb/mongo/blob/master/src/mongo/db/fts/stop_words_russian.txt |
| Comments |
| Comment by Asya Kamsky [ 09/Feb/13 ] |
|
I divided the problems with this list of words (and there are many) into four parts:
1. Remove from list (words that are on the list but should not be):
Probably remove from list:
2. Add missing forms of present words to list:
3. Add to list:
4. Decide what to do with:
The words in part 4 are all ordinal forms of the numbers 1-20 (first, second, third, etc.), but in masculine singular form only. Either they should be removed, or the feminine singular, neuter singular, and plural forms of each one should be added. The list of forms to add is not quite complete: as the adjective последний ("last") shows, an adjective has many forms, and most other adjectives do as well. |
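The check described for the ordinals (masculine singular present, other forms absent) could be sketched roughly as follows. This is only an illustration, not the procedure used for this ticket: the helper name `missing_forms` and the `ORDINAL_FORMS` table are hypothetical, the table is a small hand-written sample covering only nominative forms of three ordinals, and the input is assumed to be a plain list of words such as the lines of a stop word file.

```python
# Hypothetical helper: report which listed forms of an ordinal are absent
# from a stop word list. The table is a hand-written sample (nominative
# feminine, neuter, and plural forms only), not data from the project.
ORDINAL_FORMS = {
    "первый": ["первая", "первое", "первые"],  # first
    "второй": ["вторая", "второе", "вторые"],  # second
    "третий": ["третья", "третье", "третьи"],  # third
}


def missing_forms(stop_words, forms=ORDINAL_FORMS):
    """Map each masculine form found in stop_words to its absent forms."""
    present = {w.strip().lower() for w in stop_words if w.strip()}
    report = {}
    for masculine, others in forms.items():
        if masculine not in present:
            continue  # ordinal is not on the list, so there is nothing to complete
        absent = [form for form in others if form not in present]
        if absent:
            report[masculine] = absent
    return report


if __name__ == "__main__":
    sample = ["первый", "второй", "вторая", "и"]
    for word, absent in missing_forms(sample).items():
        print(word, "is missing:", ", ".join(absent))
```

A report like this only supports the decision in part 4; whether to remove the ordinals or add their remaining forms is still a judgment call, and a complete check would also need the oblique case forms.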