[DOCS-3880] Norwegian language code is "nb" rather than "no" Created: 07/Aug/14  Updated: 11/Jan/17  Resolved: 13/Aug/14

Status: Closed
Project: Documentation
Component/s: manual
Affects Version/s: None
Fix Version/s: 01112017-cleanup

Type: Bug Priority: Major - P3
Reporter: Kamran K. Assignee: Unassigned
Resolution: Done Votes: 0
Labels: 28qa
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
related to SERVER-9932 Text search languages should accept s... Closed
is related to SERVER-14879 Text search alias for Norwegian shoul... Backlog
Participants:
Days since reply: 9 years, 27 weeks ago

 Description   

http://docs.mongodb.org/manual/reference/text-search-languages says Norwegian is aliased to "no", but it's actually aliased to "nb" in the source code: https://github.com/mongodb/mongo/blob/master/src/mongo/db/fts/fts_language.cpp#L177

rassi@10gen.com: Can you verify that the Norwegian Bokmål alias is correct in the server code? I think this is a docs bug, but I'm not entirely certain because of the various Norwegian ISO codes (nb, nn, and no). Thanks!



 Comments   
Comment by Githook User [ 13/Aug/14 ]

Author:

{u'username': u'kkmongo', u'name': u'Kamran Khan', u'email': u'kamran.khan@mongodb.com'}

Message: DOCS-3880 Fix the Norwegian ISO language code ('no' -> 'nb')

Signed-off-by: kay <kay.kim@10gen.com>
Branch: master
https://github.com/mongodb/docs/commit/bdf5f6a6a05647cfa1ec9cb4acb2e7270a793b7f

Comment by J Rassi [ 13/Aug/14 ]

Filed / linked SERVER-14879.

Comment by J Rassi [ 13/Aug/14 ]

I see that the description of the SERVER ticket to implement two-letter language codes (SERVER-9932) did specify "no" as the language alias to be used for Norwegian. However, the implementation registered the language as "nb" (and using "nb" in the server does correctly invoke the Norwegian stemmer and stopword list). paul@10gen.com may have raised the suggestion to use "nb" instead (based on Bokmål's current dominance) but I can't honestly recall off the top of my head. Looking into it now, I do think that "no" would have been more correct: Porter said on gmame.comp.search.snowball back in 2001 that "the simple Norwegian stemmer I've presented works equally on bokmal and nynorsk", and also the Norwegian stopword list packaged with the server has Bokmål and Nynorsk words (compare the annotated list with the list from 2.6.4).

Hence, I'm inclined to suggest both of the following:

  • the documentation should be updated to reflect that the Norwegian alias is actually "nb"
  • the alias in the server should be changed to "no" in a future version of text indexes (and the change should be documented when it happens, eventually)

I'll file a ticket for the server part of the work and link here.

Generated at Thu Feb 08 07:46:43 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.