<!-- 
RSS generated by JIRA (9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66) at Thu Feb 08 03:47:04 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>MongoDB Jira</title>
    <link>https://jira.mongodb.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.7.1</version>
        <build-number>970001</build-number>
        <build-date>13-04-2023</build-date>
    </build-info>


<item>
            <title>[SERVER-18248] &lt; 100 chars are still too large to index if weird chars or messed up encoding&#8230;?</title>
                <link>https://jira.mongodb.org/browse/SERVER-18248</link>
                <project id="10000" key="SERVER">Core Server</project>
                    <description>&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;I&apos;m doing a map/reduce to collect a large set of log entries of users input. When moving from 2.4 or something to 3.0.2 a lot of problems of too long fields to index appeared. We then cut the large ones down to prefixes and it seemed to work. When doing this map/reduce however I get:&lt;/p&gt;

&lt;p&gt;2015-04-29T09:40:46.014+0200 E QUERY    Error: map reduce failed:{&lt;br/&gt;
	&quot;errmsg&quot; : &quot;exception: Btree::insert: key too large to index, failing kostbevakningen.tmp.mr.logentrys_0_inc.$_temp_0 1057 { : \&quot;2 &#227;&#402;&#230;&#8217;&#227;&#8224;&#226;&#8364;&#8482;&#227;&#402;&#226;&#8364;&#160;&#227;&#162;&#226;&#8218;&#172;&#226;&#8222;&#162;&#227;&#402;&#230;&#8217;&#227;&#162;&#226;&#8218;&#172;&#226;&#160;&#227;&#402;&#226;&#162;&#227;&#162;&#226;&#8364;&#353;&#226;&#172;&#227;&#162;&#226;&#8364;&#382;&#226;&#162;&#227;&#402;&#230;&#8217;&#227;&#8224;&#226;&#8364;&#8482;&#227;&#402;&#226;&#162;&#227;&#162;&#65533;...\&quot; }&quot;,&lt;br/&gt;
	&quot;code&quot; : 17280,&lt;br/&gt;
	&quot;ok&quot; : 0&lt;br/&gt;
}&lt;/p&gt;


&lt;p&gt;But these fields are less than 100 chars:&lt;br/&gt;
&amp;gt; db.logentrys.find({inputText: {$regex: &apos;2 &#227;&#402;&#230;&#8217;&#227;.*&apos;}}).count()&lt;br/&gt;
294&lt;br/&gt;
&amp;gt; db.logentrys.find({inputText: {$regex: &apos;.&lt;/p&gt;
{100,}
&lt;p&gt;&apos;}}).count()&lt;br/&gt;
0&lt;/p&gt;

&lt;p&gt;We have a lot of these cases with weird encodings, I think this one is the beginning of swedish &quot;2 &#228;gg&quot; which means &quot;2 eggs&quot; &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.mongodb.org/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/p&gt;

&lt;p&gt;My guess is that mongo does some internal tree encoding which makes these unusual characters take up a looot of space so the overhead makes &amp;lt; 100 chars more than 1024 bytes.&lt;/p&gt;

&lt;p&gt;What can I do? I could probably go with losing these log entries, but I really don&apos;t even know how to identify them all?&lt;/p&gt;</description>
                <environment></environment>
        <key id="200939">SERVER-18248</key>
            <summary>&lt; 100 chars are still too large to index if weird chars or messed up encoding&#8230;?</summary>
                <type id="6" iconUrl="https://jira.mongodb.org/secure/viewavatar?size=xsmall&amp;avatarId=14720&amp;avatarType=issuetype">Question</type>
                                            <priority id="3" iconUrl="https://jira.mongodb.org/images/icons/priorities/major.svg">Major - P3</priority>
                        <status id="6" iconUrl="https://jira.mongodb.org/images/icons/statuses/closed.png" description="The issue is considered finished, the resolution is correct. Issues which are closed can be reopened.">Closed</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="9">Done</resolution>
                                        <assignee username="sam.kleinman">Sam Kleinman</assignee>
                                    <reporter username="hedefalk">Viktor Hedefalk</reporter>
                        <labels>
                    </labels>
                <created>Wed, 29 Apr 2015 07:53:26 +0000</created>
                <updated>Tue, 26 May 2015 16:09:58 +0000</updated>
                            <resolved>Tue, 26 May 2015 16:09:58 +0000</resolved>
                                    <version>3.0.2</version>
                                                    <component>MapReduce</component>
                                        <votes>0</votes>
                                    <watches>5</watches>
                                                                                                                <comments>
                            <comment id="923494" author="samk" created="Tue, 26 May 2015 16:09:48 +0000"  >&lt;p&gt;I&apos;m glad that you&apos;ve been able to resolve this, and sorry for the confusion. I&apos;m going to go ahead and close this ticket. Feel free to reopen if you run into this again or open a new ticket as needed. &lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
sam&lt;/p&gt;</comment>
                            <comment id="920896" author="hedefalk" created="Thu, 21 May 2015 20:18:20 +0000"  >&lt;p&gt;Hi @Ramon, I could get round it by wiping my mongo installation. Seems like some temps crap staying behind even though I had removed the failing data. &quot;kostbevakningen.tmp.mr.logentrys_0_inc.$_temp_0 1057&quot; sounds like something else than the &quot;real&quot; data.&lt;/p&gt;</comment>
                            <comment id="920877" author="ramon.fernandez" created="Thu, 21 May 2015 19:58:37 +0000"  >&lt;p&gt;Hi &lt;a href=&quot;https://jira.mongodb.org/secure/ViewProfile.jspa?name=hedefalk&quot; class=&quot;user-hover&quot; rel=&quot;hedefalk&quot;&gt;hedefalk&lt;/a&gt;, we haven&apos;t heard back from you for some time. If this is still an issue for you can you please answer Sam&apos;s question above about a reproducer?&lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
Ram&#243;n.&lt;/p&gt;</comment>
                            <comment id="907751" author="samk" created="Thu, 7 May 2015 16:17:22 +0000"  >&lt;p&gt;Hello, &lt;/p&gt;

&lt;p&gt;Thanks for reporting this is issue. Can you provide sample data data and/or a small script that we could use to reproduce the issue? This will help us understand the problem much more clearly. &lt;/p&gt;

&lt;p&gt;Regards, &lt;br/&gt;
sam&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                <customfield id="customfield_10050" key="com.atlassian.jira.toolkit:comments">
                        <customfieldname># Replies</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>4.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10055" key="com.atlassian.jira.ext.charting:firstresponsedate">
                        <customfieldname>Date of 1st Reply</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>Wed, 29 Apr 2015 21:33:17 +0000</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10052" key="com.atlassian.jira.toolkit:dayslastcommented">
                        <customfieldname>Days since reply</customfieldname>
                        <customfieldvalues>
                                        8 years, 38 weeks, 1 day ago
    
                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_18254" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Dependencies</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[]]></customfieldvalue>


                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_15850" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            <customfield id="customfield_10057" key="com.atlassian.jira.toolkit:lastusercommented">
                        <customfieldname>Last comment by Customer</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>true</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10056" key="com.atlassian.jira.toolkit:lastupdaterorcommenter">
                        <customfieldname>Last commenter</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>sam.kleinman</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_11151" key="com.atlassian.jira.toolkit:LastCommentDate">
                        <customfieldname>Last public comment date</customfieldname>
                        <customfieldvalues>
                            8 years, 38 weeks, 1 day ago
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                    <customfield id="customfield_10051" key="com.atlassian.jira.toolkit:participants">
                        <customfieldname>Participants</customfieldname>
                        <customfieldvalues>
                                        <customfieldvalue>ramon.fernandez@mongodb.com</customfieldvalue>
            <customfieldvalue>sam.kleinman</customfieldvalue>
            <customfieldvalue>hedefalk</customfieldvalue>
    
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        <customfield id="customfield_14254" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Product Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hrl76f:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_12550" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>2|hs8v33:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10558" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_23361" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Requested By</customfieldname>
                        <customfieldvalues>
                                

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10053" key="com.atlassian.jira.ext.charting:timeinstatus">
                        <customfieldname>Time In Status</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_22870" key="com.onresolve.jira.groovy.groovyrunner:scripted-field">
                        <customfieldname>Triagers</customfieldname>
                        <customfieldvalues>
                                

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                    <customfield id="customfield_14350" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>serverRank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hsg2kv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                    </customfields>
    </item>
</channel>
</rss>