Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 0.9.7
Component/s: Performance
Labels:
None

CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Filtering by a field that isn't indexed is extremely fast, however doing a sum over that same field is very slow when using db.group. I'm sure it has to do with the fact that aggregation takes a function and the javascript server side eval is very slow, but would it be possible to have an aggregation facility that had a similar performance to full scans, which is very fast.

The query I am using here is:
db.group( {ns: "testCollection", key: {}, reduce: function(obj, prev)

{ prev.csum += obj.field1; }

, initial:

{ csum: 0 }

})

Sum isn't the only interesting aggregation to make fast. The most useful case here would be to have a few buckets and as we visit each item, we place one of it's fields in a bucket based to collect a histogram of stats. An example of this would be to count jira bug status codes in one aggregate

{ blocking: 5, major: 3 ... }

It just seems to me that if a non-indexed find is fast, than the aggregate case should have similar perf.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

mongo_log.txt
1 kB
Aug 14 2009 04:12:17 AM UTC

Assignee:: Eliot Horowitz (Inactive)
Reporter:: John Carrino
Participants:: Eliot Horowitz, John Carrino
Votes:: 0 Vote for this issue
Watchers:: 0 Start watching this issue

Created:: Aug 14 2009 04:08:16 AM UTC
Updated:: Sep 10 2009 10:03:07 AM UTC
Resolved:: Aug 14 2009 11:42:23 AM UTC

Details

Description

Attachments

Attachments

Activity

People

Dates