Sampling then projecting in the MongoSamplePartitioner is slow

XMLWordPrintableJSON

    • Type: Improvement
    • Resolution: Won't Fix
    • Priority: Major - P3
    • 1.0.0
    • Affects Version/s: None
    • Component/s: Performance
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      For example with the MovieLens dataset ~1million documents:

      Pipeline: sample, project _id: 76120 ms
      Pipeline: project _id, sample: 1124 ms

              Assignee:
              Ross Lawley
              Reporter:
              Ross Lawley
              None
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: