Improve retryability of aggregation processes


    • Type: Improvement
    • Resolution: Unresolved
    • Priority: Major - P3
    • Affects Version/s: None
    • Component/s: None
    • Workload Resilience

      In the Shard class, we have a runAggregation function that starts an aggregation process. Its retryability was implemented with the constraint in mind that getMore is not retryable. We should also have retryability at the fetcher level. This would allow us to properly retry on retryable errors there, while handling other classes of errors in Shard.

      We just have to be careful to strip the RetryableError label from errors coming out of the fetcher, since the retries have already been performed at that point.
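
      As a rough illustration of the idea above, here is a minimal, self-contained sketch of a fetcher-level retry loop that strips the label once its retries are exhausted. It is not the server's actual code: Result, runWithFetcherRetries and maxAttempts are hypothetical names introduced only for this example, and the real fetcher and Shard interfaces will differ.

```cpp
#include <functional>
#include <set>
#include <string>

// Hypothetical, simplified stand-in for a command result; not a real server type.
struct Result {
    bool ok = false;
    std::set<std::string> errorLabels;  // e.g. {"RetryableError"}
};

// Hypothetical fetcher-level retry: re-run `attempt` while it fails with the
// RetryableError label, for at most `maxAttempts` attempts in total.
Result runWithFetcherRetries(const std::function<Result()>& attempt, int maxAttempts) {
    Result last;
    for (int i = 0; i < maxAttempts; ++i) {
        last = attempt();
        if (last.ok || last.errorLabels.count("RetryableError") == 0) {
            // Success, or an error class that the caller (Shard) has to handle.
            return last;
        }
    }
    // Retries were already performed here: strip the label so that the caller
    // does not retry the same operation a second time.
    last.errorLabels.erase("RetryableError");
    return last;
}
```

      With this shape, the caller only ever sees a RetryableError label on errors that the fetcher has not already retried, so other error classes can still be handled in Shard without double retries.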

      We should also consider simply switching the Shard class to the ARS, as noted in the technical design of SPM-4003.

            Assignee:
            Unassigned
            Reporter:
            Guillaume Racicot
            Votes:
            0
            Watchers:
            6

              Created:
              Updated: