Optimization Scenarios

In this document, we’ll compare job durations for some basic dataflows and see how certain optimizations affect them.

The source dataset in every case covered contains 5 million records.

Aggregate Transformation

To learn about the Aggregate transformation object, refer to its documentation.

Here’s the dataflow we created to test different optimizations of the Aggregate transformation object:

