Join Using a Dataset

In this document, you’ll learn how to use the Join function in Astera Dataprep to combine two datasets within the same Dataprep Recipe.

Use Case

For this use case, we have a Dataprep Recipe in which a company’s Orders and OrderDetails datasets have been cleansed. The company now wants to join these two datasets.

  1. To begin, click on the Join option in the toolbar and select Dataset from the drop-down.

  1. This will open the Recipe Configuration – Join panel.

  1. In this panel, you’ll configure the following options:

  • Dataset: From the drop-down, choose the dataset you want to join with. For example, if you're currently working with the Orders dataset, you can select OrderDetails as the joining dataset.

  • Join Dataset: Enter a custom name for the resulting dataset, or keep the default name.

  • Join Type: Choose the type of join you want to perform:

    • Inner: Keeps only the records that have matching values in both datasets.

    • Left Outer: Keeps all records from the current dataset and adds matching data from the joined dataset. Where a record in the current dataset has no match, the fields from the joined dataset are filled with nulls.

    • Right Outer: Keeps all records from the joined dataset and adds matching data from the current dataset. Where a record in the joined dataset has no match, the fields from the current dataset are filled with nulls.

    • Full Outer: Keeps all records from both datasets. Where a record has no match in the other dataset, the fields from that other dataset are filled with nulls.

    In our example, we’ll use an Inner join to include only matching records.

  • Keys: Specify the key fields that the join will be based on. Astera will auto-detect matching fields, but you can modify them as needed.

    • Left Field: Field from the current dataset.

    • Right Field: Field from the joining dataset.

  1. Once you’re done, click Apply. The datasets will now be joined, and the result will appear in the Grid.
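To make the four join types concrete, here is a minimal Python sketch of what each one keeps. It is only an illustration of standard join semantics and not Astera's implementation: the `join_records` function, the sample `orders` and `order_details` records, and the `Customer` and `Product` field names are all hypothetical. As a simplification, fields that share a name in both datasets (here `OrderID`) collapse into a single field in the result.

```python
from collections import defaultdict


def join_records(left, right, left_key, right_key, how="inner"):
    """Join two lists of dict records where left[left_key] == right[right_key].

    `how` is one of "inner", "left", "right", "full".
    Illustrative only; this is not how Astera Dataprep is implemented.
    """
    if how not in ("inner", "left", "right", "full"):
        raise ValueError(f"unknown join type: {how}")

    # All field names seen on each side, used to build null-filled rows.
    left_fields = {name for rec in left for name in rec}
    right_fields = {name for rec in right for name in rec}

    # Index the right-hand records by key value for fast lookup.
    index = defaultdict(list)
    for pos, rec in enumerate(right):
        index[rec[right_key]].append(pos)

    matched_right = set()
    result = []
    for l_rec in left:
        positions = index.get(l_rec[left_key], [])
        for pos in positions:
            matched_right.add(pos)
            result.append({**l_rec, **right[pos]})
        # Left and full outer joins keep unmatched left records,
        # with the right-hand fields set to None (null).
        if not positions and how in ("left", "full"):
            result.append({**dict.fromkeys(right_fields), **l_rec})

    # Right and full outer joins keep unmatched right records,
    # with the left-hand fields set to None (null).
    if how in ("right", "full"):
        for pos, r_rec in enumerate(right):
            if pos not in matched_right:
                result.append({**dict.fromkeys(left_fields), **r_rec})
    return result


# Hypothetical sample data: order 1 has two detail lines, orders 2 and 3
# have none, and detail line 4 refers to an order that is not present.
orders = [
    {"OrderID": 1, "Customer": "A"},
    {"OrderID": 2, "Customer": "B"},
    {"OrderID": 3, "Customer": "C"},
]
order_details = [
    {"OrderID": 1, "Product": "Tea"},
    {"OrderID": 1, "Product": "Jam"},
    {"OrderID": 4, "Product": "Rice"},
]

for how in ("inner", "left", "right", "full"):
    rows = join_records(orders, order_details, "OrderID", "OrderID", how)
    print(how, len(rows))
```

With this sample data, the inner join returns only the two rows for order 1, while the outer joins add null-filled rows for whichever side has no match.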
