Join Using a Table
In this document, you’ll learn how to use the Join function in Astera Dataprep to combine a dataset from a database table in a shared connection with an existing dataset in your Dataprep Recipe.
Use Case
In this use case, we have a Dataprep Recipe where a company’s Customers dataset has been cleansed. Now, they want to join it with their Orders dataset, which is stored in a database table accessible through a shared connection in the project.

To begin, click on the Join option in the toolbar and select Table from the drop-down.

This will open the Recipe Configuration – Join panel.

In this panel, you’ll configure the following options:
Connection Name: Select the shared connection you want to use. The drop-down lists all shared connections available in the project.

Table: From the drop-down, choose the database table you want to join with. In this example, we’ll select the Orders table.

Join Dataset: You can provide a custom name for the joined dataset or keep the default name. In this example, we’ll keep the default name.

Join Type: Choose the type of join you want to perform:
Inner: Keeps only the records that have matching values in both datasets.
Left Outer: Keeps all records from the current dataset and adds matching data from the table. Unmatched records from the table are filled with nulls.
Right Outer: Keeps all records from the table and adds matching data from the current dataset. Unmatched records from the current dataset are filled with nulls.
Full Outer: Keeps all records from both datasets. Unmatched values are filled with nulls.
In our example, we’ll use an Inner join to include only matching records.

Keys: Specify the key fields that the join will be based on. Astera will auto-detect matching fields, but you can modify them as needed.
Left Field: Field from the current dataset.
Right Field: Field from the table in the shared connection.
In this case, we’ll keep the default key fields selected.

Once you’re done, click
Apply. The shared connection table will now be joined, and the result will appear in the grid.

Last updated
Was this helpful?