AI Match Transformation

Astera Data Stack introduces an innovative AI Matching feature which leverages the power of Artificial Intelligence to perform intelligent matching. This feature works based on semantic similarity, ensuring more accurate and comprehensive matching results.

AI Match in Astera Data Stack

In Astera Data Stack, the AI Match object can be found in the Toolbox and can be used within the scope of the Dataflow.

Use Case

For our use case, we have a Customers dataset from the sales department as shown below:

We want to replace the values in the Country column of the sales dataset by semantically matching them with Country values from the Customers dataset provided by the marketing team, ensuring both departments follow a unified naming standard.

  1. To get started, let’s drag-and-drop an Excel Workbook Source object and configure it with the customers dataset provided by the sales department.

  1. Next, drag-and-drop the AI Match object from the Toolbox onto the Dataflow and auto-map the fields from the Excel Workbook Source onto the AI Match object.

  1. Once all the fields have been mapped, right-click on the AI Match object and select Properties from the context menu.

  1. This will open the Layout Builder screen, which shows the layout of the incoming dataset. Click Next.

  1. The AIMatch Transformation Properties screen will open, let’s configure these properties.

  • File Path: This is where we provide the path of the file on the basis of which we want to perform our semantic matching.

  • Worksheet: This is where we can define which Excel sheet data to use if there are multiple sheets.

  • Lookup Field: This is the field based on which we are performing the lookup.

  • Incoming Field: This lets us define the lookup field from the incoming dataset

For our use case, let’s select the Country Field for both.

  1. Once done, click OK and right-click on the AI Match object to preview its output.

As you can see below, the values in the Country field have been semantically matched and replaced from the file, using AI. We can also see that, since the country Pakistan did not have a matching field in the marketing dataset, it hasn’t been replaced.

  1. Now, let’s drag-and-drop a Database Table Destination object and map the matched data onto it.

Running this Dataflow will write the data to the destination table.

This concludes the working of the AI Match object in Astera Data Stack.

Last updated

© Copyright 2023, Astera Software