
# Source Diff Processor

The *Source Diff Processor* object is one of the Database Write Strategies offered in Astera. It works like the *Database Diff Processor*, but it is used to perform write actions (such as Insert, Update, and Delete) on file destinations. It stores a snapshot of the data processed in the first run in a CDC file, so the next time you run it, it only imports the new records.

### Use Case

We have a sample *Employees* dataset coming in from an *Excel Workbook Source*. Initially, it contained records of 10 employees; later, 2 more were added to the source dataset. We want to apply a database write strategy that can read the data incrementally from file sources. To achieve this, we will use the *Source Diff Processor* in Astera.

### How to Work with Source Diff Processor

1. Drag-and-drop the *Source Diff Processor* object from *Toolbox > Database Write Strategy > Source Diff Processor* onto the dataflow designer and map the source data to it.

![](/files/MEzKll90eP0jtwxCLs5w)

2. Right-click on the *Source Diff Processor* object’s header and select *Properties*.

![](/files/ewnjbXta9P2ZyOsgkwlF)

3. A *Layout Builder* window will open where you can modify your layout. Click *Next*.

![](/files/DtdjdhGFz5PyX7OcFjYD)

4. The next window is the *Incremental Write Options* window.

![](/files/sysgSe1s7k3t1nyoTHUU)

Here, you have to specify the *Record Matching* field. This field is used to match and compare the incoming and existing records. We will select *EmployeeID* as the *Record Matching* field.

*Case Sensitive* – Check this option if you want to compare records on a case-sensitive basis.

*Sort Input* – Check this option if you want to sort the incoming data.

![](/files/ki6QkFWxXmxpqAgHitmS)

* If the incoming dataset has a record with a new *EmployeeID*, i.e., an ID that is not present in the existing file being compared against the incoming file, Astera will perform the INSERT action.
* If the *EmployeeID* is already present in the existing file, Astera will compare the records with that ID and perform the UPDATE action on the fields where the information has changed.
* If the *EmployeeID* is present in the existing file but not in the incoming file, the record has been deleted. In this case, Astera will perform the DELETE action.
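The matching rules above can be illustrated with a minimal Python sketch. This is not Astera's implementation; the function name `diff_records` and the sample rows are hypothetical, and records are assumed to be plain dictionaries keyed by the *Record Matching* field.

```python
def diff_records(existing, incoming, key="EmployeeID"):
    """Tag each record with the action implied by comparing two datasets on `key`.

    Illustrative only: `existing` stands in for the stored snapshot and
    `incoming` for the data arriving from the file source.
    """
    old = {row[key]: row for row in existing}
    new = {row[key]: row for row in incoming}
    actions = []
    for k, row in new.items():
        if k not in old:
            actions.append(("INSERT", row))   # key is new
        elif row != old[k]:
            actions.append(("UPDATE", row))   # key exists, some field changed
    for k, row in old.items():
        if k not in new:
            actions.append(("DELETE", row))   # key disappeared from the source
    return actions


# Hypothetical sample data
existing = [{"EmployeeID": 1, "Name": "Ann"}, {"EmployeeID": 2, "Name": "Bob"}]
incoming = [{"EmployeeID": 1, "Name": "Anne"}, {"EmployeeID": 3, "Name": "Cy"}]
result = diff_records(existing, incoming)
```

In this sketch, unchanged records produce no action at all; how Astera reports them is not covered here.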

In the *Output Options* section, you can select either the *Single Output* option or *One Port for Each Action*.

![](/files/ubr2SE97dU8ZhMlILdDl)

#### *Single Output:*

Select the *Single Output* option if you wish to load your data into the destination without modifying it further on the basis of individual write actions. If you select *Single Output*, the database action, such as INSERT, UPDATE, SKIP or ERROR, will be chosen by the database write strategy’s logic rather than being specified by the user. Using a *Single Output* is recommended when a database write strategy is applied.

#### *One Port for Each Action:*

*One Port for Each Action* is used when you want to further transform or log your data. If you select *One Port for Each Action*, you will get a separate node for each Diff action in the *Source Diff Processor* object.
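Conceptually, the difference between the two output options resembles keeping action-tagged records in one stream versus routing them into separate streams. The following Python sketch illustrates that idea only; `split_by_action` and the sample records are hypothetical and do not reflect Astera's internals.

```python
from collections import defaultdict


def split_by_action(tagged_records):
    """Group (action, record) pairs into one list per action.

    A rough analogy for "One Port for Each Action": each key in the
    returned dictionary plays the role of a separate output node.
    """
    ports = defaultdict(list)
    for action, record in tagged_records:
        ports[action].append(record)
    return dict(ports)


# Hypothetical tagged records, as a single combined stream
tagged = [
    ("INSERT", {"EmployeeID": 11}),
    ("UPDATE", {"EmployeeID": 3}),
    ("INSERT", {"EmployeeID": 12}),
]
ports = split_by_action(tagged)
```

With separate streams, each group can be transformed or logged on its own, for example by writing only the `INSERT` group to an audit file.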

In this example, we will select *Single Output*.

The third section in the *Incremental Write Options* window is the *Incremental Transfer Information File Path* option. Here, you must specify the file path where you want to store information related to the last run.

![](/files/MxgRvSlOzaaQNRgDvF6Q)

If you have worked with the *Excel Workbook Source* and *Database Table Source* objects in Astera, you may have noticed that the *Database Table Source* object gives you the option to read incremental changes. However, no such option is available in Excel or other file source objects. This option in the *Source Diff Processor* enables you to read incrementally from different file formats such as Excel, Delimited, and Fixed Length.
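To see why a stored file makes incremental reads from plain files possible, consider the following Python sketch. It is an illustration under stated assumptions: the JSON snapshot format, the file name `employees_snapshot.json`, and the function `read_incrementally` are all hypothetical, and the actual format of Astera's incremental transfer information file is not described on this page.

```python
import json
import os
import tempfile


def read_incrementally(rows, snapshot_path, key="EmployeeID"):
    """Return rows whose key was absent from the last run, then save a new snapshot."""
    seen = {}
    if os.path.exists(snapshot_path):
        with open(snapshot_path, encoding="utf-8") as f:
            seen = json.load(f)
    # JSON object keys are strings, so keys are compared as strings
    new_rows = [r for r in rows if str(r[key]) not in seen]
    with open(snapshot_path, "w", encoding="utf-8") as f:
        json.dump({str(r[key]): r for r in rows}, f)
    return new_rows


# Mirrors the use case: 10 employees on the first run, 12 on the second
snapshot = os.path.join(tempfile.mkdtemp(), "employees_snapshot.json")
first_run = read_incrementally([{"EmployeeID": i} for i in range(1, 11)], snapshot)
second_run = read_incrementally([{"EmployeeID": i} for i in range(1, 13)], snapshot)
```

Because the snapshot lives in a file rather than in the source itself, the same approach applies to any file format that can be read into rows. This sketch only detects new keys; detecting updates and deletions would also require comparing field values and missing keys.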

Click *OK*.

5. Now, right-click on the *Source Diff Processor* object’s header and select *Preview Output*.

![](/files/OsRPAFn2unXjgukfhNQ2)

Output preview for *Single Output*:

![](/files/wTpU5lm8hK4OKRAik3Xk)

Output preview if *One Port for Each Action* is selected:

![](/files/icwlJu1U8zOt6u8W8xez)

You can now write your data to any destination or perform any transformation on the dataset.

![](/files/HBQGFD1kowIDQtAuT8BW)

This concludes using the *Source Diff Processor* write strategy in Astera.

