> For the complete documentation index, see [llms.txt](https://documentation.astera.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://documentation.astera.com/astera-data-stack-v9/report-model/report-model-interface/data-regions-in-report-models.md).

# Data Regions in Report Models

To extract data from an unstructured document via template-based extraction, it is important to capture the parts of the document from where data can be extracted. In terms of the Report Model, the area captured within your source report is called a data region.

Creating data regions is the first step in designing a reusable extraction template. An extraction template is called a report model, and data regions are the backbone of report models as they direct Astera on where to extract the data from.

Data regions are defined by specifying a pattern and they may span over any number of lines in a source file as per the use-case.

There are different types of data regions, such as header, footer, single-instance, collection, and append, and some special regions like name-entity, and table region.

In this article, we will learn about the types of regions and their purpose in Astera.

## **Header and Footer Region**

As the name explains, header and footer regions are used to extract the information occurring at the beginning or end of a page. The data extracted in these regions can be a date, page number, author/company name, or any other recurring information at the top/bottom of every page.

{% hint style="info" %}
**Note:** It is important to note here that a report model is incomplete without a main data region. So, users must create a main data region first, and then create other types of regions.
{% endhint %}

On page 1 of the *Orders Report* (shown below), the first 3 lines can be extracted in a header region as they contain information about the time, date, and title of the report. Moreover, these lines occur on every page of the report.

The last line contains the page number, repeating on each page. Therefore, page numbers can be extracted in a footer region.

## **Single-Instance Data Region**

A single-instance data region is a sub-region that extracts a single set of data points.

A single-instance data region is used when the relationship between parent and child data regions is a one-to-one relationship. For example, in the sample data of the *Orders Report*, an order can only be placed by one Account. Therefore, we will use a single-instance data region to extract the details of the Account from the data source.

![](/files/XZHqD2N6Oic4O3ShWK0N)

A main data region is always a single-instance data region.

Single-instance data region node has a blue icon in the Model Layout panel.

## **Collection Data Region**

A collection data region is a sub-region that extracts multiple sets of data points.

We use a collection data region when the relationship between parent and child data regions is a one-to-many relationship. For example, in the same example of the *Orders Report*, there can be multiple items in one order. Therefore, we will use a collection data region to extract details of the items ordered.

![](/files/r30cVRxsb8gTyoATzgDn)

Collection data regions have a yellow icon in the Model Layout panel.

## **Append Region**

The *Append Region* is a region where the data extracted is concatenated or linked with the main data region. It is a region that you can add as part of a report model that would otherwise be left out of a data region. It can be before or after the main data region.

In the example below, the grand total can be extracted in an Append region as it needs to be concatenated with the main data region, Account. Notice that the grand total does not contain a pattern or logic like the lines containing the details of the account, therefore it needs to be in a separate append region.

## **AGL Data Regions**

In addition to the aforementioned data regions, Astera has some additional regions that are created when the layout is auto-generated using the *Auto-Generate Layout* (AGL) feature. The primary difference between AGL regions and other data regions is that AGL regions cannot be created manually. A user must run AGL for these regions to be created.

## **Table Region**

A table data region is the first region that is created when AGL is run. A table data region contains multiple sets of data points, just like collection regions. A table region is detected based on the names of the columns of a table. In Astera, column names are called headers. Users can adjust the height of the header after AGL extracts a table region to improve the accuracy of the area detected within the region.

A table region cannot be created independently or manually. It can only be created when an AGL operation detects a table in an unstructured document.

{% hint style="info" %}
**Note:** Column headers in a table region are different from a header region mentioned at the beginning of this article.
{% endhint %}

## **Name-Entity Region**

A name-entity region contains a single set of data points as key-value pairs. Name-entity regions are like the single-instance data regions in theory, meaning, there is a one-to-one relationship between the keys and values.

Name-entity regions cannot be created independently or manually. They are only created when an AGL operation detects key-value pairs in an unstructured document.

## **Data Regions Summary**

| **Region Type**   | **Description**                                                                                        | **Can it be created manually?** |
| ----------------- | ------------------------------------------------------------------------------------------------------ | ------------------------------- |
| Header and Footer | Extracts data occurring at the beginning/end of each page.                                             | Yes                             |
| Single-Instance   | Extracts single set of data points with a one-to-one relationship between parent and child region.     | Yes                             |
| Collection        | Extracts multiple sets of data points with a one-to-many relationship between parent and child region. | Yes                             |
| Append            | Extracts data which needs to be concatenated with the main data region.                                | Yes                             |
| Name-Entity       | Extracts area with key-value pairs. Clone of Single-Instance region.                                   | No (AGL region)                 |
| Table             | Extracts area which contains a table based on headers detected. Similar to collection region.          | No (AGL region)                 |


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://documentation.astera.com/astera-data-stack-v9/report-model/report-model-interface/data-regions-in-report-models.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.