Line Count

Line Count option in Astera ReportMiner enables users to specify the number of lines over which a data region spans. This feature is useful when transposing data that appears in rows in an unstructured file and convert it into vertical fields (columns) inside a report model.

In this document, we will explore how Line Count feature helps with the selection of a data region in Astera ReportMiner.

Sample Use-Case

In this case, we have unstructured data in a PDF file.

Download the sample PDF file from here.

1_data

This file contains a customer list report including their account name, contact, and address details.

You can see that a single record spans over 4 lines on this PDF file. In order to capture this data and place it into different fields, we will use the Line Count feature.

First, let’s load this unstructured file onto ReportMiner’s designer.

1. Go to File > New > Report Model and select the source file from your directory.

1-file-path

There are many options available in the Report Options panel to configure how ReportMiner reads the unstructured PDF file such as specifying the scaling factor, font, tab size, and password.

You can read about these options here.

2. Click OK and the source file will open on the designer.

2-file-loaded

Now that we have loaded the source file in ReportMiner, let’s create an extraction template.

Creating a Report Model

1. Right-click on the Record node under the Model layout panel, and select Add Data Region from the context menu.

3-add-data-region

A pattern-matching bar and Region Properties panel will appear and a Data node will be added under the Record node in the Model Layout panel.

4-data-node

2. Specify a pattern with which ReportMiner can match your source file to capture the desired data. You can use an alphabet, character, number, word, or a wild card or any combination of these to define your pattern.

In this case, write “ACCOUNT” in the pattern-matching bar as shown below.

5-matching-pattern

You can see that only one field information is captured through matching pattern. In order to capture the data spanning over 4 lines, we must increase the Line Count to 4.

3. Increase the Line Count value to 4 from the Region Properties panel.

6-increasing-line-count

Each data region block contains information of a single record. Let’s create data fields.

4. Highlight the data region after “ACCOUNT:”, right-click on it and select Add Data Field from the context menu. Rename it to Account.

adding-data-fields-gif

5. Repeat the process to create more data fields. You can see the layout of the extraction template in the Model Layout panel.

7-model-layout

6. You can preview data by clicking on the Preview Data icon placed in the toolbar at the top of the designer window.

8-preview-data

A Data Preview window will open displaying a preview of the extracted data.

9-output

This concludes using the Line Count option while building an extraction template in Astera ReportMiner. The source file for the use-case discussed above is attached below.

Customer_List_Report.pdf