Data Preparation

Data Preparation

Time-series Data Preparation is located under Forecasting (

 ), in the task pane on the left. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis.

Refer to Properties of Time-series Data Preparation


Properties of Time-series Data Preparation

The available properties of time-series Data Preparation in Forecasting are as shown in the figure given below.



The table given below describes the different fields present in the properties of time-series Data Preparation in Forecasting.

Field

Description

Remark

RunIt allows you to run the node.-
ExploreIt allows you to explore the successfully executed node.-
Vertical Ellipses

The available options are

  • Run till node
  • Run from node
  • Publish as a model
  • Publish code
-

Task Name

It displays the name of the selected task.

You can click the text field to edit or modify the name of the task as required.

Time ID Variable

It allows you to select the interval type variable for which we need to process the dependent or target variable's values.

  • Only one data field can be selected.
  • If selected, only an interval type variable should be selected.
  • Variables with numerical value are not available.

Target Variable

It allows you to select the experimental or predictor variable(s).

  • Only one data field can be selected.
  • Variables with only numerical value are available.

Group By

It allows you to select the variable for which you want to group the data.

  • Multiple data field can be selected.
  • Only categorical type variables are available.

Advanced

Interval

It allows to select the interval for the accumulation of data in the accumulation test.

Available options are –

  • Day
  • Week
  • Month
  • Quarter
  • Year

The default value is Month.

Time Format

It allows to select the time format to display the result of the selected tests.


Start Time

It allows to set the start time of the time-series data.


  • The default value is None.
  • The default setting specifies that the time-series data uses its own start time.

End Time

It allows to set the end time of the time-series data.

  • The default value is None.
  • The default setting specifies that the time-series data uses its own end time.

(info) 

Notes:

  • If you do not select the time format, the workbook gives an error when you click Save.
  • If you want to analyze sequential data without using the time column, you need not select the Time ID Variable data field.
  • You can analyze sequential data without the time column by executing the time-series data preparation tests to impute the missing values in the data, predict the next values, transform the data, and make the data stationary.
  • When you do not select the Time ID Variable data field, the Accumulation test checkbox is disabled for selection in the Result.

Example of Time-series Data Preparation

Consider an example of Sales time-series data. Here, the variable Date is of interval type.

In the Properties pane, the values are selected as below.

PropertyValue
Time IDOrder Date
Target VariableSales
Group BySegment
IntervalMonth

The figure below shows the values selected. 

We apply Data Preparation on the above data.

On the Result page, you can see Group as Consumer based on the selection in Group By in the Properties pane. When you select the check box for Select All Tests, all four tests, AccumulationMissing ValueTransformation, and Differencing are selected. Trace displays the log of the selected tests when they are executed. It logs the number of times each test starts and ends, based on the option selected in Group By.  

(info)Note:

You can also execute each test independently by selecting the corresponding check box, selecting the required values in the fields corresponding to the selected test, and then clicking Run Test.

When you click Run Selected Tests, the tests are performed based on the default function and values in the fields corresponding to each test. The result for each test is displayed in the figures given below.


    • Related Articles

    • Data Merge

      Data Merge is located under Model Studio ( ) in Data Preparation, in the task pane on the left. Use drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Properties of Data ...
    • Data Merge

      Data Merge is located under Model Studio ( ) in Data Preparation, in the task pane on the left. Use drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Properties of Data ...
    • Data Joiner

      Data Joiner is located under Model Studio ( ) in Data Preparation, in the task pane on the left. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Refer to ...
    • Data Joiner

      Data Joiner is located under Model Studio ( ) in Data Preparation, in the task pane on the left. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Refer to ...
    • Data Unpivot

      Data Unpivot is located under Model Studio ( ) in Data Preparation, in the left task pane. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Refer to Properties ...