Cross Validation

Cross Validation

Cross Validation is located under Model Studio () under Sampling, in Data Preparation, in the left task pane. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis.

Refer to Properties of Cross Validation.



Properties of Cross Validation

The available properties of Cross Validation are as shown in the figure given below.


The table given below describes the different fields present on the properties of Cross Validation.

Field

Description

Remark
RunIt allows you to run the node.-
ExploreIt allows you to explore the successfully executed node.-
Vertical Ellipses

The available options are

  • Run till node
  • Run from node
  • Publish as a model
  • Publish code
-

Task Name

It is the task selected on the workbook canvas.

You can click the text field to edit or modify the name of the task as required.

Number Of Folds

It allows the dataset to be split into the given number of folds.

Shuffle

It allows you to select whether or not to shuffle the input data while creating the different folds.

Its values are either True or False.

True: The data is shuffled before splitting into folds.

False: The data is not shuffled before splitting into folds.

Random Seed

It is the value that builds a pattern in random data. This ensures that the data is split in the same pattern every time the code is re-run.


Example of Cross Validation

Consider a flower dataset with 150 records. A snippet of input data is shown in the figure given below.


We apply Cross Validation to the input data. The output of Cross Validation is given as input to the Regression model, Ridge Regression.

The result displays the Regression Statistics for each of the folds, as shown in the figure below.


The final score for each of the different metrics on complete data is also displayed.


The result also displays Fold-wise Cross Validation (CV) Score, Standard Deviation, and Mean Score of all the CV scores.


Similarly, you can use Train Test Split and test any other Classification or Regression models.


    • Related Articles

    • Cross Validation

      Cross Validation is located under Model Studio () under Sampling, in Data Preparation, in the left task pane. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. ...
    • Workbook Validation

      In Rubiscape, you can drag-and-drop algorithms and datasets on the workbook or workflow canvas to build a model. When you run the model, Rubiscape validates it before execution. The validation feature is used to notify the validation errors that ...
    • Workbook Validation

      In Rubiscape, you can drag-and-drop algorithms and datasets on the workbook or workflow canvas to build a model. When you run the model, Rubiscape validates it before execution. The validation feature is used to notify the validation errors that ...
    • Model Validation

      Model validation is an enhancement of publishing a model. You can use this feature to explore the result of the published model for a selected dataset. In model validation, you can use the published model for the selected algorithm with the same ...
    • Validation using the Validate Option

      The Validate option is available on the Function Pane of the workbook or workflow canvas. To validate a workbook or workflow using the Validate option, follow the steps given below. Open the workbook or workflow on which you want to work. Refer to ...