Auto ML is a process that helps you to build models with the help of predefined datasets. You can select your datasets, the dependent, and the independent variables, and run the model. AutoML wizard suggests the best fit model for your dataset.
To run the Auto ML , follow the steps given below.
Select Target page is displayed.
Select the radio button corresponding to the Target or Dependent variable you want to select.
Note: | If the target variable is Categorical, you will get Classification algorithms and if the target variable is Numerical, you will get Regression algorithms. |
The table below describes the Advanced properties.
Property | Description |
Test Percentage | This is the percentage of the total dataset that is used to test the model. |
Random Seed | It is used to ensure that results are reproducible. |
Number Of Iterations | It is the repetition of the process to generate the outcome. |
Number of Samples | It shows the number of observations taken from the population. |
The performance metrics are rendered depending on the selection of selected target variable.
The table below describes the various Performance Metrices based on the target variable.
Target Variable | Recommended Algorithm | Performance Metrices |
---|---|---|
Categorical | Classification |
|
Numerical | Regression |
|
Note: |
|
Click Run ML.
The results are displayed after the execution of the model completes.
After running Auto ML, you get the result in the form of a chart and other statistics. It recommends the model and pipelines for the given data that the wizard comes across during the training of the model. The Result page shows the detailed metrics of the recommended model and pipelines.
Note: | Pipelines are executed by AutoML and it depends upon Number of Iterations and Number of Samples. |
The result page shows two types of results i.e. Result on Train data and Result on Test data.
The metrics in the Train data result are different from the Test data result. Train Data result is displayed by default, you can see the Overview result in the form of a Bar chart. On the right side of the Bar chart, models are recommended with all statistics.
While viewing the Data, you can choose to see the result sorted as per performance metrics applicable for the selected algorithm.
Performance metrics are part of the results of the algorithm. Runtime values change for the algorithm when you select train data and test data. The train data results are displayed in the below figure.
The Test data results show the changes in the runtime of the model and the performance metrics of the recommended model and pipelines. All these metrics are different from the train data results.
The details of the particular model and pipelines are displayed as per the performance metrics. Here, you can see the details of the parameters of the model and pipelines are built.
To see the details of the models and pipelines parameters, follow the steps given below, click View more details on the Result page, on the right side of the model.
For Classification Algorithms, You see the confusion metrices and specified algorithm parameters for the Classification algorithm only. The same procedure can be followed to see the detailed metrics for recommended pipelines.
Note: | The Confusion metrics graph is only for Classification algorithms. If you get Regression algorithm as recommended model, Confusion metrics and graph are not shown. |
The option of Re-Run ML is available in the wizard to change the parameters that have been selected earlier. If you do not want to use the recommended model, you can tweak some parameters and run the model again. After Re-Run of the model, you will get a new set of recommended models and pipelines.
To Re-Run ML follows the steps given below.
Click Re-Run ML.
Notes: |
|
New Recommended models and pipelines are displayed.
Once you get the recommended model and pipelines, you can publish them.
To publish the model, follow the steps below.
Click Publish.
Note: | You can Publish a pipeline, explore it, and see the result. |
When you use the Auto ML wizard, it creates a workbook for you. This workbook is created inside your current workspace and available even after you close the Auto ML wizard. You can perform workbook tasks on it.
To explore the model, follow the steps given below.
To view the publish model in the task panel, follow the steps given below.
You can manage the Auto ML workbook which is implicitly created by the Wizard.
You can perform all the operations on this workbook. For more details, refer to Working with Workbooks.
You can also validate the model.