Chi Square Goodness of Fit Test
Chi Square Goodness of Fit Test is located under Model Studio ( ) in Hypothesis Test, in Statistical Analysis, in the left task pane. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. Refer to Properties of Chi Square Goodness of Fit Test.
Properties of Chi Square Goodness of Fit Test
The available properties of the Chi Square Goodness of Fit Test are as shown in the figure given below.
The table below describes the different fields present on the Properties pane of the Chi Square Goodness of Fit Test.
Field | Description | Remark |
Run | It allows you to run the node. | - |
Explore | It allows you to explore the successfully executed node. | - |
Vertical Ellipses | The available options are - Run till node
- Run from node
- Publish as a model
- Publish code
| - |
Task Name | It is the name of the task selected on the workbook canvas. | You can click the text field to edit or modify the name of the task as required. |
Feature | It allows you to select the categorical variable for the test. | Only one categorical variable can be selected. |
Advanced | Alpha | It allows you to set the level of significance. | The default value is 0.05. |
Node Configuration | It allows you to select the instance of the AWS server to provide control on the execution of a task in a workbook or workflow. | For more details, refer to Worker Node Configuration. |
Example of Chi Square Goodness of Fit Test
Consider a HR dataset containing features like Age, BusinessTravel, Daily Rate, Department, DistanceFromHome, Education, and so on. A snippet of the input data is shown in the figure given below.
The BusinessTravel feature is selected as the categorical variable for studying the Chi Square Goodness of Fit Test.
The part of the Result page containing charts for the Chi Square Goodness of Fit Test is displayed below.
On this part of the Result Page,
- Chart of Contribution to the Chi Square value by Category shows Combined Values depicting the contribution of each BusinessTravel frequency to the calculated Chi Square value.
- Chart of Observed and expected Values gives a comparative idea of the contribution of each BusinessTravel frequency to the calculated Chi Square value.
On this part of the Result Page,
- Chart of Contribution to the Chi Square value by Category shows Combined Values depicting the contribution of each BusinessTravel frequency to the calculated Chi Square value.
- Chart of Observed and expected Values gives a comparative idea of the contribution of each BusinessTravel frequency to the calculated Chi Square value.
- Null Hypothesis assumes that there is no difference between observed and expected values.
- Alternative Hypothesis assumes that there is significant difference between observed values and expected values.
- Computation table for Chi Square gives the Observed Frequency (O) and Expected Frequency (E) of the BusinessTravel feature in the categories, Travel_Rarely, Travel_Frequently, and Non-Travel. It also shows the values for (O-E), (O-E)2, (O-E)2 /E.
- The Result table for Chi Square gives the Critical Value (952.6082), Calculated Value (5.9915) for Chi Square. It also gives the p value (0) and alpha (0.05).
You observe that the p value is less than alpha. Thus, the Interpretation states that there is not enough evidence available to accept the null hypothesis. Thus, values are not coming from a normal distribution. This is because, there is a significant amount of difference between the observed values and expected values.
Related Articles
Chi Square Test
The Chi-Square test is located under Model Studio > Statistical analysis > Hypothesis Test > Non Parametric Test. Alternatively, use the search bar to find the Chi-Square test feature. Use the drag-and-drop method or double-click to use the algorithm ...
Chi Square Test for Independence
Chi Square Test for Independence is located under Model Studio ( ) in Hypothesis Test, in Statistical Analysis, in the left task pane. Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different ...
Train Test Split
Train Test Split is located under Model Studio () under Sampling in Data Preparation, in the left task pane . Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. ...
Train Test Split
Train Test Split is located under Model Studio ( ) under Sampling in Data Preparation, in the left task pane . Use the drag-and-drop method to use the algorithm in the canvas. Click the algorithm to view and select different properties for analysis. ...
Scale to Fit
Scale to Fit allows you to change the Widget visualization fit in View Mode depending on the available space. Hence, after selecting any of these options, click Save and switch to View Mode to visualize the fit. A vertical and horizontal scroll bar ...