Chi Square Goodness of Fit Test

Chi Square Goodness of Fit Test is located under Model Studio, in Hypothesis Test, in Statistical Analysis, in the left task pane. Drag and drop the algorithm onto the canvas to use it. Click the algorithm to view and select different properties for analysis. Refer to Properties of Chi Square Goodness of Fit Test.



Properties of Chi Square Goodness of Fit Test

The available properties of the Chi Square Goodness of Fit Test are as shown in the figure given below.


The table below describes the different fields present on the Properties pane of the Chi Square Goodness of Fit Test.

| Field | Description | Remark |
|---|---|---|
| Run | It allows you to run the node. | - |
| Explore | It allows you to explore the successfully executed node. | - |
| Vertical Ellipses | The available options are Run till node, Run from node, Publish as a model, and Publish code. | - |
| Task Name | It is the name of the task selected on the workbook canvas. | You can click the text field to edit or modify the name of the task as required. |
| Feature | It allows you to select the categorical variable for the test. | Only one categorical variable can be selected. |
| Alpha (under Advanced) | It allows you to set the level of significance. | The default value is 0.05. |
| Node Configuration | It allows you to select the instance of the AWS server to provide control on the execution of a task in a workbook or workflow. | For more details, refer to Worker Node Configuration. |

Example of Chi Square Goodness of Fit Test

Consider an HR dataset containing features like Age, BusinessTravel, DailyRate, Department, DistanceFromHome, Education, and so on. A snippet of the input data is shown in the figure given below.


The BusinessTravel feature is selected as the categorical variable for the Chi Square Goodness of Fit Test.

The part of the Result page containing charts for the Chi Square Goodness of Fit Test is displayed below.




On this part of the Result Page,

  • Chart of Contribution to the Chi Square value by Category shows Combined Values, depicting how much each BusinessTravel category contributes to the calculated Chi Square value.
  • Chart of Observed and Expected Values compares the observed frequency of each BusinessTravel category with its expected frequency.
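The quantities behind these two charts can be sketched in a few lines of Python. This is only an illustration, not the product's implementation: the observed counts are assumed values that do not come from this article, the helper name chi_square_table is hypothetical, and the expected frequency is assumed to be equal for every category.

```python
# Illustrative observed counts per BusinessTravel category.
# Assumption: these numbers are NOT taken from the article.
observed = {
    "Travel_Rarely": 1043,
    "Travel_Frequently": 277,
    "Non-Travel": 150,
}


def chi_square_table(observed):
    """Return per-category rows and the total chi-square statistic.

    Assumption: every category has the same expected frequency.
    """
    total = sum(observed.values())
    expected = total / len(observed)
    rows = []
    for category, o in observed.items():
        diff = o - expected
        rows.append({
            "category": category,
            "O": o,
            "E": expected,
            "O-E": diff,
            "(O-E)^2": diff ** 2,
            # Contribution of this category to the chi-square value.
            "(O-E)^2/E": diff ** 2 / expected,
        })
    statistic = sum(row["(O-E)^2/E"] for row in rows)
    return rows, statistic


rows, statistic = chi_square_table(observed)
for row in rows:
    print(f'{row["category"]:<18} O={row["O"]:>5}  E={row["E"]:.1f}  '
          f'contribution={row["(O-E)^2/E"]:.4f}')
print(f"chi-square statistic = {statistic:.4f}")
```

The per-category contribution column corresponds to the first chart, and the O and E columns correspond to the second. With these assumed counts the statistic comes out near 952.61.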

       

On this part of the Result Page,

  • Null Hypothesis assumes that there is no difference between the observed values and the expected values.
  • Alternative Hypothesis assumes that there is a significant difference between the observed values and the expected values.
  • Computation table for Chi Square gives the Observed Frequency (O) and Expected Frequency (E) of the BusinessTravel feature in the categories Travel_Rarely, Travel_Frequently, and Non-Travel. It also shows the values for (O-E), (O-E)², and (O-E)²/E.
  • The Result table for Chi Square gives the Calculated Value (952.6082) and the Critical Value (5.9915) for Chi Square. It also gives the p value (0) and alpha (0.05).

You observe that the p value is less than alpha, so the null hypothesis is rejected. The Interpretation therefore indicates a significant difference between the observed values and the expected values, which means that the BusinessTravel frequencies do not follow the expected distribution.
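The decision described above can be checked with a minimal Python sketch. It treats 952.6082 as the calculated statistic and uses alpha = 0.05 from the example. The 2 degrees of freedom are an assumption based on the three BusinessTravel categories, and the function name chi_square_decision_df2 is hypothetical. The closed-form expressions hold only for 2 degrees of freedom.

```python
import math


def chi_square_decision_df2(statistic, alpha=0.05):
    """Decision rule for a chi-square test with 2 degrees of freedom.

    For df = 2 the chi-square distribution is an exponential distribution
    with mean 2, so the upper-tail probability is exp(-x / 2) and the
    critical value is -2 * ln(alpha). Other df values need a different formula.
    """
    critical_value = -2.0 * math.log(alpha)
    p_value = math.exp(-statistic / 2.0)
    reject_null = p_value < alpha
    return critical_value, p_value, reject_null


critical, p, reject = chi_square_decision_df2(952.6082, alpha=0.05)
print(f"critical value = {critical:.4f}")
print(f"p value = {p:.4g}")
print("reject null hypothesis" if reject else "fail to reject null hypothesis")
```

Under these assumptions the critical value evaluates to about 5.9915, and the p value is so small that it is displayed as 0 when rounded to a few decimal places.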

