Chi Square Test for Independence

Description

Chi Square Test for Independence determines whether two categorical variables are related or independent.

Why to use

To test the independence or association between categorical variables.

When to use

When the dataset contains at least two categorical variables.

When not to use

When the data is continuous.

Prerequisites

The data should be categorical.

Input

Two categorical variables

Output

  • Chi Square Statistic
  • Observed Frequency Table and Expected Frequency Table for the selected categorical variables
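
The article does not spell out how these outputs are computed. As a hedged reference, the standard Pearson form is sketched below; the symbols O_ij (observed count in row i, column j), R_i (row total), C_j (column total), N (total number of cases), r (number of rows), and c (number of columns) are notation introduced here, not taken from the article.

```latex
E_{ij} = \frac{R_i \, C_j}{N}, \qquad
\chi^2 = \sum_{i=1}^{r} \sum_{j=1}^{c} \frac{(O_{ij} - E_{ij})^2}{E_{ij}}, \qquad
\text{df} = (r - 1)(c - 1)
```

A larger statistic, relative to its degrees of freedom, indicates stronger evidence against independence.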

Statistical Methods used

Limitations

It can be used only on categorical data.


The Chi Square Test for Independence is a hypothesis test. It compares two categorical variables to check whether they are related to each other. It uses a contingency table (cross table) for the analysis of data. In a cross table, the data is classified according to the two categorical variables. The categories of one variable appear in the rows, while the categories of the other variable appear in the columns. Each cell holds the count of cases for a specific pair of categories.
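
The cross table described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the implementation used by any particular tool: the function name `chi_square_independence` and the sample data (`gender`, `choice`) are hypothetical, and the expected counts use the usual row total × column total ÷ grand total rule.

```python
from collections import Counter


def chi_square_independence(rows, cols):
    """Return (observed, expected, statistic, dof) for two categorical sequences.

    Hypothetical helper for illustration; `rows` and `cols` hold one
    category label per case and must have the same length.
    """
    if len(rows) != len(cols):
        raise ValueError("both variables must have one value per case")

    row_levels = sorted(set(rows))
    col_levels = sorted(set(cols))
    n = len(rows)

    # Cross table: one cell per (row category, column category) pair.
    counts = Counter(zip(rows, cols))
    observed = [[counts[(r, c)] for c in col_levels] for r in row_levels]

    # Expected count under independence: row total * column total / n.
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    expected = [[rt * ct / n for ct in col_totals] for rt in row_totals]

    # Sum of (observed - expected)^2 / expected over all cells.
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(row_levels) - 1) * (len(col_levels) - 1)
    return observed, expected, statistic, dof


# Made-up sample: 60 cases classified by two categorical variables.
gender = ["F"] * 30 + ["M"] * 30
choice = ["A"] * 20 + ["B"] * 10 + ["A"] * 10 + ["B"] * 20

observed, expected, statistic, dof = chi_square_independence(gender, choice)
print("Observed:", observed)
print("Expected:", expected)
print("Chi-square statistic:", round(statistic, 4), "with", dof, "degree(s) of freedom")
```

The sketch stops at the statistic and degrees of freedom. Turning them into a p-value requires the chi-square distribution, which a statistics library would normally supply.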