Data Compare

Data Compare

Data Compare

Description

The Data Compare Task is used to find and highlight differences between two datasets in a simple and efficient way. It helps users compare numeric values, identify changes or mismatches, and ensure data consistency across datasets.

Why to use

For Data Preparation 

When to use

When you want to compare two or more datasets.

When not to use

Prerequisites

The data must have at least one common column to select as a key column and at least one common numeric column to select as the column to compare.

Related Algorithms

Data Compare

Alternative Algorithms

Input

Two or more datasets

Output

Single dataset with a new column that flags each row as either "M" for a match when there is no difference, or "D" for a difference when a non-zero difference is found. Along with this, the output also shows the numeric value of the difference and the percentage difference.

Statistical Methods used

Limitations


    • Related Articles

    • Model Compare

      Model Compare is a feature, which is used to compare the performance of Classification and Regression algorithms. You can select the performance metrics on which the comparison is to be done. Based on the performance metrics results, Rubiscape ...
    • Data Preparation

      Data preparation is the process of cleaning and transforming raw data into organized data so that it can be processed and analyzed further. In data preparation, data is reformatted, corrected, and combined to enrich the data. Data preparation is ...
    • Data Unpivot

      Data Unpivot Description Data Unpivot is a way of transforming data from a wide format to a linear format. The source data is rearranged to make it a part of a single column in the new dataset. Why to use To transform the column data into row data ...
    • Data Unpivot

      Data Unpivot Description Data Unpivot is a way of transforming data from a wide format to a linear format. The source data is rearranged to make it a part of a single column in the new dataset. Why to use To transform the column data into row data ...
    • Data Pivot

      Data Pivot Description Data Pivot is a way of transforming data from a tall format to a wide format. The source data is rearranged in a way that unique values are converted into columns. Why to use To transform the row data into column data When to ...