Data Preparation

Data Preparation

What is Data Preparation
Data preparation is the process of cleaning and transforming raw data into organized data so that it can be processed and further analyzed. In data preparation, data is reformatted, corrected, and combined so that it gets enriched.
Why is Data Preparation required
Data preparation is complex yet essential to create relevant contextual data. This makes the analysis of such data, efficient, and produces reliable and insightful results. In the absence of precise data preparation, we may get a biased data which may result in poor analysis and erroneous results.

How is Data Preparation done in rubiscape
In rubiscape, there is a comprehensive set of algorithms for performing data preparation. They are used singularly or in combination with other algorithms to remove any anomaly in the dataset. Each algorithm has a specific function, which can be used to enhance the data quality. In rubiscape, the user can find the missing values, create additional data columns, merge and join data, and so on.
In rubiscape, the Data Preparation algorithms are,

  • Aggregation
  • Combined Data Cleansing
  • Data Joiner
  • Data Merge
  • Data Pivot
  • Data Unpivot
  • Descriptive Statistics
  • Expression
  • Factor Analysis
  • File Management
  • Filtering
  • Lookup 
  • Missing Value Imputation
  • Outlier Detection
  • PCA
  • Sequence Generator
  • Sorting

In the task pane, click Model Studio, and then click Data Preparation.



For more information, refer to Data Preparation

    • Related Articles

    • Data Preparation

      Data preparation is the process of cleaning and transforming raw data into organized data so that it can be processed and analyzed further. In data preparation, data is reformatted, corrected, and combined to enrich the data. Data preparation is ...
    • Data Preparation in Forecasting

      Data Preparation is the process of cleaning and transforming raw data into organized data so that it can be processed and analyzed further. In data preparation, data is reformatted, corrected, and combined to enrich the data. Data preparation is ...
    • Time-series Data Preparation

      Time-series Data Preparation organizes and formats transactional data into time-series data to predict trends and seasonality in the data. Transactional data is timestamped data recorded over a period at no specific frequency, while time-series data ...
    • Time-series Data Preparation Tests in Forecasting

      The different tests available in Time-series Data Preparation under Forecasting are given below. Accumulation Missing Value Transformation Differencing Data Preparation Description The time-series data may contain missing values that need to be ...
    • Data Joiner

      Data Joiner Description Data joiner is a method to join two or more datasets. It is used to join rows based on a related column present in two or more datasets. Why to use For Data Preparation When to use When you want to join two or more datasets. ...