Categorical Naive Bayes

Description

The Categorical Naïve Bayes algorithm is suited to classification problems whose features are discrete and categorical, such as weather prediction and medical diagnosis. It is one of the simplest and fastest classification algorithms.
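To make the description concrete, here is a minimal sketch of a categorical Naïve Bayes classifier in plain Python. It is an illustration only, not the product's implementation: the function names `fit` and `predict`, the toy weather data, and the add-one smoothing parameter `alpha` are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def fit(rows, labels, alpha=1.0):
    """Count class frequencies and per-feature category frequencies."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (feature index, class) -> category counts
    categories = defaultdict(set)        # feature index -> categories seen in training
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            categories[i].add(value)
    return class_counts, value_counts, categories, alpha


def predict(model, row):
    """Return the class with the highest log posterior for one row."""
    class_counts, value_counts, categories, alpha = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_label in class_counts.items():
        score = math.log(n_label / total)  # log prior
        for i, value in enumerate(row):
            count = value_counts[(i, label)][value]
            k = len(categories[i])
            # Smoothed likelihood; alpha > 0 keeps it strictly positive.
            score += math.log((count + alpha) / (n_label + alpha * k))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data: (outlook, humidity) -> play outside?
rows = [
    ("sunny", "high"), ("sunny", "high"), ("overcast", "high"),
    ("rainy", "normal"), ("rainy", "normal"), ("sunny", "normal"),
]
labels = ["no", "no", "yes", "yes", "yes", "yes"]

model = fit(rows, labels)
print(predict(model, ("sunny", "high")))    # no
print(predict(model, ("rainy", "normal")))  # yes
```

Working in log space avoids numeric underflow when many feature likelihoods are multiplied together.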

Why to use

It is one of the simplest and fastest classification algorithms for large volumes of textual data.

When to use

When you want to classify a large amount of data whose discrete features follow a categorical distribution.

When not to use

When your dataset does not contain a categorical dependent variable.

Prerequisites


Input

A dataset with one categorical dependent variable.

Output

  • Confusion Matrix
  • ROC Chart
  • Lift Chart
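As a rough illustration of the first output, a confusion matrix can be built by counting (actual, predicted) pairs. The helper name `confusion_matrix` and the sample labels below are hypothetical and are not taken from the product.

```python
from collections import Counter


def confusion_matrix(actual, predicted, labels):
    """Rows are actual classes, columns are predicted classes."""
    counts = Counter(zip(actual, predicted))
    return [[counts[(a, p)] for p in labels] for a in labels]


# Hypothetical actual and predicted class labels for six records.
actual = ["yes", "yes", "no", "no", "yes", "no"]
predicted = ["yes", "no", "no", "yes", "yes", "no"]

matrix = confusion_matrix(actual, predicted, ["yes", "no"])
print(matrix)  # [[2, 1], [1, 2]]
```

With "yes" treated as the positive class, the first row holds the true positives and false negatives, and the second row holds the false positives and true negatives.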

Statistical Methods Used

  • Accuracy
  • F-score
  • Precision
  • Sensitivity/Recall
  • Specificity
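The measures listed above can all be derived from the four cells of a binary confusion matrix. The sketch below uses the standard definitions; the function name `classification_metrics` and the counts are made up for illustration, and it assumes every denominator is non-zero.

```python
def classification_metrics(tp, fp, fn, tn):
    """Standard binary classification measures from confusion-matrix counts.

    Assumes every denominator is non-zero.
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # also called sensitivity
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "precision": precision,
        "recall": recall,
        "specificity": tn / (tn + fp),
        "f_score": 2 * precision * recall / (precision + recall),
    }


# Hypothetical counts: 40 true positives, 10 false positives,
# 20 false negatives, 30 true negatives.
metrics = classification_metrics(tp=40, fp=10, fn=20, tn=30)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For these counts, accuracy is 0.700, precision 0.800, recall 0.667, specificity 0.750, and F-score 0.727.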

Limitations

  • If a category appears in the test dataset but was not present in the training dataset, the algorithm assigns it zero frequency, and therefore zero probability.
  • This method has a performance overhead.
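The zero-frequency limitation can be seen in a single likelihood term. The sketch below compares the unsmoothed estimate with additive (Laplace) smoothing, a common remedy; whether the product applies smoothing is not stated here, so treat `alpha` and the helper name `likelihood` as assumptions.

```python
def likelihood(count, class_total, n_categories, alpha):
    """Estimate P(category | class) with additive smoothing of strength alpha."""
    return (count + alpha) / (class_total + alpha * n_categories)


# A category never seen for this class: 0 of 4 training rows, 3 possible categories.
unsmoothed = likelihood(0, 4, 3, alpha=0.0)
smoothed = likelihood(0, 4, 3, alpha=1.0)

print(unsmoothed)          # 0.0
print(round(smoothed, 4))  # 0.1429
```

Because Naïve Bayes multiplies the per-feature likelihoods, a single zero term forces the whole class probability to zero, whatever the other features indicate.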