Naïve Bayes

Naïve Bayes

Naïve Bayes

Description

Naïve Bayes is a family of classification algorithms based on the Bayes Theorem. All algorithms in the family share a common principle that the pair of features being classified are mutually independent.

Why to use

To classify text into possible categories.

When to use

When textual data needs to be classified.

When not to use

When the dataset does not contain textual data.

Prerequisites

  • The dataset should not contain any missing (NaN) values.
  • The dataset should contain at least one categorical and one textual variable.

Input

Textual Data

Output

Classified data with predicted labels

Statistical Methods used

  • Accuracy
  • Sensitivity
  • Specificity
  • F-score
  • Confusion Matrix

Limitations

It is difficult to get a set of independent predictors for the Naïve Bayes model



    • Related Articles

    • Categorical Naive Bayes

      Categorical Naive Bayes Description The categorical Naïve Bayes algorithm is suitable for categorically discrete values like Weather Prediction, and Medical Diagnosis. It is the simplest and fastest classification algorithm. Why to use It is the ...
    • Classification

      Classification is the process of predicting the class of given data points. Classes are referred to as targets/ labels or categories. Classification predictive modeling is the task of approximating a mapping function (f) from input variables (X) to ...
    • Rubiscape Winter '19

      New Features Platform & Studio New dataset creation feature for Twitter, PostgresSQL, SQL, MySQL, Oracle, Excel, CSV, Google News. Create dataset from a local TXT file using delimiter option. Supported delimiters are Semicolon, Pipe, Comma, Tab, ...
    • Rubiscape Spring '23

      New Features Studio Rubiscape Desktop All your favorite Rubiscape features, functions, and powerful dashboarding capabilities are now available in Rubiscape Desktop for Windows. With a completely new setup wizard, you can now customize your ...
    • Statistical Concepts

      Accuracy Accuracy (of classification) of a predictive model is the ratio of the total number of correct predictions made by the model to the total predictions made. Thus, Accuracy = (TP + TN) / (TP + TN + FP + FN) Where, TP, TN, FP, and FN indicate ...