Support Vector Machine

Support Vector Machine

Support Vector Machine

Description

  • SVM belongs to the family of supervised classification algorithms. It uses the kernel trick to transform data into the desired format and creates an optimum boundary between the possible outputs.
  • SVM performs remarkably even if there is limited available data for analysis.

Why to use

To classify text into the possible categories.

When to use

When textual data needs to be classified.

When not to use

When the dataset does not contain textual data.

Prerequisites

  • The dataset should not contain any missing (NaN) values.
  • The dataset should contain at least one categorical and one textual variable.

Input

Textual Data

Output

Classified data with predicted labels

Statistical Methods used

  • Accuracy
  • Sensitivity
  • Specificity
  • F-score
  • Confusion Matrix

Limitations

  • Choosing a good kernel
  • Takes long training time in case of large datasets
  • Not easy to tune hyperparameters (Gamma and Penalty Parameter)
    • Related Articles

    • Support Vector Machine In Regression

      Support Vector Machine In Regression Description The Support Vector Machine (SVM) approach for regression is a machine learning technique that seeks to identify a hyperplane that closely resembles the regression function. It reduces deviations ...
    • Machine Learning Concepts

      Advanced Entity Extraction Advanced entity extraction, also known as entity recognition, is used to extract vital information for natural language processing (NLP). It is widely used for finding, storing and sorting textual content into default ...
    • One Class SVM

      One-Class SVM Transformation Description One-Class Support Vector Machine (One Class SVM) is an unsupervised variation of SVM used for anomaly detection. One-Class SVM is an unsupervised algorithm for outlier detection. It detects whether a new data ...
    • rubiscape Concepts

      Code Fusion Code fusion in rubiscape gives an ingenious option to the users to build their models in programming languages such as JAVA, R, or Python and integrate them into rubiscape. Code fusion makes the rubiscape platform more customizable and ...
    • Rubiscape Winter '19

      New Features Platform & Studio New dataset creation feature for Twitter, PostgresSQL, SQL, MySQL, Oracle, Excel, CSV, Google News. Create dataset from a local TXT file using delimiter option. Supported delimiters are Semicolon, Pipe, Comma, Tab, ...