Isolation Forest | |||
Description | Isolation Forest is an unsupervised algorithm used for anomaly detection that isolates the anomalies rather than building a model of normal instances. | ||
Why to use | Isolation forest detects anomalies faster and requires less memory space compared to other anomaly detection algorithms. | ||
When to use | To handle high-dimensional and large-sized input data | When not to use | Inappropriate feature extraction & defining normal and abnormal behaviour in the data, variations in the abnormal data increase the dataset's complexity. In such cases, isolation forest cannot be used. |
Prerequisites | Data should contain only numeric/Continuous datatype variables. Data should not contain any missing values. | ||
Input | Any classification dataset with numeric input variables | Output |
|
Statistical Methods used | It works on the principle of the decision tree algorithm. It works on the principle of decision tree algorithms, but that cannot be defined in the statistical methods used section as a decision tree is an ML algorithm. | Limitations | It fails to detect local anomaly points, which affects the accuracy of the algorithm. |