Density Based Clustering | |||||
Description | It classifies the given set of data by building clusters based on the idea that a cluster in the data space is a continuous region of high point density, separated from other clusters by continuous regions of low density. | ||||
Why to use | It works well to separate data areas with a high density of observations from data areas that are not very dense with observation. DBSCAN can sort data into clusters of arbitrary shapes as well. | ||||
When to use |
| When not to use | When the number of clusters is known. | ||
Prerequisites | Input data should be of text type and should not contain special characters and numbers. | ||||
Input | Textual Data | Output | Data divided into clusters. | ||
Statistical Methods used |
| Limitations | It does not work well in the case of high-dimensional data or with clusters of varying densities. |