Outlier & Anomaly Detection

Outlier vs. Anomaly

Outlier Detection

Univariate outlier detection

Z-score / Standard Score

Interquartile Range (IQR) Method

Multivariate outlier detection

Mahalanobis Distance

PCA

Elliptic Envelope (Robust Covariance)

FPOF (Fixed-Point Outlier Factor)

Counts Outlier Detector

Distance Metric Learning

Shared Nearest Neighbors (SNN)

Anomaly Detection (Contextual / Model-Based)

Types of Anomalies

Detection Methods

Probabilistic Modeling

Fit density model p(x) on training set and predict

Machine Learning-Based Methods

Doping (Synthetic Anomalies)

Evaluating Anomaly Detection

Use standard metrics from classification:

Anomaly detection vs. supervised learning

Why bother using anomaly detection rather than supervised learning if your examples already have labels?