Density Estimation

What is density estimation?

Density estimation = reconstructing the probability density function of a random variable $X$, given a sample of random variates $X_1, X_2, \ldots, X_n$.

histogram (the traditional approach)
- produces a discontinuous, step-shaped density estimate rather than a smooth curve
- the choice of binning (bin width and bin origin) can have a disproportionate effect on the resulting visualization
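
A minimal sketch of the binning sensitivity, assuming the two points above refer to a histogram-based estimate. The function name `histogram_density` and the sample values are made up for illustration. The same data, evaluated at the same point with the same bin width, gives a different density once the bin origin shifts.

```python
import math

def histogram_density(sample, x, bin_width, origin=0.0):
    """Histogram density estimate at x: (count in x's bin) / (n * bin_width)."""
    k = math.floor((x - origin) / bin_width)   # index of the bin containing x
    left = origin + k * bin_width              # left edge of that bin
    count = sum(1 for xi in sample if left <= xi < left + bin_width)
    return count / (len(sample) * bin_width)

sample = [0.9, 1.1, 1.2, 1.8, 2.1, 2.9]        # illustrative data only

# Same point, same bin width, different bin origin.
a = histogram_density(sample, 1.6, bin_width=1.0, origin=0.0)  # bin [1.0, 2.0) -> 3/6 = 0.5
b = histogram_density(sample, 1.6, bin_width=1.0, origin=0.5)  # bin [1.5, 2.5) -> 2/6
print(a, b)
```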

The naive density estimator works as a "moving histogram": it extends the traditional histogram by estimating the density at a point from the observations that fall within an interval centered on that point, so the bins move with the evaluation point instead of being fixed in advance.
Kernel density estimation (KDE) generalizes the naive density estimator by replacing the uniform density function with an arbitrary density function, the kernel function.
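
A minimal sketch of that generalization, assuming the standard form $\hat{f}(x) = \frac{1}{nh} \sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right)$ with half-width (bandwidth) $h$. The names `uniform_kernel`, `gaussian_kernel`, and `kde` are illustrative, not taken from any library. Passing the uniform kernel gives the naive estimator; passing any other density gives a KDE.

```python
import math

def uniform_kernel(u):
    """Uniform density on (-1, 1): the kernel behind the naive estimator."""
    return 0.5 if abs(u) < 1 else 0.0

def gaussian_kernel(u):
    """Standard normal density: one common smooth kernel choice."""
    return math.exp(-0.5 * u * u) / math.sqrt(2 * math.pi)

def kde(sample, x, h, kernel=gaussian_kernel):
    """Kernel density estimate at x: (1 / (n*h)) * sum of K((x - X_i) / h)."""
    n = len(sample)
    return sum(kernel((x - xi) / h) for xi in sample) / (n * h)

sample = [1.0, 2.0, 2.5, 4.0]                  # illustrative data only

# Naive estimator: counts observations strictly inside (x - h, x + h).
naive = kde(sample, 2.0, h=1.0, kernel=uniform_kernel)   # 2 of 4 points -> 0.25
# Same function with a smooth kernel.
smooth = kde(sample, 2.0, h=1.0, kernel=gaussian_kernel)
print(naive, smooth)
```

Because each kernel is itself a density, the resulting estimate is non-negative and integrates to 1.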

kernel
- specifies the shape of the distribution placed at each point
- kernel selection: Kernel Function#Types of Kernels
kernel bandwidth
- controls the size of the kernel at each point
- it acts as a smoothing parameter, controlling the tradeoff between bias and variance in the result
- a large bandwidth leads to a very smooth (i.e. high-bias) density estimate that can blur real features such as multiple modes
- a small bandwidth leads to a rough, undersmoothed (i.e. high-variance) density estimate that follows noise in the sample
- bandwidth selection methods:
  - unbiased cross-validation
  - Sheather-Jones methods
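
A rough sketch of bandwidth selection by unbiased (least-squares) cross-validation, assuming a Gaussian kernel and a simple grid search over candidate bandwidths. The score is $\mathrm{UCV}(h) = \int \hat{f}^2 - \frac{2}{n} \sum_i \hat{f}_{-i}(X_i)$, where $\hat{f}_{-i}$ is the estimate computed without $X_i$. For a Gaussian kernel the integral has a closed form. The names `ucv_score` and `select_bandwidth` and the candidate grid are hypothetical. Sheather-Jones is not shown here.

```python
import math
import random

def normal_pdf(d, sd):
    """Density of N(0, sd^2) evaluated at d."""
    return math.exp(-0.5 * (d / sd) ** 2) / (sd * math.sqrt(2 * math.pi))

def ucv_score(sample, h):
    """UCV(h) for a Gaussian-kernel KDE; lower is better."""
    n = len(sample)
    integral_sq = 0.0   # n^2 * integral of f_hat^2 (two Gaussians convolve to sd sqrt(2)*h)
    loo_sum = 0.0       # sum over i != j of K_h(X_i - X_j), the leave-one-out part
    for i, xi in enumerate(sample):
        for j, xj in enumerate(sample):
            d = xi - xj
            integral_sq += normal_pdf(d, math.sqrt(2) * h)
            if i != j:
                loo_sum += normal_pdf(d, h)
    return integral_sq / n ** 2 - 2 * loo_sum / (n * (n - 1))

def select_bandwidth(sample, candidates):
    """Pick the candidate bandwidth with the smallest UCV score (grid search)."""
    return min(candidates, key=lambda h: ucv_score(sample, h))

random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(50)]   # synthetic sample, illustration only
grid = [0.05 * k for k in range(1, 41)]              # candidate bandwidths 0.05 .. 2.0
best_h = select_bandwidth(data, grid)
print(best_h)
```

The double loop costs O(n^2) per candidate, which is fine for small samples. The score becomes degenerate when the sample contains many exact ties, so treat this as an illustration rather than a robust selector.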