!main_tags!Probability Density Function - probability | What's Your IQ !main_header!

Definition

Basic Concept

Probability Density Function (PDF): function describing relative likelihood of continuous random variable taking a value near x. Denoted as f(x). Integral over interval gives probability.

Mathematical Definition

For continuous random variable X, PDF f(x) satisfies:

f(x) ≥ 0 for all x ∈ ℝ∫_{-∞}^{∞} f(x) dx = 1

Interpretation

f(x) not probability itself but density. Probability over interval [a, b] is area under curve:

P(a ≤ X ≤ b) = ∫_{a}^{b} f(x) dx

"Probability densities embody the continuous analog to discrete probabilities, enabling precise quantification of uncertainty across intervals." -- William Feller

Properties

Non-negativity

f(x) ≥ 0 for all x. Ensures valid probability interpretation.

Normalization

Total area under f(x) equals 1:

∫_{-∞}^{∞} f(x) dx = 1

Probability Calculation

Probability over any interval equals integral of f(x) over that interval.

Support

Set of x where f(x) > 0. Outside support, PDF equals zero.

Continuity

PDF often continuous, but can have discontinuities if distribution is mixed.

Property Description
Non-negativity f(x) ≥ 0 ∀x
Normalization ∫ f(x) dx = 1
Support Set of x with f(x) > 0

Relation to Cumulative Distribution Function

Definition of CDF

Cumulative Distribution Function (CDF) F(x) = P(X ≤ x). Non-decreasing, right-continuous, limits 0 and 1.

Connection Between PDF and CDF

PDF is derivative of CDF wherever derivative exists:

f(x) = dF(x)/dx

Recovering CDF from PDF

CDF found by integrating PDF:

F(x) = ∫_{-∞}^x f(t) dt

Computing Probabilities

Probability Over Interval

Calculate probability X lies between a and b:

P(a ≤ X ≤ b) = ∫_{a}^{b} f(x) dx

Point Probability

For continuous variables, P(X = x) = 0 because integral over zero-width interval.

Using Improper Integrals

Probabilities over infinite intervals use improper integrals:

P(X > c) = ∫_{c}^{∞} f(x) dx

Numerical Integration

When analytic integration impossible, apply numerical methods: trapezoidal, Simpson’s rule, Monte Carlo.

Examples of PDFs

Uniform Distribution

Constant PDF over finite interval [a, b]:

f(x) = 1/(b - a), a ≤ x ≤ b; 0 otherwise

Exponential Distribution

Models waiting times. Parameter λ > 0:

f(x) = λ e^{-λ x}, x ≥ 0; 0 otherwise

Normal Distribution

Bell-shaped curve, parameters μ (mean), σ (std dev):

f(x) = (1/(σ√(2π))) e^{-(x - μ)^2 / (2σ^2)}

Beta Distribution

Defined on [0,1], parameters α, β > 0:

f(x) = (x^{α-1} (1-x)^{β-1}) / B(α, β), 0 ≤ x ≤ 1

Common Continuous Distributions

Normal Distribution

Symmetric, unimodal, central limit theorem foundation. Parameters μ, σ.

Exponential Distribution

Memoryless property, models time until event.

Gamma Distribution

Generalizes exponential. Parameters shape k, rate θ.

Beta Distribution

Flexible on [0,1], models proportions.

Chi-Squared Distribution

Sum of squares of k independent standard normal variables. Used in hypothesis testing.

Distribution Parameters Support
Normal μ, σ > 0 (-∞, ∞)
Exponential λ > 0 [0, ∞)
Beta α, β > 0 [0, 1]

Applications

Statistical Modeling

PDFs model real-world continuous phenomena: heights, weights, times.

Signal Processing

Noise distributions modeled by PDFs (e.g., Gaussian noise).

Reliability Engineering

Failure time distributions inform maintenance schedules.

Machine Learning

Density estimation, anomaly detection, probabilistic classifiers.

Physics and Engineering

Quantum mechanics, thermodynamics describe system states probabilistically.

Estimation and Inference

Parameter Estimation

PDF form determined by parameters estimated via Maximum Likelihood, Method of Moments.

Kernel Density Estimation

Non-parametric method to approximate PDF from data samples.

Hypothesis Testing

Test statistics distributions modeled by PDFs for p-value computation.

Confidence Intervals

PDFs used to derive intervals containing true parameter with specified probability.

Multivariate Probability Density Functions

Definition

Joint PDF f(x₁, x₂, ..., xₙ) describes likelihood of vector-valued continuous random variables.

Properties

Non-negative, integrates to one over ℝⁿ. Marginal PDFs obtained by integrating out variables.

Conditional PDFs

Define conditional density via joint and marginal PDFs:

f(x|y) = f(x,y) / f(y), f(y) > 0

Example: Bivariate Normal

Joint PDF with mean vector μ and covariance matrix Σ:

f(x) = (1 / (2π |Σ|^{1/2})) exp(-1/2 (x - μ)^T Σ^{-1} (x - μ))

PDF vs PMF

Definition

PDF: continuous variables, PMF: discrete variables.

Value Interpretation

PDF values are densities, not probabilities. PMF values are probabilities.

Probability Calculation

PDF: probability via integration; PMF: probability via summation.

Example Comparison

Discrete variable X ∈ {1, 2, 3} has PMF values; continuous variable Y on ℝ has PDF.

Limitations and Considerations

Interpretation Challenges

PDF values alone lack probabilistic meaning without integration context.

Non-existence for Some Variables

Not all distributions have PDFs (e.g., singular distributions).

Computational Complexity

Integral calculations may be difficult, require approximations.

Mixed Distributions

Some variables combine discrete and continuous parts, requiring hybrid approaches.

Summary

Probability Density Function (PDF) defines distribution of continuous random variables. Key properties: non-negativity, normalization, relation to CDF. Enables calculation of probabilities via integration. Central in statistics, engineering, science. Understanding PDFs critical for modeling uncertainty and analyzing data.

References

  • Billingsley, P. Probability and Measure, Wiley, 3rd ed., 1995, pp. 100-150.
  • Casella, G., Berger, R.L. Statistical Inference, Duxbury, 2nd ed., 2002, pp. 250-300.
  • Feller, W. An Introduction to Probability Theory and Its Applications, Vol. 2, Wiley, 1971, pp. 200-260.
  • Ross, S.M. Introduction to Probability Models, Academic Press, 11th ed., 2014, pp. 120-180.
  • Papoulis, A., Pillai, S.U. Probability, Random Variables and Stochastic Processes, McGraw-Hill, 4th ed., 2002, pp. 90-130.
!main_footer!