Learn More

Empirical Fit¶

P-P Plot¶

Random image of the beach or ocean! — Figure 1:Example of p-p plot from wikipedia. [Source: Wikipedia]

This is also known as the probability-probability plot, the percent-percent plot, or the p-value plot. It is used for assessing how closely two datasets agree and/or the residuals. See wiki for more details.

Note: this plot only works when we have the theoretical cumulative distribution.

Q-Q Plot¶

This is the quantile-quantile plot. This is a graphical tool to compare two probability distributions by comparing their quantiles versus one another. See wiki for more details.

Predictions¶

Return Period¶

We can extrapolate an distribution by computing the $p$ -return level (see blog). This represents the high quantile for which the probability that the maximym exceeds this quantile is $1/p$ . This concept is known as the return period. The return period of a particular event is the inverse probability that the event will be exceeded in any given year. For example, the $p$ -year return level is associated with a return period of $p$ years. The $1/p$ return level is the $1-p$ quantile of an arbitrary quantile function for a distribution.(Mahmoudian & Mahammadzadeh, 2014).

To compute the return level of $y_p$ s.t. the distribution, $\text{Quantile}(p)=1-p$ . We can solve for $p$ which results in the following formula.

\begin{aligned} \text{Distribution}(y_p) &= 1 - p\\ y_p &= \text{Distribution}^{-1}(1-p)=\text{Quantile}(1-p) \end{aligned}

(1)

This is also known as the recurrence interval or repeat interval which is an average time or an estimated average time between events. In the case of extreme events, these can include floods, heatwaves, or droughts. See wiki for more details.

Mean Excess Function¶

\mathbb{E}(Y - u|Y>u = \frac{\sigma_u - u\xi}{1 - \xi})

(8)

where the scale parameter, $\sigma_u$ , varies linearly in the threshold $u$ and the shape parameter ξ is fixed wrt the threshold $u$ .

Expected Shortfall¶

Ultimately, the goal of EVT is to compute the value at risk. We can derive the expected shortfall (conditional rv).

y_p = \mu - \frac{\sigma}{\xi}\left[ 1 - \left(\frac{N}{N_\mu} (1 - p) \right)^{-\xi} \right]

(9)

where:

$y_p$ - the random variable
μ - the threshold (in percentage terms)
$N$ - number of observations
$N_\mu$ - # of observations that exceed the threshold

We can define the expected shortfall (ES) as:

\text{ES} = \frac{y_p}{1 -\xi} + \frac{\sigma - \xi\mu}{1 - \xi}

(10)

References¶

Mahmoudian, B., & Mohammadzadeh, M. (2014). A spatio-temporal dynamic regression model for extreme wind speeds. Extremes, 17(2), 221–245. 10.1007/s10687-014-0180-2

Basics

Calculating Extremes

Covariate Effects

II - Covariates Effects

Extreme Value Modeling Evaluation