Generalized Extreme Value Distribution

This is also known as the Point-Percentile-Function or the inverse CDF. This function maps an input threshold, $y_0$ , to a value $y$ st the probability of $Y$ being less than or equal to $y$ is $y_p$ .

y_p = \boldsymbol{F}(y;\boldsymbol{\theta})

(10)

We can take the inverse of this function to see that it is the inverse CDF which we denote as the quantile function.

y_p = \boldsymbol{F}^{-1}(y_p;\boldsymbol{\theta}) := \boldsymbol{Q}(y_p;\boldsymbol{\theta})

(11)

where $y_p\in[0,1]$ is the data within the probability transform domain. These can be computed in closed form

\boldsymbol{Q}(y_p) = \begin{cases} \mu + \frac{\sigma}{\kappa }\left[(- \log y_p)^{-\kappa} - 1 \right] && \kappa\neq 0 \\ \mu - \sigma\log(- \log y_p ) && \kappa=0 \end{cases}

(12)

Derivation,

\kappa\neq 0

\boldsymbol{F}(y;\boldsymbol{\theta}) := y_p = \exp\left[-t(y;\boldsymbol{\theta})\right]

(13)

So let’s rearrange the terms within the equation

\begin{aligned} y_p &= \exp\left[-(1 + \kappa z)^{-1/\kappa}\right] \\ \log y_p &= -(1 + \kappa z)^{-1/\kappa}\\ -\frac{1}{\kappa}\log[1 + \kappa z] &= \log \left( -\log y_p \right) \\ \log [1 + \kappa z] &= -\kappa \log \left( -\log y_p\right) \\ 1 + \kappa z &= (-\log y_p)^{-\kappa} \\ \kappa z &= (- \log y_p)^{-\kappa} - 1\\ z &= \frac{1}{\kappa} \left[(- \log y_p)^{-\kappa} - 1 \right] \end{aligned}

(14)

Finally, we plug in our normalized variable

y = \mu + \frac{\sigma}{\kappa }\left[(- \log y_p)^{-\kappa} - 1 \right]

(15)

Derivation,

\kappa = 0

\boldsymbol{F}(y;\boldsymbol{\theta}) := y_p = \exp (-\boldsymbol{t}(y;\boldsymbol{\theta}))

(16)

So let’s rearrange the terms within the equation

\begin{aligned} y_p &= \exp (-\exp(-z)) \\ \log y_p &= -\exp(-z)\\ \exp(-z) &= -\log y_p\\ z &= -\log(- \log y_p ) \end{aligned}

(17)

Finally, we plug in our normalized variable

y = \mu - \sigma\log(- \log y_p )

(18)

Code Snippet

We can create an likelihood function for the quantile function where $\kappa\neq 0$ .

# function for kappa > 0
def quantile(, loc, scale, shape):
    level = loc - scale / shape * (1 - (- log(1 - p)) ** (- shape))
    return level

We can also create a quantile function where $\kappa=0$ .

# function for kappa = 0
def quantile(p, loc, scale):
    level = loc - scale * log(- log())
    return level

Return Period¶

We can calculate the RP using equation (8). Practically, we set this to the survival function of the GEVD (equation (6)).

1/T_R = 1 - \boldsymbol{F}(y;\boldsymbol{\theta})

(19)

To make things simpler, we can simply use the quantile function in equation (12) and set the probability to

y_p = 1 - 1 / T_R

(20)

However, if we expand this out, we get

y = \begin{cases} \mu + \frac{\sigma}{\kappa}\left\{\left[\log\left(1-1/T_R\right)\right]^{\kappa}-1\right\} && \kappa\neq 0 \\ \mu - \sigma \log \left[ - \log \left(1 - 1/T_R \right) \right] && \kappa=0 \end{cases}

(21)

Proof

In general, we can expand the RHS of the equation to include the CDF

1 - 1/T_R = \exp \left( -t(y;\boldsymbol{\theta}) \right)

(22)

and we can reduce this to be:

-\log\left(1-1/T_R\right) = t(y;\boldsymbol{\theta})

(23)

Finally, we can plug in the $\kappa \neq 0$ term to get

\begin{aligned} -\log\left(1-1/T_R\right) &= [1 + \kappa z]_+^{-1/\kappa} \\ \log\left[-\log\left(1-1/T_R\right)\right]&= -(1/\kappa)\log[1 + \kappa z] \\ \log(1+\kappa z) &= -\kappa\log\left[-\log\left(1-1/T_R\right)\right] \\ 1+\kappa z &=\left[-\log\left(1-1/T_R\right)\right]^{-\kappa} \\ \kappa z &= \left[\log\left(1-1/T_R\right)\right]^{\kappa}-1\\ z &= \frac{1}{\kappa}\left\{\left[\log\left(1-1/T_R\right)\right]^{\kappa}-1\right\} \\ \end{aligned}

(24)

Now, we can plug in the normalization factor

y = \mu + \frac{\sigma}{\kappa}\left\{\left[\log\left(1-1/T_R\right)\right]^{\kappa}-1\right\}

(25)

We can do the same thing for $\kappa = 0$ term to get

\begin{aligned} -\log (1 - 1/T_R) &= \exp(-z) \\ \log \left(-\log(1 - 1/T_R)\right) &= - z \\ z &= - \log \left(-\log(1 - 1/T_R)\right) \\ \end{aligned}

(26)

Now, we can plug in the normalization factor

y = \mu - \sigma \log \left[ - \log \left(1 - 1/T_R \right) \right]

(27)

Average Recurrence Interval¶

We can calculate the ARI using equation (14). Practically, we set this to the survival function of the GEVD (equation (6)).

1 - \exp\left(-1/\bar{T}\right) = 1 - \boldsymbol{F}(y;\boldsymbol{\theta})

(28)

To make things simpler, we can simply use the quantile function in equation (12) and set the probability to

y_p = \exp\left(-1/\bar{T}\right)

(29)

However, if we expand this out and simplify, we get

y = \begin{cases} \mu + \frac{\sigma}{\kappa}\left( \bar{T}^{\kappa}-1\right) && \kappa\neq 0 \\ \mu + \sigma\log \bar{T} && \kappa=0 \end{cases}

(30)

Proof

In general, we can expand the RHS of the equation to include the CDF

\exp(-1/\bar{T}) = \exp \left( -t(y;\boldsymbol{\theta}) \right)

(31)

and we can reduce this to be:

1/\bar{T} = t(y;\boldsymbol{\theta})

(32)

Finally, we can plug in the $\kappa \neq 0$ term to get

\begin{aligned} \bar{T} &= [1 + \kappa z]_+^{1/\kappa} \\ \kappa \log \bar{T} &= \log [ 1 + \kappa z] \\ 1 + \kappa z &= \bar{T}^{\kappa} \\ z &= \frac{1}{\kappa}\left( \bar{T}^{\kappa}-1\right) \\ \end{aligned}

(33)

Now, we can plug in the normalization factor

y = \mu + \frac{\sigma}{\kappa}\left( \bar{T}^{\kappa}-1\right)

(34)

We can do the same thing for $\kappa = 0$ term to get

\begin{aligned} 1/\bar{T} &= \exp(-z) \\ z &= \log \bar{T} \end{aligned}

(35)

Now, we can plug in the normalization factor

y = \mu + \sigma\log \bar{T}

(36)

Joint Distribution¶

We can write the likelihood that the observations, $y$ , follow the GEVD distribution. So, given some observations, $\mathcal{D}=\{y_n\}_{n=1}^{N}$ , which we believe follow the GEVD distribution, we can write the joint distribution decomposition as

p(y_{1:N};\boldsymbol{\theta}) = p(\boldsymbol{\theta}) \prod_{n=1}^N p(y_n|\boldsymbol{\theta})

(37)

This implies that the global prior parameters come from some distribution

\boldsymbol{\theta} \sim p(\boldsymbol{\theta})

(38)

and that these parameters get passed through our data likelihood term

y_n \sim p(y|\boldsymbol{\theta})

(39)

Log Probability¶

Recall the PDF for our iid samples is

p(y_{1:N}|\boldsymbol{\theta}) = \prod_{n=1}^N\frac{1}{\sigma}t\left(y_n;\boldsymbol{\theta}\right)^{\kappa+1}e^{-t\left(y_n;\boldsymbol{\theta}\right)}

(40)

where $t(y_n;\boldsymbol{\theta})$ is defined in equation (4). We can add the log term to get

\log p(\boldsymbol{y}_{1:N}|\boldsymbol{\theta}) = \sum_{n=1}^N \log p(y_n|\boldsymbol{\theta})

(41)

which we can expand as

\sum_{n=1}^N\log p(y_n;\boldsymbol{\theta}) = -N \log \sigma - (1+1/\kappa)\sum_{n=1}^N \log t\left(y_n;\boldsymbol{\theta}\right) - \sum_{n=1}^N t\left(y_n;\boldsymbol{\theta}\right)

(42)

which reduces to

\log p(\boldsymbol{y}_{1:N}|\boldsymbol{\theta}) = - N \log \sigma - (1+1/\kappa)\sum_{n=1}^N \log \left[ 1 + \kappa z_n\right]_+ - \sum_{n=1}^N \left[ 1 + \kappa z_n\right]_+^{-1/\kappa}

(43)

Code Snippet

We can create an likelihood function for this.

def gev_logpdf(x, location, scale, shape):
    # calculate location scale: z=(y−μ)/σ
    z = (x - mu) / sigma
    # calculate t(z) = 1+κz
    t = 1.0 + shape * z
    # grab max value
    t = np.max(t, 0)
    # term 1: −log σ
    t1 = - np.log(sigma)
    # term 2: − (1+κz) ** −1/κ
    t2 = - np.power(t, -1.0 / xi)
    # term 3: - (1+1/κ)log(1+κz)
    t3 = - (1.0 / xi + 1.0) * np.log(t) 
    return  t1 + t2 + t3

Instead of actually calculating the full scheme, we can simply apply this

y: Array["T"] = ...
params: PyTree = ...
# apply vectorized operation
nll: Array["T"] = vectorize(gev_logpdf, y, params)
# take the sume
nll: Scalar = sum(nll)

Proof of Log-Probability,

\kappa\neq 0

We are interested in calculating the log probability function

\log p(\boldsymbol{y}_{1:N}|\boldsymbol{\theta}) = \sum_{n=1}^N \log p(y_n|\boldsymbol{\theta})

(44)

Let’s consider only a single input, $y_n$ . We plug in $\boldsymbol{t}(y_n;\boldsymbol{\theta})$ to the likelihood term.

p(y_n|\boldsymbol{\theta}) = \frac{1}{\sigma}t\left(y_n;\boldsymbol{\theta}\right)^{\kappa+1}e^{-t\left(y_n;\boldsymbol{\theta}\right)}

(45)

Now, we apply the log function

\log p(y_n|\boldsymbol{\theta}) = \log \left(\frac{1}{\sigma}t\left(y_n;\boldsymbol{\theta}\right)^{\kappa+1}e^{-t\left(y_n;\boldsymbol{\theta}\right)}\right)

(46)

We can separate each of the terms

\log p(y_n|\boldsymbol{\theta}) = \log \left(\frac{1}{\sigma}\right) + \log\left(t\left(y_n;\boldsymbol{\theta}\right)^{\kappa+1}\right) + \log \left(e^{-t\left(y_n;\boldsymbol{\theta}\right)}\right)

(47)

Now we can do some log rules to simplify the terms

\log p(y_n|\boldsymbol{\theta}) = -\log \sigma + (\kappa+1) \log t\left(y_n;\boldsymbol{\theta}\right) - t\left(y_n;\boldsymbol{\theta}\right)

(48)

We can plug in the $\boldsymbol{t}(y;\boldsymbol{\theta})$ to get a complete form. Let $z=\frac{y-\mu}{\sigma}$

\log p(y_n|\boldsymbol{\theta}) = -\log \sigma + (\kappa+1) \log \left[ 1 + \kappa z\right]_+^{-1/\kappa} - \left[ 1 + \kappa z\right]_+^{-1/\kappa}

(49)

We can do some final simplification

\log p(y_n|\boldsymbol{\theta}) = -\log \sigma - (1+1/\kappa)\log \left[ 1 + \kappa z_n\right]_+ - \left[ 1 + \kappa z_n\right]_+^{-1/\kappa}

(50)

Now, we can plug in the sum

\log p(\boldsymbol{y}_{1:N}|\boldsymbol{\theta}) = \sum_{n=1}^N \left( -\log \sigma - (1+1/\kappa)\log \left[ 1 + \kappa z_n\right]_+ - \left[ 1 + \kappa z_n\right]_+^{-1/\kappa} \right)

(51)

We can factor out the constant values

\log p(\boldsymbol{y}_{1:N}|\boldsymbol{\theta}) = - N \log \sigma - (1+1/\kappa)\sum_{n=1}^N \log \left[ 1 + \kappa z_n\right]_+ - \sum_{n=1}^N \left[ 1 + \kappa z_n\right]_+^{-1/\kappa}

(52)

Reparameterization¶

In this instance, we are assuming that there is a threshold parameter, $y_0$ . We can write the reparameterization of this distribution as

\begin{aligned} \mu &= \mu_{y_0} + \frac{\sigma_{y_0}}{\kappa}\left(1 - \lambda_{y_0}^{-\kappa} \right) && && \sigma =\sigma_{y_0}\lambda_{y_0}^{-\kappa} && && \kappa\neq0 \\ \mu &= \mu_{y_0} + \sigma_{y_0}\ln\lambda_{y_0} && && \sigma =\sigma_{y_0}\lambda_{y_0}^{-\kappa} && && \kappa=0 \\ \end{aligned}

(53)

Rescaling¶

\delta_h = \frac{h}{h^*}

(54)

where $h$ is in years and $h^*$ is in days. We can write all of the parameters with these rescaled ones

\begin{aligned} \mu^* &= \mu + \frac{1}{\kappa}\left[\sigma^*(1-\delta_h^{-\kappa}) \right] \\ \sigma^* &= \sigma\delta_h^{\kappa} \\ \kappa^* &= \kappa \end{aligned}

(55)

Literature Review¶

Theory. Leadbetter et al. (1983) is another very popular book.

Applications. García et al. (2023) investigated annual temperature extremes in Extremadura, Spain where they applied a Gaussian process to account for spatial dependencies for the GEVD. Räty et al. (2022) looked at sea level extremes in the Finnish coastal region where they applied a Bayesian hierarchical model to account for spatial dependencies for the GEVD.

Algorithms. Moins et al. (2023) look at a reparameterization framework to improve the convergence of the MCMC inference algorithms. Koh et al. (2021) investigate spatiotemporal extremes of daily large wildfires in the French Mediterranean where they employ a Bayesian Hierarchical model for a PP for the events and a GPD for the marks.

References¶

Leadbetter, M. R., Lindgren, G., & Rootzén, H. (1983). Extremes and Related Properties of Random Sequences and Processes. In Springer Series in Statistics. Springer New York. 10.1007/978-1-4612-5449-2
García, J. A., Acero, F. J., & Portero, J. (2023). A Bayesian hierarchical spatio-temporal model for extreme temperatures in Extremadura (Spain) simulated by a Regional Climate Model. Climate Dynamics, 61(3–4), 1489–1503. 10.1007/s00382-022-06638-x
Räty, O., Laine, M., Leijala, U., Särkkä, J., & Johansson, M. M. (2022). Bayesian hierarchical modeling of sea level extremes in the Finnish coastal region. 10.5194/nhess-2021-410
Moins, T., Arbel, J., Girard, S., & Dutfoy, A. (2023). Reparameterization of extreme value framework for improved Bayesian workflow. Computational Statistics & Data Analysis, 187, 107807. 10.1016/j.csda.2023.107807
Koh, J., Pimont, F., Dupuy, J.-L., & Opitz, T. (2021). Spatiotemporal wildfire modeling through point processes with moderate and extreme marks. arXiv. 10.48550/ARXIV.2105.08004