Calculating Extremes for Spatiotemporal Data

In this section, we will look at the 3 most basic methods for calculate extreme values: the block maxima (BM), peak-over-threshold (POT), and point process (PP) methods. We will do detailed demonstrations showcasing how we can extract these extreme values through the lens of discretization via histograms. With histograms, we will show how we can extract extreme values through for each of these methods as a combination of thresholding, selecting maximum values, and counting.

Make some simple Spatial Correlation Analysis
Make some temporal correlation analysis

Regular Discretized - Use a histogram transform with predefined
Irregular Discretized - Use a search for values above a threshold

To cover all cases, we will need tools for the following operations:

defining a spatiotemporal block
Selecting a threshold
Selecting maximum/minimum values
Counting the number occurrences of the values above a threshold within a spatiotemporal block
Summary statistic of occurrences, eg mean, median, etc

Appendix

Applied Discretization Strategies - Cartesian, Rectilinear, Curvilinear, Irregular | boost-histogram

References

spatial correlation analysis

Temporal correlation analysis
Post analysis of thresholds and discretization schemes

Block Maxima¶

We define a spatiotemporal block and we take the maximum count within a spatiotemporal block.

Algorithm

Define spatiotemporal block
Select maximum/minimum values

Clon_coords: Array[“”] = …
Clat_coords: Array[“”] = …

Peak-Over-Threshold¶

We select the values that are over a predefined threshold and discard the rest. We also have the option to discretize this further by taking the maximum within a pre-defined spatiotemporal block. The POT method is a discretized version of the block maxima method, i.e., it is the infinite limit as the size of spatiotemporal block goes to zero whereby each individual point is a maximum. This will result in an irregular grid because there is no guarantee that only one maximum occurrence above a pre-defined threshold within a pre-defined spatiotemporal block. In addition, one could have irregular blocks/shapes but this makes processing much harder. One could further discretize this to count exceedences (and intensity).

Algorithm

Define maximum/minimum threshold values
Select values above/below threshold
Define spatiotemporal block (Optional)
Summary statistic of values within spatiotemporal block (Optional)

Point Processes¶

This method is similar to the POT method with the spatiotemporal blocks. However, we also count the number of exceedences and take a summary statistic of the values within the block.

In this section, we will do a deeper dive into how one can further preprocess the data to remove extreme values

Spatially Aggregate Data (Optional)
Temporally Aggregate Data (Optional)
Stitching, SuperImposing, Aggregating, Batch Sampling - PoPPY

Examples

1D Data Recorded in a sequence of distance or time
2D sampling for spatial interpolation
3D sampling for spatial interpolation
Spatiotemporal

Spatial Scale

Changes —> Mean, Variance, Tails, Range, Distribution Shape
Tools —> Variogram, Predict the scale
Recale:
- DownScale/SuperResolution/UpSample
- Upscaling/Coarsen/DownSample —> Average Arithmetic, Power Law Average, Harmonic, Geometric
Aggregations
Creating location weights - https://youtu.be/k9VbyqafnPk?si=biWcgcqwuXVe8RfG

# filtering - remove high/low frequency signals
# spatiotemporal peaks - spatial,temporal dependencies
# remove climatology - temporal dependencies
# spatial aggregation - spatial dependencies
# rolling mean - spatial, temporal dependencies

Cookbook

Spatial Statistics with Declustering Weights —> Grid Cell Size vs Declustered Mean
Lat-Lon Spatial Averages using weights at poles

Example PsuedoCode¶

First, we need some spatiotemporal data. This data could be any spatiotemporal field, $y=y(\mathbf{s},t)$ , representing the extreme values we wish to extract.

y: Array["Dt Dy"] = ...

Now, we need to do some preprocessing steps to ensure that we get an iid dataset. We will remove some of the excess effects.

# filter high frequency signals
y: Array["Dt Dy"] = low_pass_filter(y, params)
# remove climatology
climatology["Dclim"] = calculate_climatology(y, reference_period, params)
y: Array["Dt Dy"] = remove_climatology(y, climatology, params)
# spatial aggregation
y: Array["Dt"] = spatial_aggregator(y, params)

Now, we need to select some extreme values.

y_max: Array["Dt"] = block_maximum(y, params)