Skip to article frontmatterSkip to article content

Simulations

CSIC
UCM
IGEO

This is a summary of different datasets available from the geoscience community. I want to understand some key differences in how they are constructed as well as some key similarities that can be exploited for other tasks.


TLDR

There are essentially 2 types of datasets that are available within the community: simulations and reanalysis.

Simulations. These are solutions to dynamical models under different forcing conditions. There are a large number of simulations that exist from different institutions under different assumptions and different numerical implementations. In addition, some include multiple runs. The CMIP-X project is a large collection of simulations from different institutions.

Reanalysis. These are simulations that are assimilated with real observations. The ECMWF Reanalysis Version 5 (ERA5) is a large collection of reanalysis data available.


Simulations

In this section, we will primarily talk about the CMIP6 project and the simulations available therein.

For notation, we are considering a vector-valued field, u\boldsymbol{u}, as a vector-valued field that varies in a spatial domain, Ω and a time domain, [t0,t1][t_0, t_1].

u=u(x,t)xΩuRDst[t0,t1]R+\begin{aligned} \boldsymbol{u} &= \boldsymbol{u}(\mathbf{x},t) && && && \mathbf{x}\in\mathcal{\Omega}_u\subseteq\mathbb{R}^{D_s} && t\in[t_0, t_1]\subseteq\mathbb{R}^+ \end{aligned}

In the case of climate simulations, these variables can be considered any number of physical quantities on the land, atmosphere and ocean surface. For example, the temperature, wind, and precipitation. The CMIP6 simulations are particularly impressive because attempt to encapsulate almost all variables that are important for modeling the full climate system on Earth. For a full list of variables, see the CMIP6 simulations located at the climate data store.

Recall, that every PDE can be written in a form of the partial derivative of a field in time, tu\partial_t\boldsymbol{u}, that’s equivalent to some parameterized spatial operators, Fθ\boldsymbol{F_\theta}.

tu(x,t)=F[u;θ](x,t)\partial_t \boldsymbol{u}(\mathbf{x},t) = \boldsymbol{F}[\boldsymbol{u};\boldsymbol{\theta}](\mathbf{x}, t)

F\boldsymbol{F} is set of all operators that act on a field u\boldsymbol{u} and θ\boldsymbol{\theta} represents all parameters that stem from this PDE. This is the set of all operators, i.e., FF\boldsymbol{F}\in\mathcal{F}, which could be simple linear differentiable operators like the gradient, \boldsymbol{\nabla}, or the Laplacian, 2\boldsymbol{\nabla}^2, or non-linear operators like the advection term in the material derivative, uq\boldsymbol{u}\cdot\boldsymbol{\nabla}q.

Regardless of the operation, these are meant to encapsulate all physical processes that we wish to include within our dynamical system. In addition, there could be other operators like parameterizations, i.e., simplified equations that represent a dynamical system. It represents a model of the effect of a process instead of a dynamical system resolving the process directly. Parameterizations are included for a variety of reasons including increasing the speed of the simulation or making hard assumptions about the contribution of certain factors. We could also include forcing functions which attempt to incorporate external (or internal) operations on the system. Lastly, there could be stochasticity in the system which could be from unresolved processes or inherent variability within the system.

In the case of the CMIP6 simulations, there are many variations of these operators ranging from different dynamical models to different forcings. The remaining sections will outline some key differences to distinguishing the many available simulations for the CMIP6 project. Essentially, we will decompose the operator, F\boldsymbol{F}, into core components that highlight the key differences across different the simulations.


Dynamical Components

Random image of the beach or ocean!

Figure 1:Example of a decomposition of the Earth system based on a domain. [Source]

The first way to decompose the operator is into the core dynamical system that represents processes within the Earth system.

F=FDynamical\boldsymbol{F} = \boldsymbol{F}_\text{Dynamical}

While it is ideal to have all dynamical processes under a single umbrella, it’s more feasible to decompose each into subcomponents. We can do this by decomposing the Earth domain into different sections. There are potentially many ways one can partition the Earth into sections as we can be as granular as possible when describing a dynamical system. However, the Earth is a massive scale and we need to put a limit due to computational constraints.

There are 3 key partitions that are prevalent in the CMIP6 simulations include the atmosphere, the land, and the ocean.

FDynamical=Atmospheric+Land+Ocean\boldsymbol{F}_\text{Dynamical} = \text{Atmospheric} + \text{Land} + \text{Ocean}

On a high-level, this is a reasonable decomposition. However, depending on your field and beliefs, one could easily decompose this field into other specific dynamical systems that are potentially influential. For example, we could include a vegetation dynamical system and a (sea) ice dynamical system.

In the case of the CMIP6 project, the AMIP6 models only include the atmospheric portion and parameterize the ocean and land (?). However, all other scenarios include each of these dynamical systems within their framework.


Forcings

Random image of the beach or ocean!

Figure 2:A figure from [Meehl et. al., 2004] showing the anthropogenic plus natural vs. just natural radiative forcing temperature change vs. observed global surface temperature increase. [Source]

We can add another term to our PDE which takes into account any external natural forcings that we think are necessary.

F=FDynamical+FForcings\boldsymbol{F} = \boldsymbol{F}_\text{Dynamical} + \boldsymbol{F}_\text{Forcings}

A forcing is essentially an external (or internal) factor that is independent of our defined dynamical system. For example, the wind is a factor of the atmosphere which influences the dynamical system of the ocean system and vice versa.

For the CMIP6 project, we consider 2 broad categories of forcings: natural and anthropogenic.

FForcings=Natural+Anthropogenic\boldsymbol{F}_\text{Forcings} = \text{Natural} + \text{Anthropogenic}

Natural is anything that we consider is a natural process, e.g., solar radiation, albedo, Milankovitch cycles, natural oscillations, sun spots, and volcanoes. Anthropogenic is anything that we consider is a process or effect that was caused by humans, e.g., greenhouse gases.

One important factor is that forcings come from observations, yobs\boldsymbol{y}_{obs}, when they are available.

FForcings:=FForcings[yobs;θ](x,t)t[1850,2014]R+FForcings:=0t[2015,2100]R+\begin{aligned} \boldsymbol{F}_\text{Forcings} &:= \boldsymbol{F}_\text{Forcings}[\boldsymbol{y}_{obs};\boldsymbol{\theta}](\mathbf{x},t) && && t\in[1850,2014]\subseteq\mathbb{R}^+ \\ \boldsymbol{F}_\text{Forcings} &:= 0 && && t\in[2015,2100]\subseteq\mathbb{R}^+ \end{aligned}

The first row showcases the simulations in the historical period t[1850,2014]t\in[1850,2014] where observations are available so these are included within the forcing terms. However, the second row showcases the simulations in the projection period t[2015,2100]t\in[2015,2100] where observations are not available. So the forcing terms are 0.

In the context of the CMIP6 project, they have different forcings available:

  • piControl run will include none of the forcings (neither anthropogenic nor natural).
  • Historcal-Natural will only include the natural forcing.
  • Historical will include both the anthropogenic forcing and the natural forcing.

Note: A key component to note is that these are forcings using observations, and not assimilation.


Summary

We can summarize a number of the different simulations that are available from the CDS.

F=Atmospheric+Land+OceanDynamical+Natural+AnthropogenicForcing\begin{aligned} \boldsymbol{F} &= \underbrace{\text{Atmospheric} + \text{Land} + \text{Ocean}}_{\text{Dynamical}} + \underbrace{\text{Natural} + \text{Anthropogenic}}_{\text{Forcing}} \end{aligned}

Below, we have a summary table of the different CMIPX simulations that are available and what distinguishes them.

Table 1:Table of Different CMIP6

SimulationDynamicalObservation ForcingAnthropogenic Forcing
piControlAtm, Land, OcnNoNo
Historical-NaturalAtm, Land, OcnYesNo
HistoricalAtm, Land, OcnYesYes
GHGAtm, Land, OcnYesYes
AMIP5Atm, LandYesYes

Misc.

There are other important factors to consider when looking at the models.


Ensembles

Many institutions offer a number of different runs, i.e., ensembles.


Scenarios

The strength of the forcing is the focus for the projections. We don’t have observations of the anthropogenic activity, so we need to prescribe this forcing. In general, there are a number of Shared Socioeconomic Pathways (SSP), a.k.a., scenarios provided ranging from optimistic to pessimistic.

Random image of the beach or ocean!

Figure 3:Example of projections with different scenarios for surface air temperature [Source]


Pressure Levels

In the database, there is an option to choose the CMIP6 simulations at pressure levels or at single levels.


List of Datasets

Below is a large list of my datasets.

A quick list of current reanalysis, see this link. They have a comprehensive list of reanalysis available for the atmosphere and the ocean.


Climate

CMIP5 | CMIP6. Some climate projections for the land, atmosphere, and ocean. They feature a historical period from 1850-2014 and a projection period from 2015-2100. The anthropogenic forcing was dictated using different SSP scenarios.


Atmosphere


Ocean

References
  1. Meehl, G. A., Washington, W. M., Ammann, C. M., Arblaster, J. M., Wigley, T. M. L., & Tebaldi, C. (2004). Combinations of Natural and Anthropogenic Forcings in Twentieth-Century Climate. Journal of Climate, 17(19), 3721–3727. https://doi.org/10.1175/1520-0442(2004)017<;3721:conaaf>2.0.co;2