
Some Literature that might be useful


Research Questions

This is inspired by the talk by Steve Penny - Recording | Slides

Reanalysis vs Simulations vs Observations

Observations and reanalysis are inherently imperfect data sources, often with uncharacterized uncertainties.

  • Q1: Are reanalysis datasets an adequate source of training data for ML?
  • Q2: Are pure simulation datasets more effective data for ML?
  • Q3: How will biases & systematic errors be handled?
  • Q4: Can we learn directly from observations plus basic physics constraints?
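
One way to read Q4 is as a loss that adds a physics residual to the usual data misfit. The sketch below is only illustrative. It assumes a toy decay equation du/dt = -k u, four hypothetical noisy observations, and hypothetical helpers `physics_informed_loss` and `interpolate`; none of these come from the talk. It compares a physically consistent trajectory with one that simply interpolates the observations.

```python
import math


def physics_informed_loss(u, dt, obs, k, weight=1.0):
    """Mean squared data misfit plus weighted mean squared residual of du/dt = -k*u."""
    # Data term: compare the trajectory with observations at the observed indices.
    data_term = sum((u[i] - y) ** 2 for i, y in obs.items()) / len(obs)
    # Physics term: midpoint discretization of du/dt + k*u = 0 on each interval.
    residuals = [
        (u[i + 1] - u[i]) / dt + k * 0.5 * (u[i + 1] + u[i])
        for i in range(len(u) - 1)
    ]
    physics_term = sum(r * r for r in residuals) / len(residuals)
    return data_term + weight * physics_term


def interpolate(obs, n):
    """Piecewise-linear trajectory through the observations (ignores the physics)."""
    idx = sorted(obs)
    u = []
    for i in range(n):
        for left, right in zip(idx, idx[1:]):
            if left <= i <= right:
                frac = (i - left) / (right - left)
                u.append(obs[left] + frac * (obs[right] - obs[left]))
                break
    return u


# Assumed toy setup: decay rate, time step, number of grid points.
k, dt, n = 0.5, 0.1, 51
# Hypothetical noisy observations, keyed by grid index.
obs = {0: 1.02, 10: 0.59, 25: 0.30, 50: 0.08}

# Candidate 1: the analytic solution exp(-k*t), which satisfies the physics.
physical = [math.exp(-k * i * dt) for i in range(n)]
# Candidate 2: a curve that fits the observations but not the equation.
data_only = interpolate(obs, n)

loss_physical = physics_informed_loss(physical, dt, obs, k)
loss_data_only = physics_informed_loss(data_only, dt, obs, k)
print(loss_physical, loss_data_only)
```

In this toy setup the interpolated curve has an essentially zero data misfit but a larger physics penalty, so the combined loss prefers the physically consistent trajectory. The relative `weight` of the two terms is a tuning choice that the sketch does not address.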



PINNs in the Wild

[Agarwal et al., 2022]

Hybrid Models

As numerical forecast models are modernized (e.g. rewritten in languages that support differentiation and designed to take advantage of GPUs), can AI/ML solutions maintain a competitive edge over conventional modeling in terms of computational cost?

[Belochitski & Krasnopolsky (2021), Dresdner et al. (2022), Frerix et al. (2021), Kochkov et al. (2021)]

Model Error Estimation

How much state-dependent (conventional) model error can we learn from comparison with observations? How do we separate system observation errors from systematic model forecast errors?

[Bonavita & Laloyaux (2022), Laloyaux et al. (2022), Pathak et al. (2018), Arcomano et al. (2022)]
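
As a minimal sketch of the first question, a state-dependent error can be estimated by regressing forecast-minus-observation differences on the model state. Everything below is an assumption for illustration: the scalar toy dynamics `true_step` and `model_step`, the linear error model, the noise level, and the helper `fit_linear_error_model`. It is not the method of any of the cited papers.

```python
import random


def fit_linear_error_model(states, errors):
    """Ordinary least squares fit of error ~ intercept + slope * state."""
    n = len(states)
    mean_x = sum(states) / n
    mean_e = sum(errors) / n
    cov = sum((x - mean_x) * (e - mean_e) for x, e in zip(states, errors))
    var = sum((x - mean_x) ** 2 for x in states)
    slope = cov / var
    intercept = mean_e - slope * mean_x
    return intercept, slope


def true_step(x):
    """Hypothetical 'true' dynamics, unknown to the forecaster."""
    return 0.9 * x + 0.05


def model_step(x):
    """Hypothetical imperfect forecast model."""
    return 0.8 * x


rng = random.Random(0)  # fixed seed so the sketch is reproducible
states = [rng.uniform(-1.0, 1.0) for _ in range(200)]
# Observations of the next state, with assumed Gaussian observation noise.
observed_next = [true_step(x) + rng.gauss(0.0, 0.01) for x in states]
# Observation-minus-forecast differences mix model error and observation error.
errors = [y - model_step(x) for x, y in zip(states, observed_next)]

intercept, slope = fit_linear_error_model(states, errors)
print(intercept, slope)  # should be close to 0.05 and 0.1 in this toy setup
```

The regression recovers the model error here only because the observation noise was assumed to be unbiased and independent of the state. If the observations carried their own bias, it would be absorbed into the fitted intercept, which is exactly the separation problem raised in the second question.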

Subgrid Parameterization

This is an instance of

[Frezat et al., 2022]

Better Metrics

[Frezat et al., 2021]

Operational Center

Observation Datasets



Extrapolation Datasets



Reanalysis Datasets


Plants n Things

Water n Things

Data Assimilation


Back-and-Forth Nudging

