Question: Given a population of detected plumes (and the ones we missed), what is the true total emitted mass over a region and time window?
This is the inventory-grade output of plumax — the number that gets reported into national greenhouse-gas inventories, climate models, and policy dashboards. It also requires the most care, because the satellite catalog you start from is systematically biased: detection thinning means the very things you can’t see (small, frequent leaks) are exactly the things that matter for total mass.
The missing-mass paradox¶
The full Monte Carlo proof is in methane_pod/notebooks/03_missing_mass_paradox. The result, in one sentence:
These two biases pull in opposite directions, but they don’t cancel — averaging the wrong thing over the wrong sample size gives you the wrong total. The corrected estimator has to model the thinning explicitly.
The corrected total-mass estimator¶
Given a TMTPP fit (Tier V.B) with posterior :
This is the un-thinned total — what would be emitted regardless of detection. Compare to the naive estimator:
with
is biased low because:
- (some events missed).
- (detected events are systematically bigger — heavy tail of ).
The two errors compound rather than cancel: the regional total is undercounted, and the per-event mean is inflated. Inverting the POD model is the only way to recover an unbiased total.
Posterior over total mass¶
With NUTS samples , the posterior over is:
Reported as posterior median + 95% credible interval. Both integrals are tractable for the standard intensity / mark choices (closed-form for constant λ + lognormal ; quadrature otherwise).
Validation strategy¶
- MC ground truth (bias direction). Reproduce the qualitative result of the paradox notebook: simulate a known , compute exactly, and check that the corrected estimator recovers while is biased low.
- MC ground truth (calibration). Across 1000 replicates of the previous test, the 95% credible interval on should contain ~95% of the time.
- Per-satellite sensitivity. Same population, two different (e.g. GHGSat-floor GHGSat Inc., 2016 vs. TROPOMI-floor Veefkind et al., 2012) → corrected estimator should give the same posterior. The naive estimator gives wildly different . This is the test that proves the correction is doing its job.
- Real-data benchmark. Once
07_pod_fitting_mcmclands with IMEO + Tanager data, compare the corrected total for a well-studied basin (Permian) to published bottom-up inventories (U.S. Environmental Protection Agency, 2024Scarpelli et al., 2020, GHGRP) and to top-down inverse-modelling estimates (Maasakkers et al., 2023Jacob et al., 2022, Sherwin et al.). They will disagree; the question is whether the corrected estimator is closer to the top-down number than the naive one.
Module layout¶
Table (1):Tier V.D module layout — concern, target module, status.
| Concern | Module | Status |
|---|---|---|
| Missing-mass MC simulator | methane_pod.paradox | ✓ (NumPy) |
| Posterior fit | methane_pod.fitting | ✓ (synthetic); 🚧 (real data) |
| estimator + uncertainty | plume_simulation.population.totals | ☐ |
| Per-satellite calibration loader | plume_simulation.population.satellite_pod | ☐ |
| Multi-satellite fusion | plume_simulation.population.fusion | ☐ |
Multi-satellite fusion (Tier V.D extension)¶
For a region observed by satellites, each with its own POD, the unified detection probability is:
This is the “any satellite saw it” probability. Folds into the TMTPP likelihood as a single replacement of with . Adds one strong assumption: detections by different satellites are conditionally independent given the leak size — defensible at the population level, possibly violated for clustered super-emitters.
Open questions¶
- GHGSat Inc. (2016). GHGSat WAF-P imaging spectrometer constellation. https://www.ghgsat.com/
- Veefkind, J. P., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H. J., de Haan, J. F., Kleipool, Q., & others. (2012). TROPOMI on the ESA Sentinel-5 Precursor: a GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications. Remote Sensing of Environment, 120, 70–83.
- U.S. Environmental Protection Agency. (2024). Inventory of U.S. Greenhouse Gas Emissions and Sinks: 1990–2022. EPA 430-R-24-004. https://www.epa.gov/ghgemissions/inventory-us-greenhouse-gas-emissions-and-sinks
- Scarpelli, T. R., Jacob, D. J., Maasakkers, J. D., Sulprizio, M. P., Sheng, J.-X., Rose, K., Romeo, L., Worden, J. R., & Janssens-Maenhout, G. (2020). A global gridded (0.1° × 0.1°) inventory of methane emissions from oil, gas, and coal exploitation based on national reports to the United Nations Framework Convention on Climate Change. Earth System Science Data, 12(1), 563–575. 10.5194/essd-12-563-2020
- Maasakkers, J. D., Mcduffie, E. E., Sulprizio, M. P., Chen, C., Schultz, M., Brunelle, L., Thrush, R., Steller, J., Sherry, C., Jacob, D. J., & others. (2023). A gridded inventory of annual 2012-2018 U.S. anthropogenic methane emissions. Environmental Science & Technology, 57(43), 16276–16288. 10.1021/acs.est.3c05138
- Jacob, D. J., Varon, D. J., Cusworth, D. H., Dennison, P. E., Frankenberg, C., Gautam, R., Guanter, L., Kelley, J., McKeever, J., Ott, L. E., Poulter, B., & others. (2022). Quantifying methane emissions from the global scale down to point sources using satellite observations of atmospheric methane. Atmospheric Chemistry and Physics, 22(14), 9617–9646. 10.5194/acp-22-9617-2022
- Varon, D. J., Jacob, D. J., McKeever, J., Jervis, D., Durak, B. O. A., Xia, Y., & Huang, Y. (2018). Quantifying methane point sources from fine-scale satellite observations of atmospheric methane plumes. Atmospheric Measurement Techniques, 11(10), 5673–5686. 10.5194/amt-11-5673-2018