Tier V.D — Total emission estimation

Question: Given a population of detected plumes (and the ones we missed), what is the true total emitted mass over a region and time window?

This is the inventory-grade output of plumax — the number that gets reported into national greenhouse-gas inventories, climate models, and policy dashboards. It also requires the most care, because the satellite catalog you start from is systematically biased: detection thinning means the very things you can’t see (small, frequent leaks) are exactly the things that matter for total mass.

The missing-mass paradox¶

The full Monte Carlo proof is in methane_pod/notebooks/03_missing_mass_paradox. The result, in one sentence:

These two biases pull in opposite directions, but they don’t cancel — averaging the wrong thing over the wrong sample size gives you the wrong total. The corrected estimator has to model the thinning explicitly.

The corrected total-mass estimator¶

Given a TMTPP fit (Tier V.B) with posterior $(\lambda, f, P_d)$ :

M_\text{total}(T) \;=\; \mathbb{E}[N_\text{true}(T)] \cdot \mathbb{E}[Q] \;=\; \left(\int_{0}^{T} \lambda(t)\, \mathrm{d}t\right) \cdot \left(\int Q\, f(Q)\, \mathrm{d}Q\right)

((1))

This is the un-thinned total — what would be emitted regardless of detection. Compare to the naive estimator:

M_\text{naive}(T) \;=\; \sum_{i \in \mathcal{D}} Q_i \;\approx\; \mathbb{E}[N_\text{detected}(T)] \cdot \mathbb{E}[Q \mid \text{detected}]

((2))

with

\mathbb{E}[N_\text{detected}(T)] \;=\; \int_{0}^{T} \lambda(t) \int P_d(Q)\, f(Q)\, \mathrm{d}Q\, \mathrm{d}t.

((3))

$M_\text{naive}$ is biased low because:

$\mathbb{E}[N_\text{detected}] < \mathbb{E}[N_\text{true}]$ (some events missed).
$\mathbb{E}[Q \mid \text{detected}] > \mathbb{E}[Q]$ (detected events are systematically bigger — heavy tail of $f$ ).

The two errors compound rather than cancel: the regional total is undercounted, and the per-event mean is inflated. Inverting the POD model is the only way to recover an unbiased total.

Posterior over total mass¶

With NUTS samples $(\lambda^{(s)}, f^{(s)}, P_d^{(s)})$ , the posterior over $M_\text{total}(T)$ is:

M_\text{total}^{(s)}(T) \;=\; \left(\int_{0}^{T} \lambda^{(s)}(t)\, \mathrm{d}t\right) \cdot \left(\int Q\, f^{(s)}(Q)\, \mathrm{d}Q\right)

((4))

Reported as posterior median + 95% credible interval. Both integrals are tractable for the standard intensity / mark choices (closed-form for constant λ + lognormal $f$ ; quadrature otherwise).

Validation strategy¶

MC ground truth (bias direction). Reproduce the qualitative result of the paradox notebook: simulate a known $(\lambda^{*}, f^{*}, P_d^{*})$ , compute $M_\text{true}$ exactly, and check that the corrected estimator recovers $M_\text{true}$ while $M_\text{naive}$ is biased low.
MC ground truth (calibration). Across 1000 replicates of the previous test, the 95% credible interval on $M_\text{total}$ should contain $M_\text{true}$ ~95% of the time.
Per-satellite sensitivity. Same population, two different $P_d$ (e.g. GHGSat-floor GHGSat Inc., 2016 vs. TROPOMI-floor Veefkind et al., 2012) → corrected estimator should give the same $M_\text{total}$ posterior. The naive estimator gives wildly different $M_\text{naive}$ . This is the test that proves the correction is doing its job.
Real-data benchmark. Once 07_pod_fitting_mcmc lands with IMEO + Tanager data, compare the corrected total for a well-studied basin (Permian) to published bottom-up inventories (U.S. Environmental Protection Agency, 2024Scarpelli et al., 2020, GHGRP) and to top-down inverse-modelling estimates (Maasakkers et al., 2023Jacob et al., 2022, Sherwin et al.). They will disagree; the question is whether the corrected estimator is closer to the top-down number than the naive one.

Module layout¶

Table (1):Tier V.D module layout — concern, target module, status.

Concern	Module	Status
Missing-mass MC simulator	`methane_pod.paradox`	✓ (NumPy)
Posterior fit	`methane_pod.fitting`	✓ (synthetic); 🚧 (real data)
$M_\text{total}$ estimator + uncertainty	`plume_simulation.population.totals`	☐
Per-satellite calibration loader	`plume_simulation.population.satellite_pod`	☐
Multi-satellite fusion	`plume_simulation.population.fusion`	☐

Multi-satellite fusion (Tier V.D extension)¶

For a region observed by $K$ satellites, each with its own POD, the unified detection probability is:

P_d^{\cup}(Q) \;=\; 1 - \prod_{k=1}^{K} \bigl(1 - P_d^{k}(Q)\bigr)

((5))

This is the “any satellite saw it” probability. Folds into the TMTPP likelihood as a single replacement of $P_d$ with $P_d^{\cup}$ . Adds one strong assumption: detections by different satellites are conditionally independent given the leak size — defensible at the population level, possibly violated for clustered super-emitters.

Open questions¶

References¶

GHGSat Inc. (2016). GHGSat WAF-P imaging spectrometer constellation. https://www.ghgsat.com/
Veefkind, J. P., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H. J., de Haan, J. F., Kleipool, Q., & others. (2012). TROPOMI on the ESA Sentinel-5 Precursor: a GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications. Remote Sensing of Environment, 120, 70–83.
U.S. Environmental Protection Agency. (2024). Inventory of U.S. Greenhouse Gas Emissions and Sinks: 1990–2022. EPA 430-R-24-004. https://www.epa.gov/ghgemissions/inventory-us-greenhouse-gas-emissions-and-sinks
Scarpelli, T. R., Jacob, D. J., Maasakkers, J. D., Sulprizio, M. P., Sheng, J.-X., Rose, K., Romeo, L., Worden, J. R., & Janssens-Maenhout, G. (2020). A global gridded (0.1° × 0.1°) inventory of methane emissions from oil, gas, and coal exploitation based on national reports to the United Nations Framework Convention on Climate Change. Earth System Science Data, 12(1), 563–575. 10.5194/essd-12-563-2020
Maasakkers, J. D., Mcduffie, E. E., Sulprizio, M. P., Chen, C., Schultz, M., Brunelle, L., Thrush, R., Steller, J., Sherry, C., Jacob, D. J., & others. (2023). A gridded inventory of annual 2012-2018 U.S. anthropogenic methane emissions. Environmental Science & Technology, 57(43), 16276–16288. 10.1021/acs.est.3c05138
Jacob, D. J., Varon, D. J., Cusworth, D. H., Dennison, P. E., Frankenberg, C., Gautam, R., Guanter, L., Kelley, J., McKeever, J., Ott, L. E., Poulter, B., & others. (2022). Quantifying methane emissions from the global scale down to point sources using satellite observations of atmospheric methane. Atmospheric Chemistry and Physics, 22(14), 9617–9646. 10.5194/acp-22-9617-2022
Varon, D. J., Jacob, D. J., McKeever, J., Jervis, D., Durak, B. O. A., Xia, Y., & Huang, Y. (2018). Quantifying methane point sources from fine-scale satellite observations of atmospheric methane plumes. Atmospheric Measurement Techniques, 11(10), 5673–5686. 10.5194/amt-11-5673-2018