Coverage is not enough: Frequentist tests of simulation-based inference for primordial non-Gaussianity

AI-generated keywords: Simulation-based inference Primordial non-Gaussianity Cosmological parameter estimation Likelihood-based inference Validation strategies

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Simulation-based inference (SBI) is valuable for analyzing complex, non-linear data without analytical likelihoods
  • SBI reliability typically assessed using coverage-based diagnostics under prior predictive distribution, which may not provide constraints on posterior behavior at fixed parameter values
  • Study focused on primordial non-Gaussianity parameterized by $f_\mathrm{NL}$ using simulations of dark matter halo field
  • Comparison between SBI based on contrastive neural ratio estimation (CNRE) and likelihood-based inference (LBI) using statistical measures like power spectrum, bispectrum, and wavelet scattering transform (WST) coefficients across 1000 realizations
  • SBI and LBI generally agreed well on posterior means and skewness but showed discrepancies in variance consistency and kurtosis values
  • Incorporating WST coefficients enhanced constraints on $f_\mathrm{NL}$, highlighting the potential of higher-order statistics in refining cosmological parameter estimation
  • Importance of validation strategies beyond standard coverage diagnostics to probe posterior shapes comprehensively for accurate cosmological information extraction
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Toka Alokda, Cristiano Porciani, Alexander Eggemeier

arXiv: 2605.00980v1 - DOI (astro-ph.CO)
15 pages, 9 figures, 2 tables. Submitted to Astronomy & Astrophysics. Comments are welcome

Abstract: (Abridged) Simulation-based inference (SBI) has emerged as a powerful framework for extracting cosmological information from complex, non-linear data where analytical likelihoods are unavailable. Its reliability is commonly assessed using coverage-based diagnostics under the prior predictive distribution, which probe calibration only in an averaged sense and do not constrain posterior behavior at fixed parameter value, the regime relevant for practical inference. We investigate these limitations in the context of primordial non-Gaussianity, parameterized by $f_\mathrm{NL}$, using simulations of the dark matter halo field. We compare SBI based on contrastive neural ratio estimation (CNRE) with likelihood-based inference (LBI) using the power spectrum, bispectrum, and wavelet scattering transform (WST) coefficients across 1000 realizations. SBI and LBI agree well on posterior means and skewness, while the variance agrees on average but shows weaker realization-by-realization consistency. Larger differences arise in the kurtosis, indicating discrepancies in the posterior tails. These effects are already present for the power spectrum - where the Gaussian likelihood assumed in LBI is best justified - and are most pronounced for the combined power spectrum and bispectrum, where SBI posteriors are often underconfident and can yield weaker constraints than either statistic individually, despite passing coverage tests. WST coefficients further tighten constraints on $f_\mathrm{NL}$, even when restricted to large scales. Our results highlight both the potential of higher-order statistics and the need for validation strategies that probe the posterior shape beyond standard coverage diagnostics.

Submitted to arXiv on 01 May. 2026

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2605.00980v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Coverage is not enough: Frequentist tests of simulation-based inference for primordial non-Gaussianity," authors Toka Alokda, Cristiano Porciani, and Alexander Eggemeier delve into the limitations of simulation-based inference (SBI) in the context of extracting cosmological information related to primordial non-Gaussianity. SBI has proven to be a valuable tool for analyzing complex, non-linear data where analytical likelihoods are not readily available. However, the reliability of SBI is typically evaluated using coverage-based diagnostics under the prior predictive distribution, which only assesses calibration in an averaged sense and does not provide constraints on posterior behavior at fixed parameter values - a crucial aspect for practical inference. To address these limitations, the authors conducted a comprehensive investigation focusing on primordial non-Gaussianity parameterized by $f_\mathrm{NL}$ using simulations of the dark matter halo field. They compared SBI based on contrastive neural ratio estimation (CNRE) with likelihood-based inference (LBI) utilizing various statistical measures such as the power spectrum, bispectrum, and wavelet scattering transform (WST) coefficients across 1000 realizations. The study revealed that SBI and LBI generally agreed well on posterior means and skewness, with some discrepancies observed in variance consistency across realizations. Notably, larger differences were found in kurtosis values, indicating discrepancies in posterior tails. These discrepancies were evident even for the power spectrum analysis - where Gaussian likelihood assumptions are most justified - and were particularly pronounced when considering combined power spectrum and bispectrum analyses. In such cases, SBI posteriors tended to be underconfident and could lead to weaker constraints compared to individual statistics despite passing coverage tests. Furthermore, incorporating WST coefficients further enhanced constraints on $f_\mathrm{NL}$, showcasing the potential of higher-order statistics in refining cosmological parameter estimation. The results underscored the importance of validation strategies that go beyond standard coverage diagnostics to probe posterior shapes more comprehensively. Overall, this study sheds light on the nuances of simulation-based inference in cosmological parameter estimation and emphasizes the significance of exploring alternative statistical approaches to improve accuracy and reliability in extracting cosmological information from complex datasets.
Created on 05 May. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.