PPC Calibration plots #352

Draft · wants to merge 5 commits into master

Conversation

@TeemuSailynoja (Collaborator) commented May 19, 2025

This is my work in progress on the PAVA calibration plots discussed in #343.

Currently implemented:

  • ppc_calibration_overlay()
  • ppc_calibration_overlay_grouped()
  • ppc_calibration()
  • ppc_calibration_grouped()
  • .ppc_calibration_data() - internal function

Needs:

  • A fast example for testing the functions.
  • Fix the intervals in ppc_calibration().
  • Example usage in the documentation.
  • LOO versions.
  • Decide whether .ppc_calibration_data() should be exposed to users.
  • Tests.
  • Check that the input parameter names and default values make sense and are intuitive.
  • Add documentation and comments to the code.

@codecov-commenter commented May 19, 2025

Codecov Report

Attention: Patch coverage is 0% with 134 lines in your changes missing coverage. Please review.

Project coverage is 96.28%. Comparing base (f8fab2f) to head (f9806eb).
Report is 18 commits behind head on master.

Files with missing lines   Patch %   Lines
R/ppc-calibration.R        0.00%     134 Missing ⚠️
@@            Coverage Diff             @@
##           master     #352      +/-   ##
==========================================
- Coverage   98.60%   96.28%   -2.33%     
==========================================
  Files          35       36       +1     
  Lines        5533     5673     +140     
==========================================
+ Hits         5456     5462       +6     
- Misses         77      211     +134     


@TeemuSailynoja (Collaborator, Author) commented
Examples

These should allow for some tests of these functions.

Creating example data

library(bayesplot)
y_range <- range(example_y_data(), example_yrep_draws())
ymin <- y_range[1]
ymax <- y_range[2]
# Observations and posterior predictive probabilities.
y <- rbinom(length(example_y_data()), 1, (example_y_data() - ymin) / (ymax - ymin))
prep <- (example_yrep_draws() - ymin) / (ymax - ymin)
groups <- example_group_data()

PAVA Calibration overlay

Basic

ppc_calibration_overlay(y, prep[1:50,])

[image: PAVA calibration overlay plot]

Grouped

ppc_calibration_overlay_grouped(y, prep[1:50,], groups)

[image: grouped PAVA calibration overlay plot]

PAVA Calibration

This isn't quite what we want yet: the interval shown here is not the one we use in the paper. There, we use consistency intervals, that is, intervals centered at the diagonal that display where the calibration curve should lie if the model is calibrated, i.e., the posterior mean should stay within these bounds.
In this implementation, I'm plotting a confidence interval, which shows where we think the calibration curve lies, i.e., the diagonal should be included.

ppc_calibration(y, prep)

[image: PAVA calibration plot with interval]

ppc_calibration_grouped(y, prep, groups)

[image: grouped PAVA calibration plot]
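To make the consistency-interval idea concrete, here is a minimal sketch of one way such a band could be computed; this is an illustration under stated assumptions, not the PR's implementation, and consistency_band and its arguments are hypothetical. The idea: simulate outcomes as if the predicted probabilities were perfectly calibrated, fit a PAVA curve to each simulated data set, and take pointwise quantiles of those curves.

# Hypothetical helper, not part of this PR: a consistency band computed by
# simulating outcomes under perfect calibration and applying PAVA to each
# simulated data set.
consistency_band <- function(p, n_sims = 1000, prob = 0.95) {
  p_sorted <- sort(p)
  curves <- replicate(n_sims, {
    y_sim <- rbinom(length(p_sorted), 1, p_sorted)  # outcomes if p is calibrated
    stats::isoreg(p_sorted, y_sim)$yf               # PAVA calibration curve
  })
  alpha <- (1 - prob) / 2
  list(
    p     = p_sorted,
    lower = apply(curves, 1, stats::quantile, probs = alpha),
    upper = apply(curves, 1, stats::quantile, probs = 1 - alpha)
  )
}

# The observed PAVA curve of a calibrated model should stay inside the band.
band <- consistency_band(colMeans(prep))

By construction this band hugs the diagonal, matching the description above, whereas the confidence interval currently plotted is centered at the estimated curve.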

@jgabry (Member) left a comment

This all sounds good, thanks @TeemuSailynoja. I made a few small review comments/questions. In addition to those questions, when you say

This isn't quite what we want yet: the interval shown here is not the one we use in the paper.

you mean that we will want to change this to use the consistency intervals you use in the paper, right? Do you think it's at all useful to give the user the option to choose which kind of interval? Or just strictly better to use the consistency intervals? I hadn't really thought about that.

Comment on lines +203 to +209
if (requireNamespace("monotone", quietly = TRUE)) {
  # Use the fast C implementation of PAVA from the 'monotone' package.
  monotone <- monotone::monotone
} else {
  # Fall back to base R isotonic regression, which fits the same curve.
  monotone <- function(y) {
    stats::isoreg(y)$yf
  }
}
@jgabry (Member):

Is there an advantage to using monotone::monotone instead of stats::isoreg?

@jgabry (Member) commented May 22, 2025:

That is, does it do something slightly better? Or the same thing more efficiently? I've seen stats::isoreg before but I had never seen the monotone package. If there's no difference then it's probably not worth checking for the monotone package. If it's better then we could put monotone in Suggests and then check for it like you do here.
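For reference, a quick check along these lines could answer this; a sketch, assuming the bench package purely for timing (any benchmarking tool would do). Since the PR's fallback treats the two as interchangeable, the fits should agree up to numerical error, and the question reduces to speed.

y <- cumsum(rnorm(1e5))             # an arbitrary test sequence
fit_fast <- monotone::monotone(y)   # 'monotone' package implementation of PAVA
fit_base <- stats::isoreg(y)$yf     # base-R isotonic regression
all.equal(fit_fast, fit_base)       # same unweighted isotonic fit
bench::mark(monotone::monotone(y), stats::isoreg(y)$yf)  # timing comparison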

#' @rdname PPC-calibration
#' @export
ppc_calibration_overlay <- function(
    y, prep, ..., linewidth = 0.25, alpha = 0.5) {
@jgabry (Member):

So for these functions prep is a matrix of probabilities, not a matrix of draws of binary outcomes from the posterior predictive distribution, right? In that case the argument name prep makes sense. But the description at the top of the file says

Assess the calibration of the predictive distributions yrep in relation to the data `y`

which makes it sound like the user should give us yrep. So I think we just need to reconcile how we describe this to the user.
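As an illustration of that distinction, here is a hypothetical example (not from the PR; fit, x, and d are placeholders) of where prep would come from with an rstanarm binary regression: posterior_epred() returns the event probabilities, while posterior_predict() returns the binary yrep draws.

fit <- rstanarm::stan_glm(y ~ x, family = binomial(), data = d)
prep <- rstanarm::posterior_epred(fit)    # draws x N matrix of probabilities
yrep <- rstanarm::posterior_predict(fit)  # draws x N matrix of 0/1 outcomes
ppc_calibration_overlay(d$y, prep[1:50, ])  # calibration wants prep, not yrep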
