Export srs_diff_est() by vinniott · Pull Request #340 · stan-dev/loo

vinniott · 2026-03-22T11:27:41Z

Fixes #333

Note:

There is no @example yet because I did not have time yet to get fully familiar with the whole package.
I updated NEWS.md as suggested in CONTRIBUTING.md but I am not sure whether I did that correctly.

synced with upstream/master

@avehtari

as proposed by @avehtari in issue stan-dev#333

codecov-commenter · 2026-03-22T14:46:37Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.78%. Comparing base (7eafeb8) to head (5d476b5).

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #340   +/-   ##
=======================================
  Coverage   92.78%   92.78%           
=======================================
  Files          31       31           
  Lines        2992     2992           
=======================================
  Hits         2776     2776           
  Misses        216      216

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

jgabry · 2026-03-23T17:16:37Z

Thank you @vinniott.

There is no @example yet because I did not have time yet to get fully familiar with the whole package.

@avehtari or @MansMeg, is there any specific example you'd like to use for this in the documentation?

avehtari · 2026-03-23T18:13:27Z

The example should be based on flexible enough model so that elpd(log_lik_matrix) and loo(log_lik_matrix) differ more than by 1. The current example_loglik_matrix() has too few observations. The example used in tests for subsampling is one parameter model. Should we have a real model, or store another example loglik matrix?

After we have useful loglik matrix, the example code would be something like

# Use posterior predictive density as the fast but biased method for all observations
lpd <- elpd(log_lik_matrix)
sum(lpd$pointwise[,"elpd"])

# Use PSIS-LOO for subsample of 50 randomly selected observations
idx <- sample(1:N, 50)
elpd_loo_sub <- loo(log_lik_matrix[,idx])
20 * sum(elpd_loo_sub$pointwise[,"elpd_loo"])

# Use difference estimator to combine fast result and subsampled accurate result
loo:::srs_diff_est(lpd$pointwise[,"elpd"], elpd_loo_sub$pointwise[,"elpd_loo"], idx)

# Comparison to using PSIS-LOO for all observations
loo(log_lik_matrix)

This matches what someone was asking

jgabry · 2026-03-23T20:50:06Z

Should we have a real model, or store another example loglik matrix?

Either is fine by me. Also if we're only using it for this example, we could also just generate an example loglik matrix in the example code instead of storing it.

avehtari · 2026-03-24T18:21:21Z

I think the interesting examples can be slow to run. I'll test subsampling with few interesting real models this week

avehtari · 2026-03-27T10:38:50Z

Thus would be a good example with data from https://archive.ics.uci.edu/ml/datasets/wine+quality

library(dplyr)
library(brms)
options(brms.backend = "cmdstanr")
options(mc.cores = 4)
library(loo)

wine <- read.delim(root("winequality-red", "winequality-red.csv"), sep = ";") |>
  distinct()

wine_scaled <- as.data.frame(scale(wine))

fitos <- brm(ordered(quality) ~ .,
            family = cumulative("logit"),
            prior = prior(R2D2(mean_R2 = 1/3, prec_R2 = 3)),
            data = wine_scaled,
            seed = 1,
            silent = 2,
            refresh = 0)

log_lik_matrix <- log_lik(fitos)

N <- nrow(wine_scaled)
Nsub <- 100

# posterior log-score
lpd <- elpd(log_lik_matrix)
sum(lpd$pointwise[,"elpd"])

# Use PSIS-LOO for subsample of Nsub randomly selected observations
set.seed(1)
idx <- sample(1:N, Nsub)
elpd_loo_sub <- loo(log_lik_matrix[,idx])
sum(elpd_loo_sub$pointwise[,"elpd_loo"]) / Nsub * N

# Use difference estimator to combine fast result and subsampled accurate result
loo:::srs_diff_est(lpd$pointwise[,"elpd"], elpd_loo_sub$pointwise[,"elpd_loo"], idx)

# Comparison to using PSIS-LOO for all observations
loo(log_lik_matrix)

p_loo is here about 17 and thus posterior log-score is clearly different
N is 1359, so that a subsample of 100 is still only small part of all observations
No high Pareto-k values to complicate things
Subsampling with Nsub gets close to the full result

As compiling and sampling the brms model takes some time, I would store only the log_lik_matrix but show the code for how it is generated. The rest of code is fast

vinniott added 11 commits March 17, 2026 19:58

set up documentation structure

84ee41f

srs_diff_est.Rd matches .R documentation

7d2c817

Merge branch 'master' into export-srs-diff-est

a914fc6

synced with upstream/master

added documentation

816bcf8

as proposed by @avehtari in issue stan-dev#333

added @Seealso at loo_subsample()

e596847

added reference Cochran (1977)

25fddcf

removed oudated @return duplicate

fe5a45e

corrected .R formulas to render in .Rd

dd72938

removed example placeholder

287c039

updated .Rd to match .R

d4dda71

Update NEWS.md

5d476b5

vinniott mentioned this pull request Mar 22, 2026

export srs_diff_est #333

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Export srs_diff_est()#340

Export srs_diff_est()#340
vinniott wants to merge 11 commits intostan-dev:masterfrom
vinniott:export-srs-diff-est

vinniott commented Mar 22, 2026

Uh oh!

codecov-commenter commented Mar 22, 2026

Uh oh!

jgabry commented Mar 23, 2026

Uh oh!

avehtari commented Mar 23, 2026

Uh oh!

jgabry commented Mar 23, 2026

Uh oh!

avehtari commented Mar 24, 2026

Uh oh!

avehtari commented Mar 27, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

vinniott commented Mar 22, 2026

Uh oh!

codecov-commenter commented Mar 22, 2026

Codecov Report

Uh oh!

jgabry commented Mar 23, 2026

Uh oh!

avehtari commented Mar 23, 2026

Uh oh!

jgabry commented Mar 23, 2026

Uh oh!

avehtari commented Mar 24, 2026

Uh oh!

avehtari commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

avehtari commented Mar 27, 2026 •

edited

Loading