Naive approach for IID potential evaluation for score estimators #1508

Kartik-Sama · 2025-03-20T19:14:31Z

Addresses the issue #1450

Used Effective Sample Score (ESS) to evaluate the log_probs returned by estimated posterior. This way log_probs are being checked both for single observation and iid observations case through the test added via the parameter - iid_batch_size

codecov · 2025-03-20T20:22:56Z

Codecov Report

Attention: Patch coverage is 73.07692% with 7 lines in your changes missing coverage. Please review.

Project coverage is 79.31%. Comparing base (b5b4790) to head (f0da7e2).
Report is 9 commits behind head on main.

Files with missing lines	Patch %	Lines
sbi/inference/potentials/score_based_potential.py	70.00%	6 Missing ⚠️
sbi/inference/posteriors/score_posterior.py	83.33%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #1508       +/-   ##
===========================================
- Coverage   89.70%   79.31%   -10.40%     
===========================================
  Files         122      125        +3     
  Lines        9394     9828      +434     
===========================================
- Hits         8427     7795      -632     
- Misses        967     2033     +1066

Flag	Coverage Δ
unittests	`79.31% <73.07%> (-10.40%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
sbi/inference/posteriors/score_posterior.py	`80.95% <83.33%> (-11.13%)`	⬇️
sbi/inference/potentials/score_based_potential.py	`81.35% <70.00%> (-15.71%)`	⬇️

... and 37 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

manuelgloeckler · 2025-03-21T07:35:14Z

sbi/simulators/__init__.py

+from sbi.simulators.linear_gaussian import (
+    diagonal_linear_gaussian,
+    linear_gaussian,
+    true_posterior_linear_gaussian_mvn_prior,


Please revert this change. Its unrelated, instead just import in the tests from sbi.simulators.linear_gaussian import ...

manuelgloeckler · 2025-03-21T07:38:38Z

tests/score_log_prob_test.py

+    )
+
+
+def _compute_ess(proposal_log_weights: Tensor, true_log_weights: Tensor):


For these two lines I would prefer not introduce a new function with a big docstring (except if its reused several times and then it should be in sbi.utils.

I would just move the calculations in the test above and add a one sentence inline comment.

manuelgloeckler · 2025-03-21T07:42:56Z

tests/score_log_prob_test.py

+
+@pytest.mark.parametrize("num_dims", [1, 2])
+@pytest.mark.parametrize("iid_batch_size", [1, 2])
+def test_score_fn_log_prob(num_dims, iid_batch_size):


This test will be quite expensive now as one would have to retrain on each combination.

Please:

move the test to linearGaussian_npse_test.py

similar to test_npse_iid_inference use the fixture npse_trained_model (you can skip all the posteriors which are have a uniform prior

This fixture is only trained once and the used by all tests, avoiding this overhead.

manuelgloeckler

Thanks for implementing looks great.

Just a few changes i.e. moving the tests and using corresponding pytest fixture needs to be done.

…r_gaussian_mvn_prior in tests

manuelgloeckler

Thanks, great effort. Looks good.

This is done, but I will block merging this for now as it will have conflicts with #1497.

Kartik-Sama requested review from gmoss13 and manuelgloeckler March 20, 2025 19:14

Kartik-Sama force-pushed the score_matching branch from e16d436 to e5c8480 Compare March 21, 2025 05:51

manuelgloeckler reviewed Mar 21, 2025

View reviewed changes

manuelgloeckler requested changes Mar 21, 2025

View reviewed changes

Kartik-Sama added 7 commits March 21, 2025 11:39

Changes to compute log_prob of NPSE under iid observations

0b7c31f

Improved variable naming, and fixed duplicate log_probs being returned

43c5e63

cleaned unneccesary comment

ca2a7e2

Added doc strings for score based log prob test function

a0edac9

init path for simulators modified to make use of true_posterior_linea…

dd48e41

…r_gaussian_mvn_prior in tests

Removed redundant print statement

88e5aac

Moved test of log_prob calculation for NPSE to an existing test file

f0da7e2

Kartik-Sama force-pushed the score_matching branch from e5c8480 to f0da7e2 Compare March 21, 2025 10:39

Kartik-Sama requested a review from manuelgloeckler March 21, 2025 10:44

manuelgloeckler approved these changes Mar 21, 2025

View reviewed changes

manuelgloeckler added the blocked Something is in the way of fixing this. Refer to it in the issue label Mar 21, 2025

This was referenced Mar 24, 2025

Unify flow matching and score-based models #1497

Merged

Add GPU tests to vector field methods #1530

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Naive approach for IID potential evaluation for score estimators #1508

Naive approach for IID potential evaluation for score estimators #1508

Uh oh!

Kartik-Sama commented Mar 20, 2025

Uh oh!

codecov bot commented Mar 20, 2025 •

edited

Loading

Uh oh!

manuelgloeckler Mar 21, 2025

Uh oh!

manuelgloeckler Mar 21, 2025

Uh oh!

manuelgloeckler Mar 21, 2025

Uh oh!

manuelgloeckler left a comment

Uh oh!

manuelgloeckler left a comment

Uh oh!

Uh oh!

		)


		def _compute_ess(proposal_log_weights: Tensor, true_log_weights: Tensor):

Naive approach for IID potential evaluation for score estimators #1508

Are you sure you want to change the base?

Naive approach for IID potential evaluation for score estimators #1508

Uh oh!

Conversation

Kartik-Sama commented Mar 20, 2025

Uh oh!

codecov bot commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

manuelgloeckler Mar 21, 2025

Choose a reason for hiding this comment

Uh oh!

manuelgloeckler Mar 21, 2025

Choose a reason for hiding this comment

Uh oh!

manuelgloeckler Mar 21, 2025

Choose a reason for hiding this comment

Uh oh!

manuelgloeckler left a comment

Choose a reason for hiding this comment

Uh oh!

manuelgloeckler left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov bot commented Mar 20, 2025 •

edited

Loading