Pareto optimization #475
base: main
Conversation
385bebe to e633f92
first round of comments
abbreviation: ClassVar[str] = "qLogNEHVI"

ref_point: float | tuple[float, ...] | None = field(
do we want to include the `prune_baseline` keyword? Sounds useful, or it could always be set to true depending on our preference. Any merit to including some of the other keywords that might be useful? Thinking of `eta`, `alpha`, or `fat`.
Yes, why not, let's agree on a subset. I'd say let's include `prune_baseline` with `True` as default. But I would not include anything that we don't yet fully understand ourselves / stuff that does not primarily affect the optimization. So if you ask me, I'd leave it at that. Opinions?
Looking at our implementation of `X_baseline`, we definitely need the pruning to be true as we don't do any pre-selection; perhaps there's no need to make it configurable, tbh. This made me look up what is done for the noisy non-HV variant `qNEI`, because we have that included and also set the baseline to just all train data. Strangely, there the default for `prune_baseline` is `True`, while the HV variant here has it set to `False`. So I would ensure it's `True` by hardcoding it in base.py.
For the other parameters, I think the only one I would possibly provide access to is `alpha`. If the other HV variants are also included, they need an alpha passed to `partitioning`. To simplify things, we could also not make `alpha` configurable but set the value according to a heuristic: it seems it should be 0.0 for m <= 5, and then we could add a linear increase until m = 20 or so.
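A minimal sketch of what such a heuristic could look like (the cap of 0.1 at m = 20 and the function name are placeholders, not actual proposed numbers):

```python
def _default_alpha(n_targets: int) -> float:
    """Placeholder heuristic: exact partitioning (alpha = 0.0) up to 5 targets,
    then a linear ramp towards an assumed maximum of 0.1 at 20 targets."""
    if n_targets <= 5:
        return 0.0
    return min(0.1, 0.1 * (n_targets - 5) / 15)
```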
But then I think it's more elegant to just add it to our `qLogNEHVI` wrapper with default value `True`, just like we did for the scalar version. That way, we have a consistent, useful default while still being configurable.
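Something along these lines, assuming the wrapper stays an attrs class as in the diff above (sketch only, not the final code):

```python
from typing import ClassVar

from attrs import define, field
from attrs.validators import instance_of


@define(frozen=True)
class qLogNEHVI:
    """Sketch of the wrapper; the real class derives from our acqf base class."""

    abbreviation: ClassVar[str] = "qLogNEHVI"

    ref_point: float | tuple[float, ...] | None = field(default=None)
    """Optional reference point for the hypervolume computation."""

    prune_baseline: bool = field(default=True, validator=instance_of(bool))
    """Prune dominated points from the baseline before evaluation."""
```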
absolutely, let's use this. So `alpha` becomes a property and not an attribute, right?
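i.e. something in this direction (minimal stand-in illustration, reusing the placeholder heuristic sketched further up)?

```python
from attrs import define


def _default_alpha(n_targets: int) -> float:
    # same placeholder heuristic as sketched further up
    return 0.0 if n_targets <= 5 else min(0.1, 0.1 * (n_targets - 5) / 15)


@define
class _HvAcqfSketch:
    """Stand-in class: alpha is derived from the number of targets, not stored."""

    n_targets: int

    @property
    def alpha(self) -> float:
        return _default_alpha(self.n_targets)
```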
@@ -61,6 +63,8 @@
    "qUpperConfidenceBound",
    # Thompson Sampling
    "qThompsonSampling",
    # Hypervolume Improvement
    "qLogNoisyExpectedHypervolumeImprovement",
why not include `qExpectedHypervolumeImprovement` too?
there are in fact 4 choices: with and without `log`, and with and without `noisy`. The reason why I haven't included the non-noisy versions yet is that I haven't really gotten into the partitioning mechanics required for those. Do you already have some insights to share here?
can you explain why this matters? Does the implementation here require anything different for, e.g., just swapping one of the other functions in?
yes, it requires passing an explicit partitioning object. Probably not a big deal, though, I just haven't had the time yet to fully understand the underlying mechanism. I guess this is analogous to the 1D case where for the regular EI you pass `best_f` but for the noisy version you don't. In that sense, the partitioning would act like the multidimensional generalization of `best_f`. Whoever of us gets there first can add the logic 👍🏼
ok, didn't realize that. I don't understand yet why the other methods require a partitioning, but there are exact and approximate utilities that essentially only depend on `ref_point` and `Y`, so in principle there should be no obstacle to computing a property that provides the partitioning.
it further seems that the interface differences might be due to legacy reasons or so: for the noisy variant you will find an `alpha` parameter which has the same role as the alpha parameter of the partitioning utility. So it appears the partitioning is just done internally there, which imo would justify just hardcoding the partitioning to be e.g. `FastNondominatedPartitioning`.
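For reference, building the partitioning from `ref_point` and `Y` with BoTorch would look roughly like this (standalone sketch with made-up tensors):

```python
import torch
from botorch.utils.multi_objective.box_decompositions.non_dominated import (
    FastNondominatedPartitioning,
)

# Made-up two-target example: reference point and observed objective values,
# both in maximization orientation as BoTorch expects.
ref_point = torch.tensor([0.0, 0.0])
Y = torch.rand(20, 2)

partitioning = FastNondominatedPartitioning(ref_point=ref_point, Y=Y)

# The non-noisy variants would then receive this object explicitly, e.g.
# qLogExpectedHypervolumeImprovement(model=model, ref_point=ref_point, partitioning=partitioning)
```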
examples/Multi_Target/pareto.py (Outdated)
searchspace=searchspace,
objective=ParetoObjective([y0, y1]),
recommender=BotorchRecommender(
    acquisition_function=qLogNEHVI(),
do we have the option to make an HVI-based acqf the default in case there is a multi-output objective?
Thanks for the reminder. Actually wanted to do this but forgot. Now it's included
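Roughly this kind of dispatch (illustrative sketch only, not the actual diff; the helper name and import paths are assumptions, the class names follow the PR):

```python
from baybe.acquisition import (
    qLogExpectedImprovement,
    qLogNoisyExpectedHypervolumeImprovement,
)
from baybe.objectives.base import Objective


def _default_acqf(objective: Objective):
    """Assumed selection rule: HVI-based acqf for multi-output objectives."""
    if len(objective.targets) > 1:
        return qLogNoisyExpectedHypervolumeImprovement()
    return qLogExpectedImprovement()
```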
The question, though, is what the actual defaults should be; perhaps we should discuss this. What do you think?
In analogy to our single-task default, the default here should be the non-noisy log variant, but tbh I don't mind if there's evidence or practical preference for another one. We could also debate whether the single-task default should be the noisy log variant, but this eventually becomes a question like with the priors, where we should look at the outcomes on our benchmarks. So that decision can be postponed until those are ready.
hmm ok, fine if we don't have consistent defaults at the moment. As I said, I'd investigate and potentially change this after the benchmarking is more complete. At least one colleague already told me some time ago that noisy EI delivered better results, so the hope would be that it's also better across the board there.
Detached comment 2: no particular tests? The Pareto objective is not tested: no hypothesis strategy for it, and no integration tests like iterations (unless that's done automatically, but I don't think we have tests that iterate over objective types).
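As a starting point, a basic creation test could look something like this (import path for `ParetoObjective` assumed; just to sketch the kind of coverage meant here, a hypothesis strategy would come on top):

```python
from baybe.objectives import ParetoObjective  # import path assumed
from baybe.targets import NumericalTarget


def test_pareto_objective_creation():
    """Sketch: the objective accepts MIN/MAX targets and preserves them."""
    targets = [
        NumericalTarget(name="t_1", mode="MIN"),
        NumericalTarget(name="t_2", mode="MAX"),
    ]
    objective = ParetoObjective(targets=targets)
    assert [t.name for t in objective.targets] == ["t_1", "t_2"]
```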
189bbe5 to 614a660
614a660 to 758fb7d
Co-authored-by: Martin Fitzner <[email protected]>
The ref_point is now in the original target space so that the user can intuitively specify its coordinates. Sign flips for minimization targets happen behind the scenes.
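Conceptually, the conversion amounts to something like this (illustrative only, not the actual implementation):

```python
import torch

# User-facing reference point in original target space for a (MIN, MAX) target pair
ref_point = torch.tensor([10.0, 0.5])

# Internally, coordinates of minimization targets are flipped so that BoTorch
# can treat everything as maximization.
maximize = torch.tensor([False, True])
botorch_ref_point = torch.where(maximize, ref_point, -ref_point)

print(botorch_ref_point)  # tensor([-10.0000,   0.5000])
```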
27ae08c to 711935f
weights. By contrast, the Pareto approach allows specifying this trade-off
*after* the experiments have been carried out, giving the user the flexibility to adjust
their preferences post-hoc – knowing that each of the obtained points is optimal
with respect to a particular preference model. In this sense, the
would drop the last sentence; doesn't seem necessary / a bit opinionated
target_2 = NumericalTarget(name="t_2", mode="MAX")
objective = ParetoObjective(targets=[target_1, target_2])
Imo it's important that this restricts the acqf choices. So I would cross-reference here that the optimization is achieved via special acqfs, linking to the autodoc or later on to the acqf user guide.
This PR finally brings Pareto optimization via the new `ParetoObjective` class, together with an example comparing Pareto vs. single-target optimization for a simple pair of synthetic targets.

Note: Support is currently limited to maximization and minimization targets. Match targets will follow but require a refactoring of the corresponding target transformation mechanism.
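For context, the usage pattern from the included example looks roughly like this (condensed sketch: the parameter/search-space definition and import paths are assumptions, the objective/recommender wiring follows the example file in the diff):

```python
from baybe import Campaign
from baybe.acquisition import qLogNEHVI
from baybe.objectives import ParetoObjective
from baybe.parameters import NumericalDiscreteParameter
from baybe.recommenders import BotorchRecommender
from baybe.searchspace import SearchSpace
from baybe.targets import NumericalTarget

# Two competing synthetic targets, as in the example
y0 = NumericalTarget(name="y0", mode="MAX")
y1 = NumericalTarget(name="y1", mode="MIN")

# Placeholder search space with a single discrete parameter
searchspace = SearchSpace.from_product(
    [NumericalDiscreteParameter(name="x", values=(0.0, 0.5, 1.0))]
)

campaign = Campaign(
    searchspace=searchspace,
    objective=ParetoObjective(targets=[y0, y1]),
    recommender=BotorchRecommender(acquisition_function=qLogNEHVI()),
)
```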