
Expanded gaussian_process_torch hyperparameter space #243

Merged (10 commits into alan-turing-institute:main, Sep 30, 2024)
Conversation

@MaxBalmus (Collaborator) commented Sep 17, 2024

  1. Expanded the hyperparameter space of the torch GPEs to include more expensive, but also potentially more expressive, cases:

    • added ScaleKernel(MaternKernel()) + ConstantKernel()
    • set ard_num_dims to the number of inputs for the kernels where this is possible (this allows the kernels to adjust the lengthscale for each input parameter individually)
    • replaced ConstantMean() and ZeroMean() in the mean_module with the higher-order LinearMean() and PolyMean(degree=2)
    • PolyMean is a new mean_module class found in autoemulatate.emulators.gaussian_process_utils (see the first sketch below)
    • process_param_space: added a new parameter input_dim (default value 1), equal to the number of dimensions of the input. This addition is necessary to be able to define the value of ard_num_dims.
  2. Defined a new callback EarlyStoppingMax, also found in autoemulatate.emulators.gaussian_process_utils (see the second sketch below):

    • the new class is derived from skorch.callbacks.EarlyStopping
    • it fixes a bug in which the cost function is assumed to always be positive; for negative cost values this can break the monotonicity of the cost-function threshold
    • a pull request fixing the original bug was opened on skorch and is currently under review
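
For context, a minimal sketch of what a degree-2 polynomial mean module along these lines could look like. The constructor signature and the inline feature expansion are assumptions; the PR's actual PolyMean delegates the expansion to the separate PolynomialFeatures helper:

    import itertools

    import gpytorch
    import torch


    class PolyMean(gpytorch.means.Mean):
        """Sketch: mean function m(x) = w^T phi(x) + b, where phi(x) stacks
        all monomials of the inputs up to the given degree."""

        def __init__(self, degree: int, input_size: int, bias: bool = True):
            super().__init__()
            self.degree = degree
            # Index tuples for every monomial up to `degree`, e.g. for
            # input_size=2, degree=2: (0,), (1,), (0,0), (0,1), (1,1)
            self.indices = [
                combo
                for d in range(1, degree + 1)
                for combo in itertools.combinations_with_replacement(range(input_size), d)
            ]
            self.register_parameter(
                "weights", torch.nn.Parameter(torch.randn(len(self.indices), 1))
            )
            if bias:
                self.register_parameter("bias", torch.nn.Parameter(torch.randn(1)))
            else:
                self.bias = None

        def forward(self, x):
            # Build the polynomial feature matrix, one column per monomial.
            features = torch.stack(
                [x[..., list(combo)].prod(dim=-1) for combo in self.indices], dim=-1
            )
            res = features.matmul(self.weights).squeeze(-1)
            return res if self.bias is None else res + self.bias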
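And a sketch of the early-stopping fix. The overridden method and its attributes follow skorch's EarlyStopping internals; treat the details as an assumption, not the PR's exact code:

    from skorch.callbacks import EarlyStopping


    class EarlyStoppingMax(EarlyStopping):
        """Sketch: keep the dynamic threshold monotone for negative scores.

        With threshold_mode="rel", skorch computes `threshold * score`, which
        flips sign when the monitored score is negative and can move the
        threshold in the wrong direction; taking abs(score) avoids this.
        """

        def _calc_new_threshold(self, score):
            if self.threshold_mode == "rel":
                abs_threshold_change = self.threshold * abs(score)  # abs() is the fix
            else:
                abs_threshold_change = self.threshold
            if self.lower_is_better:
                return score - abs_threshold_change
            return score + abs_threshold_change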

@mastoffel (Collaborator) left a comment

Great PR @MaxBalmus! Lots of good stuff in there. I've left some comments; let me know whether anything is unclear.

Some broader comments:

  1. having separate lengthscales per feature sounds great, but I suggested a slightly different implementation (see below). Let me know whether you'd like to have a go, otherwise I'm happy to do that.
  2. I think it'd be good to put the train/valid split for early stopping in a separate PR
  3. there's a typo in the utils file name
  4. it'd be great to add docstrings and tests where possible (see below for details)
  5. could you give a bit of context about the new mean modules for me to understand them a bit better?

@@ -253,7 +253,7 @@ def predict(self, X, return_std=False):
         return mean

     @staticmethod
-    def get_grid_params(search_type: str = "random"):
+    def get_grid_params(search_type: str = "random", input_dim=1):
@mastoffel (Collaborator):

setting ard_num_dims is great, and we should definitely do that (I hope the computational cost isn't too big, but let's try). I think adding an input_dim argument isn't the cleanest option, as we don't need it for most models and it suddenly makes get_grid_params data-dependent. Instead, we can change the kernels to be callables and initialise them in fit, when self.n_features_in_ is available.

So we could have the kernels as callables like this:

    """Returns the grid parameters for the emulator."""
    param_space = {
        "covar_module": [
            lambda n_features: gpytorch.kernels.RBFKernel(ard_num_dims=n_features).initialize(lengthscale=1.0),
            lambda n_features: gpytorch.kernels.MaternKernel(nu=2.5, ard_num_dims=n_features),
            lambda n_features: gpytorch.kernels.MaternKernel(nu=1.5, ard_num_dims=n_features),
            lambda n_features: gpytorch.kernels.PeriodicKernel(ard_num_dims=n_features),
            lambda n_features: gpytorch.kernels.RQKernel(ard_num_dims=n_features),
        ],

and then in the fit method call the callable with the number of features:

    covar_module = self.covar_module(self.n_features_in_) if callable(self.covar_module) else self.covar_module

Let me know if there's anything I'm missing. Happy to implement this myself or leave it up to you!
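
A sketch of how that resolution might sit inside fit (the attribute names are assumed from the sklearn conventions used above, not taken from the PR diff):

    def fit(self, X, y):
        self.n_features_in_ = X.shape[1]
        # Resolve kernel callables now that the input dimension is known.
        covar_module = (
            self.covar_module(self.n_features_in_)
            if callable(self.covar_module)
            else self.covar_module
        )
        # ... build and train the GP with covar_module as before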

Comment on lines +124 to +125
    X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2)
    train_split = predefined_split(Dataset(X_val, y_val))
@mastoffel (Collaborator):

nice, I was wondering about this. The thing is that we have quite small datasets already: we split initially into test/train, then do CV on train (which splits again), so this would be a third split. Would it be ok to open a separate PR for this? Maybe we implement it as an option and see how it does.

@MaxBalmus (Collaborator, Author):

Yeah I see your point. Let's open a new PR.
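
For reference, a self-contained sketch of how a predefined validation split wires into a skorch estimator for early stopping (TinyNet is a hypothetical stand-in module, not autoemulate's GP module):

    import numpy as np
    import torch
    from sklearn.model_selection import train_test_split
    from skorch import NeuralNetRegressor
    from skorch.callbacks import EarlyStopping
    from skorch.dataset import Dataset
    from skorch.helper import predefined_split


    class TinyNet(torch.nn.Module):
        """Hypothetical stand-in module, just for illustration."""

        def __init__(self):
            super().__init__()
            self.lin = torch.nn.Linear(5, 1)

        def forward(self, X):
            return self.lin(X)


    X = np.random.rand(200, 5).astype(np.float32)
    y = X.sum(axis=1, keepdims=True).astype(np.float32)

    # Hold out 20% once; skorch then computes valid_loss on exactly this
    # split instead of re-splitting internally on every fit.
    X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2)
    train_split = predefined_split(Dataset(X_val, y_val))

    net = NeuralNetRegressor(
        TinyNet,
        max_epochs=50,
        train_split=train_split,
        callbacks=[EarlyStopping(patience=5)],  # or EarlyStoppingMax from this PR
    )
    net.fit(X_train, y_train)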

    from .polynomial_features import PolynomialFeatures


    class PolyMean(gpytorch.means.Mean):
@mastoffel (Collaborator):

Would you mind adding a docstring and tests for this? Feel free to start a new test file; you can run it with pytest tests/test_gp_utils.py

    import torch


    class PolynomialFeatures:
@mastoffel (Collaborator):

Again, having a docstring and tests would be great here.

@MaxBalmus (Collaborator, Author):

Submitted the docstrings, but to be honest I'm not 100% sure how to go about writing the tests.
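
For what it's worth, a minimal sketch of what tests/test_gp_utils.py might contain (the constructor and transform signatures are assumptions, not taken from the PR diff):

    # tests/test_gp_utils.py -- sketch; signatures assumed, values illustrative
    import torch

    from autoemulate.emulators.gaussian_process_utils import PolyMean, PolynomialFeatures


    def test_polymean_output_shape():
        # A mean module should map (n, d) inputs to an (n,) mean vector.
        mean = PolyMean(degree=2, input_size=3)
        x = torch.randn(10, 3)
        assert mean(x).shape == (10,)


    def test_polymean_has_trainable_parameters():
        mean = PolyMean(degree=2, input_size=3)
        assert any(p.requires_grad for p in mean.parameters())


    def test_polynomial_features_degree_two():
        # Degree-2 features of 3 inputs: 3 linear + 6 quadratic monomials.
        pf = PolynomialFeatures(degree=2, input_size=3)
        x = torch.randn(10, 3)
        assert pf.transform(x).shape == (10, 9)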

autoemulate/emulators/gaussian_process_torch.py (outdated comment thread, resolved)
@aranas (Collaborator) commented Sep 18, 2024

4. it'd be great to add docstrings and tests where possible (see below for details)

Re testing, I personally would be very keen on some upskilling in best practices around code testing. There are many resources on this; see here for one developed by the Turing & The Turing Way community: https://book.the-turing-way.org/reproducible-research/testing.

I wonder, @MaxBalmus, might this also be of interest to you and other researchers in TRIC? And @mastoffel, would there be capacity to do a small session on this from the REG side? Apart from general upskilling for TRIC researchers in writing good tests, we could use this as an opportunity to develop a paragraph on contributor requirements around testing for autoemulate specifically. What do you think?

Also to note, there is an upcoming Turing Way Book Dash that could be an opportunity to run this session using (and potentially improving!) The Turing Way resource specifically.

@mastoffel (Collaborator):

@aranas I'm sure there would be capacity if there's interest. Happy to chat about this. There are lots of possibilities here, like an interactive session or just a talk, etc. I haven't read the Turing Way bits on testing yet, but will definitely do so now, thanks for the link!

@mastoffel (Collaborator):

@MaxBalmus let me know if this is ready again to review!

@MaxBalmus (Collaborator, Author):

I didn't get to write the tests that you indicated. Unfortunately, I am a bit pressed for time. Could you have a look into that?

@mastoffel mentioned this pull request on Sep 30, 2024
@mastoffel merged commit f2ee0cb into alan-turing-institute:main on Sep 30, 2024 (4 of 5 checks passed)