
add dynamic hyperparameter scaling #435

Merged: 1 commit merged into main on Jan 17, 2025
Conversation

@GeorgWa (Collaborator) commented on Jan 16, 2025:

Dynamic scaling of batch size and learning rate in the NN classifier.

  • Prevents FDR collapse for small libraries
  • Improves speed for large libraries
  • Overrides the static config of batch size and learning rate
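
For reference, a minimal standalone sketch of the scaling rule this PR adds (a hedged illustration, not the library code itself): batch size grows linearly with the number of samples up to a cap, and the learning rate is scaled by the square root of the batch-size ratio. The constants mirror the defaults of get_scaled_training_params in the diff below.

import numpy as np

base_lr, max_batch, min_batch = 0.001, 1024, 64

for n_samples in (1_000, 250_000, 1_000_000):
    # Linear batch-size scaling, clipped to [min_batch, max_batch]
    batch_size = int(np.clip(max_batch * n_samples / 1_000_000, min_batch, max_batch))
    # Square-root learning-rate scaling relative to the maximum batch size
    learning_rate = base_lr * np.sqrt(batch_size / max_batch)
    print(f"{n_samples:>9,} samples -> batch_size={batch_size:4d}, lr={learning_rate:.2e}")

# Output:
#     1,000 samples -> batch_size=  64, lr=2.50e-04
#   250,000 samples -> batch_size= 256, lr=5.00e-04
# 1,000,000 samples -> batch_size=1024, lr=1.00e-03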

@anna-charlotte (Contributor) left a comment:


lgtm!

@@ -905,6 +908,42 @@ def predict_proba(self, x: np.ndarray):
return self.network(torch.Tensor(x)).detach().numpy()


def get_scaled_training_params(df, base_lr=0.001, max_batch=1024, min_batch=64):

Contributor:

Is it intentional that the max batch_size is not closer to 5000 anymore, as we used it previously?

Collaborator (Author):

This kinda sneaked in here :D
I saw improved performance going down from 5000 to 1000

@anna-charlotte mentioned this pull request on Jan 17, 2025
Base automatically changed from lib-fix to main on January 17, 2025 at 12:28
@GeorgWa merged commit 5f24fa3 into main on Jan 17, 2025 (5 checks passed)
@GeorgWa deleted the dynamic-hyperparameter branch on January 17, 2025 at 12:33
@@ -905,6 +908,42 @@ def predict_proba(self, x: np.ndarray):
return self.network(torch.Tensor(x)).detach().numpy()


def get_scaled_training_params(df, base_lr=0.001, max_batch=1024, min_batch=64):


Simplify the batch size calculation by combining min/max logic into a single line using max() and min() functions instead of np.clip(). This makes the logic more explicit and easier to follow.

def get_scaled_training_params(df, base_lr=0.001, max_batch=1024, min_batch=64):

    n_samples = len(df)

    # For >= 1M samples, use max batch size
    if n_samples >= 1_000_000:
        return max_batch, base_lr

    # Calculate scaled batch size (linear scaling between min and max)
    batch_size = int(max(min(max_batch * n_samples / 1_000_000, max_batch), min_batch))

    # Scale learning rate using square root relationship
    learning_rate = base_lr * np.sqrt(batch_size / max_batch)

    return batch_size, learning_rate
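
As a quick sanity check of the suggested helper (a standalone sketch, not part of the diff): only len(df) is used, so the DataFrame contents are arbitrary, and the outputs line up with the parametrized test cases added further down.

import numpy as np
import pandas as pd

# Dummy single-column frames; only their length matters.
df_small = pd.DataFrame({"dummy": np.zeros(50_000)})
df_medium = pd.DataFrame({"dummy": np.zeros(500_000)})

print(get_scaled_training_params(df_small))   # batch_size 64,  lr 2.5e-04
print(get_scaled_training_params(df_medium))  # batch_size 512, lr ~7.1e-04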

@@ -1066,6 +1110,14 @@ def fit(self, x: np.ndarray, y: np.ndarray):
Target values of shape (n_samples,) or (n_samples, n_classes).

"""
if self.experimental_hyperparameter_tuning:


Use debug-level logging instead of info for the hyperparameter tuning details, since this is diagnostic information. Also simplify the log message formatting.

        if self.experimental_hyperparameter_tuning:
            self.batch_size, self.learning_rate = get_scaled_training_params(x)
            logger.debug(
                f"Using scaled hyperparameters - samples: {len(x):,}, batch_size: {self.batch_size:,}, lr: {self.learning_rate:.2e}"
            )
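
For scale, with the 250,000-sample example from the sketch above this would emit a line roughly like "Using scaled hyperparameters - samples: 250,000, batch_size: 256, lr: 5.00e-04"; because it is logged at DEBUG level, it only appears when verbose logging is enabled.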

@@ -95,7 +95,11 @@
]


Remove explicit batch_size and learning_rate since they will be set automatically by experimental_hyperparameter_tuning. Add clarifying comment.

classifier_base = fdrx.BinaryClassifierLegacyNewBatching(
    test_size=0.001,
    epochs=10,
    experimental_hyperparameter_tuning=True, # Batch size and learning rate will be scaled automatically
)
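
A hedged usage sketch of this configuration (the feature matrix and labels are hypothetical; only the fit(x, y) signature shown in the diff above is assumed):

import numpy as np

# Hypothetical training data: with experimental_hyperparameter_tuning=True,
# fit() derives batch_size and learning_rate from len(x) via
# get_scaled_training_params() instead of using static config values.
x = np.random.rand(250_000, 10).astype(np.float32)   # hypothetical feature matrix
y = np.random.randint(0, 2, size=250_000)             # hypothetical target/decoy labels

classifier_base.fit(x, y)   # scales internally to batch_size=256, lr ~5.0e-04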

@@ -45,3 +47,31 @@ def test_target_decoy_fdr(mock_show):
assert all([col in df.columns for col in ["decoy_proba", "qval", "pep"]])
assert np.all(df[["decoy_proba", "qval", "pep"]].values >= 0)
assert np.all(df[["decoy_proba", "qval", "pep"]].values <= 1)


@pytest.mark.parametrize(


Improve test case clarity by: 1) using the power operator (**) instead of np.sqrt() for more readable learning rate calculations, 2) adding a descriptive comment for each test case, 3) aligning values for better readability.

@pytest.mark.parametrize(
    "n_samples,expected_batch,expected_lr",
    [
        (1_000_000, 1024, 0.001),          # Base case
        (2_000_000, 1024, 0.001),          # Above max samples
        (500_000, 512, 0.001 * (512/1024)**0.5),  # 50% scaling
        (250_000, 256, 0.001 * (256/1024)**0.5),  # 25% scaling
        (50_000, 64, 0.001 * (64/1024)**0.5),     # Min batch size
        (1_000, 64, 0.001 * (64/1024)**0.5),     # Below min batch size
    ],
)
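
The test body itself is collapsed in this view; a plausible completion, assuming module-level imports of numpy, pandas, and pytest and that only len(df) is used (the merged test may differ):

def test_get_scaled_training_params(n_samples, expected_batch, expected_lr):
    # Assumed body: a dummy single-column DataFrame suffices since only its length is used.
    df = pd.DataFrame({"dummy": np.zeros(n_samples)})

    batch_size, learning_rate = get_scaled_training_params(df)

    assert batch_size == expected_batch
    assert learning_rate == pytest.approx(expected_lr)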


@vbrennsteiner (Collaborator) left a comment:


LGTM!

@@ -905,6 +908,42 @@ def predict_proba(self, x: np.ndarray):
return self.network(torch.Tensor(x)).detach().numpy()


def get_scaled_training_params(df, base_lr=0.001, max_batch=1024, min_batch=64):

Collaborator:

Might be a small point, but could we consider renaming 'max_batch' to 'max_batch_size' and 'min_batch' to 'min_batch_size'?

@@ -95,7 +95,11 @@
]

classifier_base = fdrx.BinaryClassifierLegacyNewBatching(
test_size=0.001, batch_size=5000, learning_rate=0.001, epochs=10
test_size=0.001,
batch_size=5000,

Collaborator:

You mention in a previous comment that a batch size closer to 1000 is actually optimal; would it make sense to change this default?

(1_000, 64, 0.001 * np.sqrt(64 / 1024)), # Should hit min batch size
],
)
def test_get_scaled_training_params(n_samples, expected_batch, expected_lr):

Collaborator:

Same again: I would recommend 'expected_batch_size' for clarity.
