Infinite Gradient Handling #582


Open · wants to merge 10 commits into main

Conversation

Aziz-Shameem

Added support for infinite gradient check as per this

Note that the method added also checks for NaN values in the gradients along with infinite values, since both lead to the same errors during optimization and are therefore related.
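As a side note on why the two cases can share one check: NumPy's `np.isfinite` is `False` for both `inf` and `nan`, so a single mask covers both failure modes (a minimal illustration, not the PR's actual code):

```python
import numpy as np

# A gradient containing both failure modes.
grad = np.array([1.0, np.inf, np.nan])

# np.isfinite is False for inf and for nan, so one mask flags both.
invalid = ~np.isfinite(grad)
print(invalid.tolist())  # [False, True, True]
```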

Let me know if any further changes/polishing is required. All the CI tests seem to pass.

Thanks

Member

@timmens timmens left a comment


Thanks for the PR!

This looks pretty good already. I have added a few review comments to your changes.

Additionally: Can you create a new test file tests/optimagic/optimization/test_invalid_function_value.py, and add two test cases where you check the cases of

  1. Infinite values and
  2. NaN values

These tests should fail without your changes, and pass with your changes; i.e., you will have to check that your code raises the correct error! We have many such tests in our test suite -- I would just search for "pytest.raises".
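For anyone unfamiliar with the pattern being referenced: `pytest.raises` asserts that a call raises a given exception, and its `match` argument checks the error message against a regex. A toy example of the shape of such a test (unrelated to optimagic; the full proposal for this PR appears further down):

```python
import pytest


def divide(a, b):
    if b == 0:
        raise ZeroDivisionError("division by zero")
    return a / b


def test_divide_raises():
    # match checks that the error message contains the given pattern
    with pytest.raises(ZeroDivisionError, match="division by zero"):
        divide(1, 0)
```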

Thanks again and let me know if you have any questions!

@timmens
Member

timmens commented Apr 28, 2025

Hey @Aziz-Shameem!

Just checking in — do you have a sense of the timeline for finishing up this PR? No rush if you're still working on it; just want to plan ahead a bit.

Thanks a lot!

@Aziz-Shameem
Author

Hey @timmens

Really sorry for the delay. I was at ICLR, which was taking up most of my time during the day.
Now that the conference has ended, I should finish this up in a day or two at most.

I also wanted to see if I could tackle the second part of this PR (which deals with the error handling). I want to take that up after I submit this one. Do you think that will be feasible? @janosg did mention it requires more knowledge of the internals, so I will have to read up for it.

Thanks.

@janosg
Member

janosg commented Apr 28, 2025

@Aziz-Shameem let's first finish this PR and then open a separate one to implement the penalty approach instead of throwing the error.

@Aziz-Shameem
Author

Aziz-Shameem commented Apr 29, 2025

Hey @timmens, @janosg
I have made the changes and also added several tests for many input combinations.
This does not currently handle numerical derivatives correctly, even though it checks for inf/NaN values in the Jacobian while evaluating numerical derivatives.

I changed the error message to make it more informative. It now reads as follows:
[screenshot of the new error message]

@Aziz-Shameem
Author

For handling such cases when using numerical derivatives:
I noticed that assert_finite_jac() already catches a portion of the cases by itself. The others seem to be like the example attached below:
[screenshot of an example case]

Maybe we could fix a certain (±)THRESHOLD value, flag the gradient as infinite when one of its elements crosses it, and throw the same error as in the cases where jac is provided explicitly.
If so, this could be added inside this check function itself.
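A rough sketch of that threshold idea (the name `THRESHOLD`, its value, and the helper function are all hypothetical, not optimagic API; the cutoff would need tuning):

```python
import numpy as np

# Hypothetical cutoff; the actual value would need tuning against real cases.
THRESHOLD = 1e15


def flag_effectively_infinite(jac, threshold=THRESHOLD):
    """Return True if any Jacobian entry is non-finite or exceeds the cutoff.

    Treats very large finite values produced by numerical differentiation
    the same as explicit inf/NaN entries.
    """
    arr = np.asarray(jac, dtype=float)
    return bool(np.any(~np.isfinite(arr)) or np.any(np.abs(arr) > threshold))


print(flag_effectively_infinite([1.0, 2.0]))     # False
print(flag_effectively_infinite([1.0, 1e20]))    # True
print(flag_effectively_infinite([np.nan, 0.0]))  # True
```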

Member

@timmens timmens left a comment


Thanks a lot for the changes, it looks much better now! There are a few things to consider before going forward though:

Feature:

  • Since _assert_finite_jac does not depend on self, I'd make this a regular function, just like _process_fun_value below.
  • For the error message to be more informative, I think we can go one step further.
    • I would add a keyword argument origin: Literal["numerical", "jac", "fun_and_jac"]. I.e. you write params: PyTree, *, origin: ....
    • For the error message you can then make the distinction whether the error occurred in a user provided derivative (jac or fun_and_jac case), or whether it occurred during a numerical derivative, in which case the user needs to check the criterion function fun.
    • The resulting errors could look like this:
      • The optimization failed because the derivative (via jac) contains infinite or NaN values. Please validate the derivative function.
      • The optimization failed because the derivative (via fun_and_jac) contains infinite or NaN values. Please validate the derivative function.
      • The optimization failed because the numerical derivative (computed using fun) contains infinite or NaN values. Please validate the criterion function.
    • Of course, you would still also print the parameter and Jacobian values.
    • Make sure to consistently capitalize Jacobian!

Tests:

You have many tests for different combinations, which is great! However, there is a lot of code duplication, and in this case one general PyTree parameter should be enough. I have added a proposal below. If you customize the error message per origin, then ideally, you should adjust the message in the tests as well, so that we can additionally test that the correct error message is thrown.

Proposal:

import numpy as np
import pytest

from optimagic.exceptions import UserFunctionRuntimeError
from optimagic.optimization.optimize import minimize


# ======================================================================================
# Test setup:
# --------------------------------------------------------------------------------------
# We test that minimize raises an error if the user function returns a jacobian
# containing invalid values (e.g. np.inf, np.nan). To test that this works not only at
# the start parameters, we create jac functions that return invalid values if the
# parameter norm becomes smaller than 1. For this test, we assume the following
# parameter structure: {"a": 1, "b": np.array([2, 3])}
# ======================================================================================

def sphere(params):
    return params["a"] ** 2 + (params["b"] ** 2).sum()


def sphere_gradient(params):
    return {"a": 2 * params["a"], "b": 2 * params["b"]}


def sphere_and_gradient(params):
    return sphere(params), sphere_gradient(params)
        

def params_norm(params):
    squared_norm = (
        params["a"] ** 2 + np.linalg.norm(params["b"]) ** 2
    )
    return np.sqrt(squared_norm)


def get_invalid_jac(invalid_jac_value):
    """Get function that returns invalid jac if the parameter norm < 1."""

    def jac(params):
        if params_norm(params) < 1:
            return invalid_jac_value
        else:
            return sphere_gradient(params)
        
    return jac

def get_invalid_fun_and_jac(invalid_jac_value):
    """Get function that returns invalid fun and jac if the parameter norm < 1."""

    def fun_and_jac(params):
        if params_norm(params) < 1:
            return sphere(params), invalid_jac_value
        else:
            return sphere_and_gradient(params)

    return fun_and_jac

# ======================================================================================
# Tests
# ======================================================================================


INVALID_JACOBIAN_VALUES = [
    {"a": np.inf, "b": 2 * np.array([1, 2])},
    {"a": 1, "b": 2 * np.array([np.inf, 2])},
    {"a": np.nan, "b": 2 * np.array([1, 2])},
    {"a": 1, "b": 2 * np.array([np.nan, 2])},
]


@pytest.mark.parametrize("invalid_jac_value", INVALID_JACOBIAN_VALUES)
def test_minimize_with_invalid_jac(invalid_jac_value):
    params = {
        "a": 1,
        "b": np.array([3, 4]),
    }

    with pytest.raises(
        UserFunctionRuntimeError,
        match="The optimization received Jacobian containing infinite"
    ):
        minimize(
            fun=sphere,
            params=params,
            algorithm="scipy_lbfgsb",
            jac=get_invalid_jac(invalid_jac_value),
        )


@pytest.mark.parametrize("invalid_jac_value", INVALID_JACOBIAN_VALUES)
def test_minimize_with_invalid_fun_and_jac(invalid_jac_value):
    params = {
        "a": 1,
        "b": np.array([3, 4]),
    }

    with pytest.raises(
        UserFunctionRuntimeError,
        match="The optimization received Jacobian containing infinite"
    ):
        minimize(
            params=params,
            algorithm="scipy_lbfgsb",
            fun_and_jac=get_invalid_fun_and_jac(invalid_jac_value),
        )

Comment on lines +14 to +17
from optimagic.exceptions import (
UserFunctionRuntimeError,
get_traceback,
)
Member


Suggested change
from optimagic.exceptions import (
UserFunctionRuntimeError,
get_traceback,
)
from optimagic.exceptions import UserFunctionRuntimeError, get_traceback

There is no change here anymore.
