Add an example of Wilcoxon pruner #238
Merged
Conversation
eukaryo commented Feb 26, 2024
@contramundum53 Could you review this PR?

eukaryo commented Feb 28, 2024
@eukaryo I made the following changes:

import math
import sys
from dataclasses import dataclass

import numpy as np
from numpy.linalg import norm

import optuna

np.random.seed(0)

@dataclass
class SAOptions:
    max_iter: int = 10000
    T0: float = 1.0
    alpha: float = 2.0
    patience: int = 50
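
# Total closed-tour length: np.roll(idxs, 1) pairs each vertex with its
# predecessor, so the wrap-around edge back to the start is included.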
def tsp_cost(vertices: np.ndarray, idxs: np.ndarray) -> float:
    return norm(vertices[idxs] - vertices[np.roll(idxs, 1)], axis=-1).sum()

# Greedy solution for initial guess.
def tsp_greedy(vertices: np.ndarray) -> np.ndarray:
    idxs = [0]
    for _ in range(len(vertices) - 1):
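        # Distance from the last chosen vertex to every vertex; vertices
        # already on the tour are masked out with infinity below.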
        dists_from_last = norm(vertices[idxs[-1], None] - vertices, axis=-1)
        dists_from_last[idxs] = np.inf
        idxs.append(np.argmin(dists_from_last))
    return np.array(idxs)

# A minimal implementation of a TSP solver using simulated annealing
# on 2-opt neighbors.
def tsp_simulated_annealing(vertices: np.ndarray, options: SAOptions) -> np.ndarray:
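    # Cooling schedule: the temperature decays polynomially from T0 toward 0;
    # a larger alpha cools faster.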
    def temperature(t: float):
        # t: 0 ... 1
        return options.T0 * (1 - t) ** options.alpha

    N = len(vertices)
    idxs = tsp_greedy(vertices)
    cost = tsp_cost(vertices, idxs)
    best_idxs = idxs.copy()
    best_cost = cost
    remaining_patience = options.patience

    for iter in range(options.max_iter):
        i = np.random.randint(0, N)
        j = (i + 2 + np.random.randint(0, N - 3)) % N
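        # j is chosen so that the removed edges (i, i+1) and (j, j+1) share
        # no vertex, which keeps the 2-opt reversal well-defined.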
        i, j = min(i, j), max(i, j)

        # Reverse the order of the vertices in the range [i+1, j]
        # and compute the cost difference of the 2-opt reversal.
        delta_cost = (
            -norm(vertices[idxs[(i + 1) % N]] - vertices[idxs[i]])
            - norm(vertices[idxs[j]] - vertices[idxs[(j + 1) % N]])
            + norm(vertices[idxs[i]] - vertices[idxs[j]])
            + norm(vertices[idxs[(i + 1) % N]] - vertices[idxs[(j + 1) % N]])
        )
        temp = temperature(iter / options.max_iter)
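        # Metropolis criterion: always accept improving moves, and accept
        # worsening moves with probability exp(-delta_cost / temp).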
        if delta_cost <= 0.0 or np.random.random() < math.exp(-delta_cost / temp):
            # Accept the 2-opt reversal.
            cost += delta_cost
            idxs[i + 1 : j + 1] = idxs[i + 1 : j + 1][::-1]
            if cost < best_cost:
                best_idxs[:] = idxs
                best_cost = cost
                remaining_patience = options.patience

        if cost > best_cost:
            # If the best solution is not updated for "patience" iterations,
            # restart from the best solution.
            remaining_patience -= 1
            if remaining_patience == 0:
                idxs[:] = best_idxs
                cost = best_cost
                remaining_patience = options.patience

    return best_idxs

def make_dataset(num_vertex: int, num_problem: int) -> np.ndarray:
    return np.random.random((num_problem, num_vertex, 2))


dataset = make_dataset(
    num_vertex=100,
    num_problem=50,
)

N_TRIALS = 50

# We set a very small number of SA iterations for demonstration purposes.
# In practice, you should use a larger number of iterations.
N_SA_ITER = 10000
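
# Counts the total number of instance evaluations, to show how much work
# pruning saves (reported at the end of the run).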
count = 0


def objective(trial: optuna.Trial) -> float:
    global count
    options = SAOptions(
        max_iter=N_SA_ITER,
        T0=trial.suggest_float("T0", 0.01, 10.0, log=True),
        alpha=trial.suggest_float("alpha", 1.0, 10.0, log=True),
        patience=trial.suggest_int("patience", 10, 1000, log=True),
    )
    results = []

    # For best results, shuffle the evaluation order in each trial.
    ordering = np.random.permutation(len(dataset))
    for i in ordering:
        count += 1
        result_idxs = tsp_simulated_annealing(vertices=dataset[i], options=options)
        result_cost = tsp_cost(dataset[i], result_idxs)
        results.append(result_cost)
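        # Report the cost for this instance, keyed by the instance index i;
        # the WilcoxonPruner pairs values with the best trial's values by step.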
        trial.report(result_cost, i)
        if trial.should_prune():
            print(f"[{trial.number}] Pruned at {len(results)}/{len(dataset)}", file=sys.stderr)
            # raise optuna.TrialPruned()

            # Return the current predicted value when pruned.
            # This is a workaround for the problem that the current TPE sampler
            # cannot utilize pruned trials effectively.
            return sum(results) / len(results)

    print(f"[{trial.number}] Not pruned ({len(results)}/{len(dataset)})", file=sys.stderr)
    return sum(results) / len(results)

if __name__ == "__main__":
    sampler = optuna.samplers.TPESampler(seed=1)
    pruner = optuna.pruners.WilcoxonPruner(p_threshold=0.1)
    study = optuna.create_study(direction="minimize", sampler=sampler, pruner=pruner)
    study.enqueue_trial({"T0": 1.0, "alpha": 2.0, "patience": 50})  # default params
    study.optimize(objective, n_trials=N_TRIALS)

    print(f"The number of trials: {len(study.trials)}")
    print(f"Best value: {study.best_value} (params: {study.best_params})")
    print(f"Number of evaluations: {count} / {N_TRIALS * len(dataset)}")
contramundum53 approved these changes Feb 29, 2024
LGTM!
Motivation
I want to add an example using the Wilcoxon pruner.
Description of the changes
Added an example of the Wilcoxon pruner. In this example, Optuna optimizes the parameters of simulated annealing, which solves a random dataset of traveling salesman problems.
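
In outline, the example follows the standard Optuna pruning loop, reporting one value per problem instance. Here is a minimal sketch of that pattern; the problems list and the per-instance cost are illustrative placeholders, and this sketch raises TrialPruned on pruning instead of the averaging workaround used in the example above:

import optuna

problems = [0.1, 0.5, 0.9]  # placeholder problem instances

def objective(trial: optuna.Trial) -> float:
    x = trial.suggest_float("x", -1.0, 1.0)
    scores = []
    for instance_id, problem in enumerate(problems):
        score = (x - problem) ** 2  # placeholder per-instance cost
        scores.append(score)
        # Report each instance's score, keyed by its instance id.
        trial.report(score, instance_id)
        if trial.should_prune():
            raise optuna.TrialPruned()
    return sum(scores) / len(scores)

study = optuna.create_study(
    direction="minimize",
    pruner=optuna.pruners.WilcoxonPruner(p_threshold=0.1),
)
study.optimize(objective, n_trials=50)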