Address some predict/transform type instabilities #969

Merged: 2 commits into dev from predict-type-instability on Apr 24, 2024

Conversation

@ablaom (Member) commented Apr 8, 2024

(edited) This PR addresses a type instability for operations (predict, transform, etc.) acting on machines, as identified in #959 (although this PR does not resolve the particular issue reported there).

I admit it is not clear to me that the performance gains here will significantly benefit many use cases, but having done the work to identify these instabilities, I see no harm in addressing them.

The type instability is not difficult to address for machines attached to ordinary models: it suffices to add a concrete annotation to a field of the Machine struct whose type is currently abstract. However, in the special case of a machine attached to a symbolic model (such models appear exclusively in learning networks), the type instability remains, and looks difficult to remove.
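For illustration, here is a minimal, hypothetical sketch (not the actual Machine definition; all names below are made up) of why the annotation matters: a field typed with an abstract supertype blocks inference at the call site, while carrying the concrete model type as a struct parameter lets the call be fully inferred.

abstract type SomeModel end

struct ToyModel <: SomeModel
    coef::Float64
end

toy_predict(model::ToyModel, x) = model.coef .* x

# Unstable: the field has an abstract type, so calls through `mach.model`
# dispatch dynamically and the return type cannot be inferred.
struct LooseMachine
    model::SomeModel
end

# Stable: the concrete model type is carried as a type parameter, so the
# same call is fully inferred.
struct TightMachine{M<:SomeModel}
    model::M
end

predict_via(mach, x) = toy_predict(mach.model, x)

# Compare inference with, e.g.:
#   @code_warntype predict_via(LooseMachine(ToyModel(2.0)), [1.0, 2.0])
#   @code_warntype predict_via(TightMachine(ToyModel(2.0)), [1.0, 2.0])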

Benchmarks

For 69 regression models, we compared the "high level" predict(::Machine, ...) method with the "low level" predict(::Model, ...) method (edit: plus reformat(::Model, ...)) implemented by third-party model providers. The benchmark code is given below:

using MLJTestInterface, MLJModels, MLJBase
using Tables
using Random
using BenchmarkTools
using Statistics
import DataFrames
import MLJModelInterface as MMI

# Select all supervised models with a continuous target (excluding MLJText):
const MODELS = models() do m
    !(m.package_name in ["MLJText"]) &&
        AbstractVector{Continuous} <: m.target_scitype &&
        m.is_supervised
end

# This is a way to load all needed model code:
MLJTestInterface.test(MODELS, mod=@__MODULE__, level=1, throw=true)

rng = Random.MersenneTwister(0)
Xmat = randn(rng, 30, 3)
X = Tables.table(Xmat)
y = @. cos(Xmat[:, 1] * 2.1 - 0.9) * Xmat[:, 2] - Xmat[:, 3]

# The low-level path: reformat the data and call `predict` directly on the model:
function predict_low(model, fitresult, X)
    Xraw = MMI.reformat(model, X)
    MMI.predict(model, fitresult, Xraw...)
end

# Benchmark the high-level (machine) and low-level (model) paths for each
# model, recording the ratios of median times and of allocation counts:
stats = []
for m in MODELS
    print("\rBenchmarking $(m.name) $(m.package_name).")
    model = eval(:(@load $(m.name) pkg=$(m.package_name) verbosity=0))()
    mach = machine(model, X, y)
    fit!(mach, verbosity=0)
    fitresult = mach.fitresult
    b_high = @benchmark predict($mach, $X)
    b_low = @benchmark predict_low($model, $fitresult, $X)
    slow_down = median(b_high.times)/median(b_low.times)
    bloat = b_high.allocs/b_low.allocs
    push!(stats, (; model=m.name, pkg=m.package_name, slow_down, bloat))
    print(" Done.                           ")
end

@show length(MODELS)
#length(MODELS) = 69

stats = DataFrames.DataFrame([stats...])
filter(stats) do row
    row.slow_down > 1.75 || row.bloat > 2.0
end

In the tables below:

  • "slow_down" is the ratio of elapsed time of "high" to "low"
  • "bloat" is the ratio of number of allocations of "high" to "low"

Only models with slow_down > 1.75 or bloat > 2 are reported.

Before this PR:

#  Row │ model                           pkg                           slow_down  bloat
#      │ String                          String                        Float64    Float64
# ─────┼──────────────────────────────────────────────────────────────────────────────────
#    1 │ ConstantRegressor               MLJModels                       9.67007    4.0
#    2 │ DeterministicConstantRegressor  MLJModels                      14.0756     4.0
#    3 │ ElasticNetRegressor             MLJLinearModels                 4.41319    2.0
#    4 │ HuberRegressor                  MLJLinearModels                 3.93931    2.0
#    5 │ LADRegressor                    MLJLinearModels                 3.97205    2.0
#    6 │ LassoRegressor                  MLJLinearModels                 4.20191    2.0
#    7 │ LinearRegressor                 MLJLinearModels                 3.95137    2.0
#    8 │ LinearRegressor                 MultivariateStats               4.92059    2.5
#    9 │ PLSRegressor                    PartialLeastSquaresRegressor    2.11492    1.375
#   10 │ QuantileRegressor               MLJLinearModels                 4.15291    2.0
#   11 │ RidgeRegressor                  MLJLinearModels                 3.99158    2.0
#   12 │ RidgeRegressor                  MultivariateStats               5.03699    2.5
#   13 │ RobustRegressor                 MLJLinearModels                 3.95992    2.0

After this PR:

#  Row │ model              pkg        slow_down  bloat
#      │ String             String     Float64    Float64
# ─────┼──────────────────────────────────────────────────
#    1 │ ConstantRegressor  MLJModels    1.61401      3.0

Note that machines serialised using #master cannot be deserialised after this PR, but I don't consider that this warrants a breaking release.

To do:

  • Run MLJ tests with integration tests switched on

@ablaom ablaom marked this pull request as draft April 8, 2024 21:07
@ablaom ablaom requested a review from OkonSamuel April 8, 2024 21:08
@ablaom ablaom marked this pull request as ready for review April 10, 2024 00:29
@OkonSamuel (Member) left a comment


Looks good to me!
@ablaom I think we should benchmark the compile time for this package with vs. without this PR.
But in general, I think the run-time performance gains we would get from this should outweigh any added compile time, because these operations (e.g. predict) are generally expected to be quite expensive.

@ablaom (Member, Author) commented Apr 24, 2024

Doesn't look like there's a significant difference.

Before this PR:

julia> @time_imports import MLJBase
    413.2 ms  MLJBase 23.21% compilation time

After this PR:

julia> @time_imports import MLJBase
    437.3 ms  MLJBase 22.21% compilation time

@ablaom ablaom merged commit 6e77d6a into dev Apr 24, 2024
3 checks passed
@ablaom ablaom deleted the predict-type-instability branch April 24, 2024 00:04