Add option to throw error on passing wrong precision floats to layers #2454

Open
lassepe opened this issue Jun 7, 2024 · 5 comments

@lassepe
Contributor

lassepe commented Jun 7, 2024

Motivation and description

The warning about wrong precision is very helpful for pointing at potential performance issues:

@warn "Layer with Float32 parameters got Float64 input."

I think that this is the correct default behavior. However, in order to find out where the problem is coming from, throwing an error to produce a stacktrace would be very helpful.
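For context, a minimal way to reproduce the warning (assuming a recent Flux version, where Dense parameters default to Float32):

using Flux

model = Dense(3 => 2)   # parameters are Float32 by default
x = rand(3)             # rand gives a Float64 vector
y = model(x)            # emits the Float32/Float64 mismatch warning quoted above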

Possible Implementation

There could be a Preference or global flag that allows switching to errors instead of warnings for wrong-precision inputs. This would also be consistent with CUDA.allowscalar.
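A rough sketch of what such a flag could look like (all names below are invented for illustration, not existing Flux API; a Preferences.jl entry could seed the flag's initial value):

const STRICT_PRECISION = Ref(false)   # hypothetical global switch

# Invented toggle: allow_mismatched_precision(false) makes the check throw.
allow_mismatched_precision(flag::Bool) = (STRICT_PRECISION[] = !flag)

function precision_check(layer_eltype::Type, input_eltype::Type)
    if layer_eltype === Float32 && input_eltype === Float64
        msg = "Layer with Float32 parameters got Float64 input."
        if STRICT_PRECISION[]
            error(msg)          # an error gives a stacktrace pointing at the offending call
        else
            @warn msg maxlog=1  # current behaviour: warn once
        end
    end
    return nothing
end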

@mcabbott
Member

This seems like a good idea.

Maybe it should literally be the same switch as CUDA.allowscalar, to avoid introducing ever more functions you have to know about? It's not exactly the same meaning, but it's likely to be used at the same time.

That function belongs to GPUArraysCore, which Flux doesn't directly load right now, but NNlib does.

@darsnack
Member

When you say "same switch" do you mean defining something like Flux.throw_sanity_check_errors(), which flips the precision switch requested in the OP as well as turning off CUDA.allowscalar? Or do you mean that if CUDA.allowscalar is false, then we throw errors for precision? If it's the latter, that seems wrong to me, since scalar indexing and floating point precision are totally unrelated.

@lassepe
Contributor Author

lassepe commented Jun 12, 2024

I also think it makes sense to have a separate function for each of these checks. But having a Flux.enable_all_sanity_check_errors() on top of that is good for discoverability (since its docstring can also point to the relevant sub-functions).
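Something like the following umbrella function, for illustration (names invented, and it assumes the hypothetical precision switch sketched earlier in this thread):

using GPUArraysCore  # provides the existing allowscalar switch

# Invented entry point: flips every individual sanity check to error mode;
# its docstring would point readers at each underlying switch.
function enable_all_sanity_check_errors()
    allow_mismatched_precision(false)  # hypothetical precision switch
    GPUArraysCore.allowscalar(false)   # existing scalar-indexing switch
    return nothing
end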

@mcabbott
Member

> seems wrong to me since scalar indexing and floating point precision are totally unrelated

This was my suggestion! Float precision is a big deal on GPU and not otherwise. There are basically two modes of using it:

  • You just load CUDA, and most things work (but may be super-slow & give warnings). Probably you want friendly warnings from Flux too.
  • Or you say CUDA.allowscalar(false) and then things will either be fast, or else give an error. You do this once you have it basically working. Probably you want Flux to also tell you if you are doing anything slow?

Of course we could invent some new switch that we own to control this. But then it's one more mysterious function you have to know about. One more kind of mutable state.

@mcabbott
Member

mcabbott commented Nov 5, 2024

Note that CUDA.jl has switched the default to disallow scalar access.

Maybe that means using the same switch is a worse idea. So if we own a switch, what's a good name for it?

julia> using CUDA

julia> first(CUDA.randn(32))
ERROR: Scalar indexing is disallowed.
Invocation of getindex resulted in scalar indexing of a GPU array.
...

julia> CUDA.allowscalar(true)
┌ Warning: It's not recommended to use allowscalar([true]) to allow scalar indexing.
│ Instead, use `allowscalar() do end` or `@allowscalar` to denote exactly which operations can use scalar operations.
└ @ GPUArraysCore ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:188

julia> first(CUDA.randn(32))
0.37104183f0

(@v1.11) pkg> st CUDA
Status `~/.julia/environments/v1.11/Project.toml`
  [052768ef] CUDA v5.5.2
