feat: factorizations #1234


Draft: avik-pal wants to merge 9 commits into main from ap/fact

Conversation

@avik-pal (Collaborator) commented May 1, 2025

No description provided.

avik-pal marked this pull request as draft on May 1, 2025 at 23:17
@avik-pal (Collaborator, Author) commented May 1, 2025

Current status:

```julia
julia> @jit lu(Reactant.to_rarray(rand(4, 4)))
malloc(): invalid size (unsorted)

[29206] signal 6 (-6): Aborted
in expression starting at REPL[13]:1
```
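For context, the factorization being lowered here follows the LAPACK `getrf` convention: an in-place LU with partial pivoting that returns the packed L/U factors plus a pivot vector. A minimal NumPy sketch of those semantics (the function name and the 0-based pivot convention are illustrative only, not the PR's API):

```python
import numpy as np

def getrf(a):
    """LAPACK-getrf-style LU with partial pivoting (illustrative sketch).

    Returns (lu, piv): `lu` packs L (unit lower triangle, below the
    diagonal) and U (upper triangle, including the diagonal); piv[k]
    is the 0-based index of the row swapped with row k at step k.
    """
    a = np.array(a, dtype=float)
    n = a.shape[0]
    piv = np.zeros(n, dtype=int)
    for k in range(n):
        # pick the largest remaining entry in column k as the pivot
        p = k + int(np.argmax(np.abs(a[k:, k])))
        piv[k] = p
        a[[k, p]] = a[[p, k]]              # row swap
        if a[k, k] != 0.0:
            a[k + 1:, k] /= a[k, k]        # store L multipliers in place
            a[k + 1:, k + 1:] -= np.outer(a[k + 1:, k], a[k, k + 1:])
    return a, piv
```

Applying the recorded swaps to the input and comparing `L @ U` against the permuted matrix recovers the usual `P A = L U` identity.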

avik-pal force-pushed the ap/fact branch 2 times, most recently from 539a485 to e48006a on May 14, 2025 at 20:15
@avik-pal (Collaborator, Author) commented:
Well, crap. I forgot to link the custom_call stuff from jaxlib:

```
┌ Error: Compilation failed, MLIR module written to /tmp/reactant_iKiI73/module_000_reactant_lu_post_xla_compile.mlir
└ @ Reactant.MLIR.IR /mnt/software/lux/Reactant.jl/src/mlir/IR/Pass.jl:116
ERROR: module @reactant_lu attributes {mhlo.num_partitions = 1 : i64, mhlo.num_replicas = 1 : i64} {
  func.func @main(%arg0: tensor<3x3x4xf64>) -> (tensor<3x3x4xf64>, tensor<3x4xi32>, tensor<3x4xi32>, tensor<4xi32>) {
    %0 = mhlo.constant dense<1> : tensor<4x3xi32>
    %1 = "mhlo.transpose"(%arg0) <{permutation = dense<[2, 1, 0]> : tensor<3xi64>}> : (tensor<3x3x4xf64>) -> tensor<4x3x3xf64>
    %2:3 = mhlo.custom_call @cusolver_getrf_ffi(%1) {operand_layouts = [dense<[1, 2, 0]> : tensor<3xindex>], output_operand_aliases = [#mhlo.output_operand_alias<output_tuple_indices = [0], operand_index = 0, operand_tuple_indices = []>], result_layouts = [dense<[1, 2, 0]> : tensor<3xindex>, dense<[1, 0]> : tensor<2xindex>, dense<0> : tensor<1xindex>]} : (tensor<4x3x3xf64>) -> (tensor<4x3x3xf64>, tensor<4x3xi32>, tensor<4xi32>)
    %3 = mhlo.subtract %2#1, %0 : tensor<4x3xi32>
    %4 = mhlo.custom_call @cu_lu_pivots_to_permutation(%3) : (tensor<4x3xi32>) -> tensor<4x3xi32>
    %5 = mhlo.add %4, %0 : tensor<4x3xi32>
    %6 = "mhlo.transpose"(%2#0) <{permutation = dense<[2, 1, 0]> : tensor<3xi64>}> : (tensor<4x3x3xf64>) -> tensor<3x3x4xf64>
    %7 = "mhlo.transpose"(%2#1) <{permutation = dense<[1, 0]> : tensor<2xi64>}> : (tensor<4x3xi32>) -> tensor<3x4xi32>
    %8 = "mhlo.transpose"(%5) <{permutation = dense<[1, 0]> : tensor<2xi64>}> : (tensor<4x3xi32>) -> tensor<3x4xi32>
    return %6, %7, %8, %2#2 : tensor<3x3x4xf64>, tensor<3x4xi32>, tensor<3x4xi32>, tensor<4xi32>
  }
}
UNIMPLEMENTED: No registered implementation for custom call to cusolver_getrf_ffi for platform CUDA
```
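A note on the pivot arithmetic visible in the IR: cusolver returns 1-based pivot indices, so the module subtracts the constant `1`, calls `@cu_lu_pivots_to_permutation` to turn the 0-based pivot sequence into a row permutation, then adds `1` back for Julia's 1-based indexing. A sketch of what such a pivots-to-permutation kernel computes per batch element (the function name and signature below are illustrative, not the kernel's actual interface):

```python
def lu_pivots_to_permutation(pivots, n=None):
    """Convert a 0-based getrf pivot sequence into a row permutation.

    Illustrative sketch: pivots[i] is the row swapped with row i during
    elimination; replaying the swaps on the identity yields the permutation.
    """
    n = len(pivots) if n is None else n
    perm = list(range(n))
    for i, j in enumerate(pivots):
        perm[i], perm[j] = perm[j], perm[i]
    return perm
```

For example, the pivot sequence `[2, 1, 2]` (0-based) replays to the permutation `[2, 1, 0]`.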

@wsmoses (Member) commented May 15, 2025

Another day, another jll

3 participants