
Refactor/cubecl fusion #2815

Merged: 19 commits into main from refactor/cubecl-fusion, Feb 17, 2025
Conversation

@nathanielsimard (Member) commented Feb 14, 2025

Improve Burn's compilation time significantly :)

  • Moved backend re-exports to the root burn crate, so we don't need to recompile burn-core when working on a backend (better caching).
  • Created a new crate, burn-cubecl-fusion, that exports all optimizations used by burn-cubecl when fusion is activated.
  • Refactored all fusion optimizations to use a dynamic element type, avoiding Rust generics.

Overall, the fusion part of burn-cubecl went from 41s of compilation time down to 6.5s! burn-cubecl itself now takes around 10s and compiles in parallel with burn-cubecl-fusion.
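As a rough sketch of the first bullet (hypothetical; the exact feature names and re-export paths in the burn crate may differ), moving backend re-exports to the root crate means the backends depend on burn-core rather than the other way around, so touching a backend no longer invalidates burn-core's build cache:

    // crates/burn/src/lib.rs (hypothetical sketch)
    // Each backend is re-exported behind a feature flag. The backend
    // crates depend on burn-core, so editing one of them recompiles
    // only that backend and the thin root crate, not burn-core.
    #[cfg(feature = "wgpu")]
    pub use burn_wgpu as wgpu;

    #[cfg(feature = "ndarray")]
    pub use burn_ndarray as ndarray;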

@laggui (Member) left a comment:

Apart from what was discussed offline, just some minor comments.

Approving in advance.

Review comment (Member):

Any reason why this is alone in a new file? Setting the table for incoming stuff? 👀

We can probably remove this file otherwise

Comment on lines 156 to 182
// #[derive(CubeLaunch, Default)]
// /// Global arguments that are used for fusing [element wise operations](ElemwiseOp).
// pub struct GlobalArgs {
//     pub t_f32: Sequence<Tensor<Line<f32>>>,
//     pub t_f16: Sequence<Tensor<Line<f16>>>,
//     pub t_bf16: Sequence<Tensor<Line<bf16>>>,
//     pub t_i64: Sequence<Tensor<Line<i64>>>,
//     pub t_i32: Sequence<Tensor<Line<i32>>>,
//     pub t_i16: Sequence<Tensor<Line<i16>>>,
//     pub t_i8: Sequence<Tensor<Line<i8>>>,
//     pub t_u64: Sequence<Tensor<Line<u64>>>,
//     pub t_u32: Sequence<Tensor<Line<u32>>>,
//     pub t_u16: Sequence<Tensor<Line<u16>>>,
//     pub t_u8: Sequence<Tensor<Line<u8>>>,
//     pub s_f32: Sequence<f32>,
//     pub s_f16: Sequence<f16>,
//     pub s_bf16: Sequence<bf16>,
//     pub s_i64: Sequence<i64>,
//     pub s_i32: Sequence<i32>,
//     pub s_i16: Sequence<i16>,
//     pub s_i8: Sequence<i8>,
//     pub s_u64: Sequence<u64>,
//     pub s_u32: Sequence<u32>,
//     pub s_u16: Sequence<u16>,
//     pub s_u8: Sequence<u8>,
// }

Review comment (Member):

Dead code

Comment on lines 201 to 228
// impl<R: Runtime> Default for GlobalArgsLaunch<'_, R> {
//     fn default() -> Self {
//         Self::new(
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//             Default::default(),
//         )
//     }
// }

Review comment (Member):

Dead code

Comment on lines +186 to +187
pub tensors: Sequence<GlobalTensor>,
pub scalars: Sequence<GlobalScalar>,

Review comment (Member):

That's clean 👌
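
These two fields are where the dynamic element type refactor shows up: a single sequence of type-erased tensors and scalars replaces the per-type sequences of the old GlobalArgs quoted above. A minimal sketch of what such a type-erased scalar could look like (the enum layout is an assumption; the actual GlobalScalar in burn-cubecl-fusion may differ):

    // Hypothetical sketch: one enum covers every element type, so
    // GlobalArgs needs a single Sequence<GlobalScalar> instead of the
    // eleven s_* fields in the dead code flagged above, and adding an
    // element type no longer grows the struct or the kernel signature.
    use half::{bf16, f16};

    pub enum GlobalScalar {
        F32(f32),
        F16(f16),
        BF16(bf16),
        I64(i64),
        I32(i32),
        I16(i16),
        I8(i8),
        U64(u64),
        U32(u32),
        U16(u16),
        U8(u8),
    }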


codecov bot commented Feb 17, 2025

Codecov Report

Attention: Patch coverage is 47.34940% with 874 lines in your changes missing coverage. Please review.

Project coverage is 82.23%. Comparing base (136eeb6) to head (bbcb438).
Report is 1 commit behind head on main.

Files with missing lines                                 Patch %   Missing lines
crates/burn-cubecl-fusion/src/on_write/kernel.rs          14.68%             430
crates/burn-cubecl-fusion/src/on_write/io.rs              54.22%             157
crates/burn-cubecl-fusion/src/on_write/tensor.rs          49.37%              81
.../burn-cubecl-fusion/src/on_write/trace/executor.rs     66.66%              60
crates/burn-cubecl-fusion/src/matmul/args.rs               0.00%              39
...es/burn-cubecl-fusion/src/elemwise/optimization.rs     60.52%              30
crates/burn-cubecl-fusion/src/base.rs                     69.14%              29
crates/burn-cubecl-fusion/src/on_write/ir.rs              78.78%              14
...ates/burn-cubecl-fusion/src/matmul/optimization.rs     71.42%              10
crates/burn-cubecl/src/fusion.rs                          85.48%               9
... and 5 more
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2815      +/-   ##
==========================================
+ Coverage   81.70%   82.23%   +0.52%     
==========================================
  Files         852      853       +1     
  Lines      114337   113454     -883     
==========================================
- Hits        93424    93296     -128     
+ Misses      20913    20158     -755     


@nathanielsimard merged commit f3dfd05 into main on Feb 17, 2025. 11 checks passed.
@nathanielsimard deleted the refactor/cubecl-fusion branch on February 17, 2025 at 22:39.