wasm linker: aggressive rewrite towards Data-Oriented Design #22220

andrewrk · 2024-12-13T02:20:10Z

The goals of this branch are to:

compile faster when using the wasm linker and backend
enable saving compiler state by directly copying in-memory linker state to disk.
more efficient compiler memory utilization
introduce integer type safety to wasm linker code
generate better WebAssembly code
fully participate in incremental compilation
do as much work as possible outside of flush(), while continuing to do linker garbage collection.
avoid unnecessary heap allocations
avoid unnecessary indirect function calls

In order to accomplish these goals, this removes the ZigObject abstraction, as well as Symbol and Atom. These abstractions resulted in overly generic code, doing unnecessary work, and needless complications that simply go away by creating a better in-memory data model and emitting more things lazily.

For example, this makes wasm codegen emit MIR which is then lowered to wasm code during linking, with optimal function indexes etc, or relocations are emitted if outputting an object. Previously, this would always emit relocations, which are fully unnecessary when emitting an executable, and required all function calls to use the maximum size LEB encoding.

This branch introduces the concept of the "prelink" phase which occurs after all object files have been parsed, but before any Zcu updates are sent to the linker. This allows the linker to fully parse all objects into a compact memory model, which is guaranteed to be complete when Zcu code is generated.

Merge Checklist

data_segments state needs to be reset on update
call the gc mark functions in updateFunc
implement the prelink phase in the frontend
fix regressions / get the tests passing again
eliminate TODOs
track function import ref count for optimal leb encoding
sort undef data segments separately and memset them at runtime

Demo: Incremental Compilation

After this branch is ready to merge, I'll put a demo here.

Demo: Serializing and Deserializing Linker State

After this branch is ready to merge, I'll put a demo here.

Followup

After landing this branch I plan to set a firm release date for the 0.14.0 tag.

ELF, COFF, and MachO need the same treatment. I started with Wasm because it is significantly fewer lines of code. Some strategies can be shared there, however, I don't expect to keep as much in memory with those linkers, since the total object file size could be enormous.

Post-Merge Roadmap:

One month of QA for 0.14.0
Release 0.14.0
Enhance wasm linker enough to pass LLD's test suite for Wasm.
Remove dependency on LLD for Wasm.
Repeat steps 3-4 for ELF
Repeat steps 3-4 for COFF
Repeat steps 3-4 for MachO
Rework ELF linker code with respect to incremental compilation goals
Rework COFF linker code with respect to incremental compilation goals
Rework MachO linker code with respect to incremental compilation goals

The goals of this branch are to: * compile faster when using the wasm linker and backend * enable saving compiler state by directly copying in-memory linker state to disk. * more efficient compiler memory utilization * introduce integer type safety to wasm linker code * generate better WebAssembly code * fully participate in incremental compilation * do as much work as possible outside of flush(), while continuing to do linker garbage collection. * avoid unnecessary heap allocations * avoid unnecessary indirect function calls In order to accomplish this goals, this removes the ZigObject abstraction, as well as Symbol and Atom. These abstractions resulted in overly generic code, doing unnecessary work, and needless complications that simply go away by creating a better in-memory data model and emitting more things lazily. For example, this makes wasm codegen emit MIR which is then lowered to wasm code during linking, with optimal function indexes etc, or relocations are emitted if outputting an object. Previously, this would always emit relocations, which are fully unnecessary when emitting an executable, and required all function calls to use the maximum size LEB encoding. This branch introduces the concept of the "prelink" phase which occurs after all object files have been parsed, but before any Zcu updates are sent to the linker. This allows the linker to fully parse all objects into a compact memory model, which is guaranteed to be complete when Zcu code is generated. This commit is not a complete implementation of all these goals; it is not even passing semantic analysis.

Makes linker functions have small error sets, required to report diagnostics properly rather than having a massive error set that has a lot of codes. Other linker implementations are not ported yet. Also the branch is not passing semantic analysis yet.

See #363. Please file issues rather than making TODO comments.

mainly, rework how relocations works. This is the point at which symbol indexes are known - not before. And don't emit unnecessary relocations! They're only needed when emitting an object file. Changes wasm linker to keep MIR around long-lived so that fixups can be reapplied after linker garbage collection. use labeled switch while we're at it

Still, the branch is not yet passing semantic analysis.

This branch is passing type checking now.

with this I get 5s compilations

fix some compilation errors for reworked Emit now that it's actually referenced introduce DataSegment.Id for sorting data both from object files and from the Zcu. introduce optimization: data segment sorting includes a descending sort on reference count so that references to data can be smaller integers leading to better LEB encodings. this optimization is skipped for object files. implement uav address access function which is based on only 1 hash table lookup to find out the offset after sorting.

and more disciplined type safety for output function indexes

in which case the values array is set to undefined

Recognize three distinct phases: * before prelink ("object phase") * after prelink, before flush ("zcu phase") * during flush ("flush phase") With this setup, we create data structures during the object phase, then mutate them during the zcu phase, and then further mutate them during the flush phase. In order to make the flush phase repeatable, the data structures are copied just before starting the flush phase. Further Zcu updates occur against the non-copied data structures. What's not implemented is frontend garbage collection, in which case some more changes will be needed in this linker logic to achieve a valid state with data invariants intact.

and expose object_host_name as an option for setting the lib name for object files, since the wasm linking standards don't specify a way to do it.

one hash table lookup per fixup

instead of recursion, callers of the function are responsible for checking the respective tables that might have new entries in them and then calling lowerZcuData again.

codegen can generate zcu data dependencies that need to be populated

it cannot be done earlier since ids are not stable yet

this strategy uses a "postponed" queue to handle codegen tasks that spawn too early. there's probably a better way.

andrewrk force-pushed the wasm-linker branch from c9bf6eb to 4154612 Compare December 14, 2024 22:04

alexrp mentioned this pull request Dec 15, 2024

compiler: Switch to DWARF 5 by default for zig cc and the LLVM backend. #22235

Draft

andrewrk force-pushed the wasm-linker branch 2 times, most recently from 327a795 to ede3604 Compare December 19, 2024 04:18

andrewrk added 26 commits December 19, 2024 23:25

remove "FIXME" from codebase

61ede0a

See #363. Please file issues rather than making TODO comments.

macho linker conforms to explicit error sets, again

ad5403f

elf linker: conform to explicit error sets

991a8cc

rework error handling in the backends

c4caa38

compiler: add type safety for export indices

6fea8fa

std.array_list: tiny refactor for pleasure

e86ce4a

wasm codegen: fix some compilation errors

bbb032c

wasm: implement errors_len as a MIR opcode with no linker involvement

6b16d32

wasm codegen: switch on bool instead of int

e2b051d

wasm codegen: rename func: CodeGen to cg: CodeGen

dfeacbf

wasm: move error_name lowering to Emit phase

9f2a2e8

wasm: use call_intrinsic MIR instruction

efba9d3

switch to ArrayListUnmanaged for machine code

94e684c

wasm: fix many compilation errors

e469f94

Still, the branch is not yet passing semantic analysis.

wasm linker: support export section as implicit symbols

d21770b

frontend: add const to more Zcu pointers

5b60b76

wasm linker: implement name, module name, and type for function imports

fa8c471

wasm linker: flush implemented up to the export section

5e5756e

wasm linker: flush export section

8de53cd

wasm linker: finish the flush function

6594f79

This branch is passing type checking now.

fix compilation when enabling llvm

aacf7e3

cmake: remove deleted file

428b4a4

add dev env for wasm

3299924

with this I get 5s compilations

andrewrk added 22 commits December 19, 2024 23:25

std.Thread: don't export wasi_thread_start in single-threaded mode

5fbb851

wasm linker: implement type index method

367ef00

complete wasm.Emit implementation

d3a8a6d

fix calculation of nav alignment

70bacc8

wasm codegen: fix wrong union field for locals

a179c95

add safety for calling functions that get virtual addrs

c91013f

wasm linker: add __zig_error_name_table data when needed

c80540b

wasm codegen: fix extra index not relative

01bd1d6

wasm linker: fix calling imported functions

643fc8f

and more disciplined type safety for output function indexes

std.ArrayHashMap: allow passing empty values array

2693886

in which case the values array is set to undefined

wasm linker: handle extern functions in updateNav

6e2f1c8

wasm linker: allow undefined imports when lib name is provided

b670839

and expose object_host_name as an option for setting the lib name for object files, since the wasm linking standards don't specify a way to do it.

wasm codegen: fix call_indirect

b06125b

wasm linker: fix eliding empty data segments

c315147

wasm linker: implement data fixups

4fc51a9

one hash table lookup per fixup

wasm linker: avoid recursion in lowerZcuData

be9c9d8

instead of recursion, callers of the function are responsible for checking the respective tables that might have new entries in them and then calling lowerZcuData again.

wasm linker: also call lowerZcuData in updateFunc

eeef240

codegen can generate zcu data dependencies that need to be populated

wasm linker: initialize the data segments table in flush

89a8cdf

it cannot be done earlier since ids are not stable yet

wasm linker: zcu data fixups are already applied

b187cd9

implement error table and error names data segments

3c4b45b

andrewrk force-pushed the wasm-linker branch from ede3604 to 3c4b45b Compare December 20, 2024 07:25

andrewrk added 7 commits December 20, 2024 16:03

wasm linker: fix data section in flush

edc1d0a

implement the prelink phase in the frontend

0af4c80

this strategy uses a "postponed" queue to handle codegen tasks that spawn too early. there's probably a better way.

wasm linker: implement stack pointer global

4b66789

std.io: remove the "temporary workaround" for stage2_aarch64

976f32c

wasm linker: implement indirect function calls

4dd60b0

fix stack pointer initialized to wrong vaddr

655375f

use fixed writer in more places

a037106

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wasm linker: aggressive rewrite towards Data-Oriented Design #22220

wasm linker: aggressive rewrite towards Data-Oriented Design #22220

andrewrk commented Dec 13, 2024 •

edited

Loading

wasm linker: aggressive rewrite towards Data-Oriented Design #22220

Are you sure you want to change the base?

wasm linker: aggressive rewrite towards Data-Oriented Design #22220

Conversation

andrewrk commented Dec 13, 2024 • edited Loading

Merge Checklist

Demo: Incremental Compilation

Demo: Serializing and Deserializing Linker State

Followup

andrewrk commented Dec 13, 2024 •

edited

Loading