Add support for mapping old fields to new ones in TLV read macros #3378

TheBlueMatt · 2024-10-22T00:04:37Z

As we've grown, we regularly face a question of whether to "break out" of our nice TLV-based struct/enum reading/writing macros in order to handle mapping legacy fields to new ones, or deal with keeping the legacy fields and handling at runtime what should be hanlded at (de-)serialization time.

This attempts to address this tradeoff by adding support for a "legacy" TLV read. This read style allows us to read a TLV which is not directly mapped to any fields in the struct/enum but which can be computed from the struct/enum's layout at write-time and which is incorporated into the read data at read-time.

It takes a type, a $read expression (which is executed after all TLVs are read but before the struct/enum is built) and a $write expression (which is executed to calculate the value to write in the TLV).

They are always read as options to retain a future ability to remove the legacy fields.

Sadly, there's two issues with doing this trivially which force us into proc-macro land:

(a) when matching the original struct we want to list the fields
in the match arm so that we have them available to write.
Sadly, we can't call a macro to have it write out the field
name based on the field type, so instead need to pass the whole
match to a proc-macro and have it walk through to find the
types and skip fields that are legacy.
(b) when building a final struct/enum after reading, we need to
list a few $field: $exprs and cannot decide whether to
include a field based on a regular macro.

The proc-macros to do so aren't trivial, but they aren't that bad either. We could instead try to rewrite our TLV stream processing macros to handle a new set of TLVs which are passed via a separate argument, but as TLVs are required to in ordered by type this requires a good chunk of additional generated code in each TLV write. It also would result in a somewhat less ergonomic callsite as it would no longer fit into our existing list of TLVs.

shaavan

Look good to me, on going through the code!

Can we craft some tests for it?

lightning-macros/src/lib.rs

shaavan · 2024-10-27T12:24:29Z

lightning-macros/src/lib.rs

+/// Wraps a `match self {..}` statement and scans the fields in the match patterns (in the form
+/// `ref $field_name: $field_ty`) for types marked `legacy`, skipping those fields.
+#[proc_macro]
+pub fn skip_legacy_fields(expr: TokenStream) -> TokenStream {


Maybe we can break the function into modular parts. Something like process_match_pattern for handling the Enum::Variant part and a process_field for the internal fields. It might help keep things organized and easier to follow!

Sadly, this stuff is just kinda inscrutable :(. I'll add more comments and split it but I'm not sure how much it'll help...

shaavan · 2024-10-27T12:29:08Z

lightning-macros/src/lib.rs

+			let is_init = macro_name == "_init_tlv_based_struct_field";
+			let ty_tokens = mac.tokens.clone().into_iter().skip(2).next();
+			if let Some(proc_macro2::TokenTree::Group(group)) = ty_tokens {


From what I understand, the input field for this would look something like:

field: _init_tlv_based_struct_field!(field_name, (legacy, ...))

since the second element needs to be a group. I’m a bit unsure, though, about what exactly should go in place of ...—I’d love any insights on that!
Also, maybe it would be helpful to expand the docs a bit to clearly outline the expected input and behavior of the macro for future reference.

That is inherently a group, anything wrapped in () or {} is a group, even if its just one token. That said, the code doesn't require the second element be a group, it will accept anything that isn't.

lightning-macros/src/lib.rs

shaavan · 2024-10-27T12:36:59Z

lightning-macros/src/lib.rs

+	res
+}
+
+/// Scans an enum definition for fields initialized to `LDK_DROP_LEGACY_FIELD_DEFINITION` and drops


Reading through the code, I wasn’t quite able to figure out how LDK_DROP_LEGACY_FIELD_DEFINITION will be used in the end. I’d love to get some insights on that! Thanks!

Its not, that's stale.

lightning/src/util/ser_macros.rs

vincenzopalazzo · 2024-10-29T17:06:47Z

lightning-macros/src/lib.rs

+	let self_ident = stream.next().unwrap();
+	expect_ident(&self_ident, Some("self"));
+	res.extend(proc_macro::TokenStream::from(self_ident));
+
+	let token_to_stream = |tok| proc_macro::TokenStream::from(tok);
+
+	let arms = stream.next().unwrap();
+	if let TokenTree::Group(group) = arms {
+		let mut new_arms = TokenStream::new();
+
+		let mut arm_stream = group.stream().into_iter().peekable();
+		while arm_stream.peek().is_some() {
+			let enum_ident = arm_stream.next().unwrap();
+			let co1 = arm_stream.next().unwrap();
+			expect_punct(&co1, ':');
+			let co2 = arm_stream.next().unwrap();
+			expect_punct(&co2, ':');


I am just putting it there for reference, if you are looking for a no dependencies parser for the proc macro I developed a PoC for the rust compiler a while back https://github.com/rsmicro/kproc-macros this simplifies the parsing a little bit IMHO, but not sure if it is worth for just a single proc macro

Eh, I think its survivable for now, will see what others think.

As we've grown, we regularly face a question of whether to "break out" of our nice TLV-based struct/enum reading/writing macros in order to handle mapping legacy fields to new ones, or deal with keeping the legacy fields and handling at runtime what should be hanlded at (de-)serialization time. This attempts to address this tradeoff by adding support for a "legacy" TLV read. This read style allows us to read a TLV which is not directly mapped to any fields in the struct/enum but which can be computed from the struct/enum's layout at write-time and which is incorporated into the read data at read-time. It takes a type, a `$read` expression (which is executed after all TLVs are read but before the struct/enum is built) and a `$write` expression (which is executed to calculate the value to write in the TLV). They are always read as `option`s to retain a future ability to remove the `legacy` fields. Sadly, there's two issues with doing this trivially which force us into `proc-macro` land: (a) when matching the original struct we want to list the fields in the match arm so that we have them available to write. Sadly, we can't call a macro to have it write out the field name based on the field type, so instead need to pass the whole match to a proc-macro and have it walk through to find the types and skip fields that are `legacy`. (b) when building a final struct/enum after reading, we need to list a few `$field: $expr`s and cannot decide whether to include a field based on a regular macro. The proc-macros to do so aren't trivial, but they aren't that bad either. We could instead try to rewrite our TLV stream processing macros to handle a new set of TLVs which are passed via a separate argument, but as TLVs are required to in ordered by type this requires a good chunk of additional generated code in each TLV write. It also would result in a somewhat less ergonomic callsite as it would no longer fit into our existing list of TLVs.

TheBlueMatt · 2024-12-08T01:00:15Z

Oops, fixed CI, but I realize we kinda forgot about this one, and #3342 (might) depend on it. Tagging 0.2 for that reason.

shaavan

LGTM mod testing! 🚀

shaavan · 2024-12-09T11:16:33Z

lightning-macros/src/lib.rs

+/// ```ignore
+/// drop_legacy_field_definition!(Self {
+///		field1: _init_tlv_based_struct_field!(field1, option),
+///     field2: _init_tlv_based_struct_field!(field2, (legacy, u64, {}, {})),


nit ✨

Suggested change

/// field2: _init_tlv_based_struct_field!(field2, (legacy, u64, {}, {})),

/// field2: _init_tlv_based_struct_field!(field2, (legacy, u64, {}, {})),

TheBlueMatt mentioned this pull request Oct 22, 2024

Introduce RouteParametersConfig #3342

Open

jkczyz self-requested a review October 24, 2024 16:24

shaavan reviewed Oct 27, 2024

View reviewed changes

vincenzopalazzo reviewed Oct 29, 2024

View reviewed changes

TheBlueMatt added 4 commits December 8, 2024 00:56

f better doc/split skip_legacy_fields

e3d3db4

f misc fixes and better macro docs

f1a7bec

f fix docs

5113f3c

TheBlueMatt force-pushed the 2024-10-legacy-tlv-type branch from 10ce004 to 5113f3c Compare December 8, 2024 00:59

TheBlueMatt added this to the 0.2 milestone Dec 8, 2024

shaavan reviewed Dec 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for mapping old fields to new ones in TLV read macros #3378

Add support for mapping old fields to new ones in TLV read macros #3378

TheBlueMatt commented Oct 22, 2024

shaavan left a comment

shaavan Oct 27, 2024

TheBlueMatt Oct 29, 2024

shaavan Oct 27, 2024

TheBlueMatt Oct 29, 2024

shaavan Oct 27, 2024

TheBlueMatt Oct 29, 2024

vincenzopalazzo Oct 29, 2024

TheBlueMatt Oct 29, 2024

TheBlueMatt commented Dec 8, 2024

shaavan left a comment

shaavan Dec 9, 2024

	/// field2: _init_tlv_based_struct_field!(field2, (legacy, u64, {}, {})),
	/// field2: _init_tlv_based_struct_field!(field2, (legacy, u64, {}, {})),

Add support for mapping old fields to new ones in TLV read macros #3378

Are you sure you want to change the base?

Add support for mapping old fields to new ones in TLV read macros #3378

Conversation

TheBlueMatt commented Oct 22, 2024

shaavan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TheBlueMatt commented Dec 8, 2024

shaavan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment