
feat(serialization): Serialize and deserialize meta tensors #70

Merged: 13 commits into main from eta/meta-tensors on Feb 8, 2024

Conversation

@Eta0 Eta0 (Contributor) commented Dec 28, 2023

Serialization and Deserialization of Meta Tensors

This change allows serialization of tensors on the meta device. A brief explanation of meta tensors from the documentation for torch.Tensor.is_meta is:

Meta tensors are like normal tensors, but they carry no data.

As such, when serialized, they have a data_length of 0 and no presence in the file beyond a tensor header, but they are recorded with a real shape. When deserialized, they are yielded as zero-filled tensors of that shape on the requested device.

When verifying a deserialized meta tensor with verify_hash=True (or TensorDeserializer.verify_module()), only the header hashes are checked, as there is no tensor data included in the original CRC32 to compare against. Whether a zero-filled tensor is still all zeroes can be checked manually with not torch.any(tensor), which returns True for an all-zero tensor and False if any element is non-zero; this is more efficient than computing a CRC32.
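
For illustration, a minimal sketch of the round trip and the manual check (write_state_dict and dictionary-style access on TensorDeserializer are assumed here, and the file path is arbitrary):

```python
import torch
from tensorizer import TensorSerializer, TensorDeserializer

# A meta tensor: it has a real shape and dtype, but carries no data.
meta_weight = torch.empty((1024, 1024), dtype=torch.float32, device="meta")

# Serialize it; the entry records the shape but has a data_length of 0.
serializer = TensorSerializer("meta_example.tensors")
serializer.write_state_dict({"layer.weight": meta_weight})
serializer.close()

# Deserialize; the meta tensor is yielded zero-filled on the requested device.
deserializer = TensorDeserializer("meta_example.tensors", device="cpu")
restored = deserializer["layer.weight"]

assert restored.shape == (1024, 1024)
# Manual check that the tensor is still all zeroes; cheaper than a CRC32.
assert not torch.any(restored)

deserializer.close()
```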

There is room to adjust the implementation to provide an option to deserialize meta tensors back onto the meta device, if desired, instead of always zero-filling them; yielding meta tensors would give the end user more flexibility.

@Eta0 Eta0 added the enhancement New feature or request label Dec 28, 2023
@Eta0 Eta0 requested review from wbrown and dmarx December 28, 2023 23:20
@Eta0 Eta0 assigned dmarx and Eta0 Dec 28, 2023
@dmarx dmarx (Contributor) commented Dec 28, 2023

lgtm, will extend the test suite to check the new behavior

@dmarx dmarx marked this pull request as ready for review December 29, 2023 00:44
@Eta0 Eta0 (Contributor, Author) commented Dec 29, 2023

This may warrant bumping up to a new tensorizer data version when meta tensors are encountered during serialization, since trying to deserialize a tensor with 0 data_length on an older tensorizer version will likely break down in bizarre ways.

@dmarx dmarx (Contributor) commented Dec 29, 2023

Sharing an informative blurb from a DM with Eta. Out of scope for this PR, but we should probably document this more formally in the future (e.g., add it to the tensorizer README):


Tensorizer has a code version and a data version

  • The code version is the PyPI package version (like 2.7.1)
  • The data version (currently 3) is encoded in files for feature compatibility checks
    • The newest version of tensorizer can read any current data version
    • The newest version of tensorizer emits files with the lowest data version applicable based on the features used
    • Older versions of tensorizer can serialize files with data versions they recognize
    • Data version 1 is the baseline
    • Data version 2 adds support for bfloat16 and some other odd datatypes
    • Data version 3 adds encryption
    • Data version 4 (proposed) would add zero-filled tensors
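
A hypothetical sketch of the "lowest data version applicable" rule above; the function and its arguments are illustrative only, not tensorizer's actual internals:

```python
# Illustrative only: not tensorizer's real logic, just the versioning rule above.
def required_data_version(uses_bfloat16: bool,
                          uses_encryption: bool,
                          has_meta_tensors: bool) -> int:
    version = 1                    # data version 1: baseline
    if uses_bfloat16:
        version = max(version, 2)  # data version 2: bfloat16 and other odd dtypes
    if uses_encryption:
        version = max(version, 3)  # data version 3: encryption
    if has_meta_tensors:
        version = max(version, 4)  # data version 4 (proposed): zero-filled tensors
    return version
```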

@Eta0 Eta0 (Contributor, Author) commented Dec 29, 2023

To add to the quote above (which is from me; hiya): this is the current de facto state of compatibility, not de jure policy. Newer versions of tensorizer are not guaranteed to maintain the same level of backwards compatibility in the serializer or deserializer (though any change that breaks it would likely come with a major version bump). This isn't documented because we have never agreed on a rigorous policy for data version compatibility, but what could be documented at the moment is a table of extant tensorizer versions with the data versions each can emit and read.

@wbrown wbrown (Collaborator) commented Feb 7, 2024

@Eta0 @dmarx Let's add a data version as part of 2.8.0?

Eta0 added 4 commits February 7, 2024 15:02

  • Merge of the latest main (conflicts resolved in CHANGELOG.md, tensorizer/_version.py, and tensorizer/serialization.py)
  • This prevents files containing meta tensors from being misinterpreted by older versions of tensorizer that do not support them.
  • This adds a feature_flags bitfield to the tensorizer file header that currently records whether a file is encrypted or not, as the previous way of recording encryption wasn't compatible with new data versions that may only be required part-way through serialization.
  • Previously, only the lazy_load setting was varied correctly. This also tests encryption with meta tensors when possible.
@Eta0 Eta0 (Contributor, Author) commented Feb 8, 2024

@wbrown Merged in the latest changes from main and added a new data version (4).

In data version 4, the previously unused "file hash" field in the tensorizer file header becomes a "feature flags" field, because the old way of indicating encryption metadata wasn't compatible with newer data versions that aren't known up-front at the beginning of serialization.
Currently, the feature flags field only tracks whether a file is encrypted, since that is the only optional file-level attribute. The presence of opaque tensors and meta tensors isn't tracked in the file feature flags, only in the ways they were previously indicated per-tensor.
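
As a loose illustration of such a bitfield (the flag names and field layout below are hypothetical; the real definitions live in tensorizer's serializer):

```python
import enum
import struct

# Hypothetical flag values for illustration only.
class FileFeatureFlags(enum.IntFlag):
    NONE = 0
    ENCRYPTED = 1 << 0  # currently the only optional file-level attribute

flags = FileFeatureFlags.ENCRYPTED

# Pack the flags into a fixed-width header field (8 bytes, little-endian, as an example).
header_field = struct.pack("<Q", int(flags))

# On read, a deserializer can reject files that set bits it does not understand.
(read_flags,) = struct.unpack("<Q", header_field)
unknown = read_flags & ~int(FileFeatureFlags.ENCRYPTED)
if unknown:
    raise ValueError(f"Unsupported feature flags: {unknown:#x}")
```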

I also fixed some broken tests and fixed a bug that was preventing meta tensor deserialization from working correctly in plaid_mode. Should be ready to merge now.

@wbrown wbrown (Collaborator) left a comment

Great job. 👍

@wbrown wbrown merged commit f3d935f into main Feb 8, 2024
6 of 7 checks passed
@Eta0 Eta0 deleted the eta/meta-tensors branch February 8, 2024 23:22