Commit

PARQUET-2435: Clarify behavior of DELTA_BINARY_PACKED encoding (#231)
Address the issue of using more bits in the encoding than are used in
the underlying type being encoded.
etseidl authored Feb 28, 2024
1 parent 97ed3ba commit f65d4e1
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion Encodings.md
@@ -245,7 +245,9 @@ Subtractions in steps 1) and 2) may incur signed arithmetic overflow, and so
 will the corresponding additions when decoding. Overflow should be allowed
 and handled as wrapping around in 2's complement notation so that the original
 values are correctly restituted. This may require explicit care in some programming
-languages (for example by doing all arithmetic in the unsigned domain).
+languages (for example by doing all arithmetic in the unsigned domain). Writers
+must not use more bits when bit packing the miniblock data than would be required
+to PLAIN encode the physical type (e.g. INT32 data must not use more than 32 bits).

 The following examples use 8 as the block size to keep the examples short,
 but in real cases it would be invalid.
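
A minimal Python sketch of why the added sentence matters, not taken from any Parquet implementation. It assumes INT32 data and, for brevity, treats a whole run of values as a single miniblock that shares one minimum delta. The names `MASK32`, `to_signed32`, `miniblock_bit_width` and `decode` are hypothetical. Masking every subtraction to 32 bits emulates wrapping 2's complement arithmetic, which keeps the packed width at or below 32 bits.

```python
MASK32 = 0xFFFFFFFF  # assumption: INT32 physical type, arithmetic modulo 2**32


def to_signed32(u):
    """Reinterpret an unsigned 32-bit value as a 2's complement signed value."""
    return u - (1 << 32) if u & 0x80000000 else u


def miniblock_bit_width(values):
    """Return (min_delta, adjusted_deltas, bit_width) for a run of INT32 values."""
    # Step 1: deltas between consecutive values, wrapping on overflow.
    deltas = [to_signed32((b - a) & MASK32) for a, b in zip(values, values[1:])]
    if not deltas:
        return 0, [], 0
    # Step 2: subtract the minimum delta, wrapping again; results are unsigned.
    min_delta = min(deltas)
    adjusted = [(d - min_delta) & MASK32 for d in deltas]
    # The mask guarantees every adjusted delta fits in 32 bits.
    bit_width = max(adjusted).bit_length()
    return min_delta, adjusted, bit_width


def decode(first, min_delta, adjusted):
    """Rebuild the original values using the same wrapping additions."""
    out = [first]
    for a in adjusted:
        out.append(to_signed32((out[-1] + min_delta + a) & MASK32))
    return out


values = [-2147483648, 2147483647, -2147483648]
min_delta, adjusted, width = miniblock_bit_width(values)
print(min_delta, adjusted, width)
print(decode(values[0], min_delta, adjusted) == values)
```

Without the masking, the same input yields deltas of 4294967295 and -4294967295, and the adjusted delta 8589934590 needs 33 bits, which is the situation the new wording prohibits for INT32 data.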