forked from facebookincubator/velox
Commit
Encode the minimum necessary values from a dictionary in PrestoSerializer (facebookincubator#8639)

Summary:
Pull Request resolved: facebookincubator#8639

There are cases where the values in a DictionaryVector may be much larger than what's actually needed. E.g. some UDFs, like array subscript, use DictionaryVectors to filter values.

When we use PrestoBatchSerializer in PartitionedOutput with a single destination, I suspect we'll run into similar issues: if we have a large RowVector that we break into smaller pieces, we don't necessarily want to send the whole alphabet for each piece.

This change adds logic to serializeDictionaryVector to determine the minimum subset of the alphabet to include in the serialized output. It also updates the indices to point to the correct values in this minimal dictionary.

Reviewed By: bikramSingh91

Differential Revision: D53327612

fbshipit-source-id: 1651554806d2124d982cca122fd707fcc84cab95
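
The idea of sending only the referenced part of the alphabet and remapping the indices can be sketched roughly as follows. This is an illustrative sketch only, not the code from this commit: `CompactedDictionary` and `compactDictionary` are hypothetical names, plain `std::vector`s stand in for Velox vectors and buffers, and nulls are ignored.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a dictionary-encoded column; not a Velox type.
struct CompactedDictionary {
  std::vector<std::string> alphabet; // only the values actually referenced
  std::vector<int32_t> indices;      // remapped to point into `alphabet`
};

// Keeps only the alphabet entries that `indices` references, in order of
// first use, and rewrites each index to its position in the new alphabet.
CompactedDictionary compactDictionary(
    const std::vector<std::string>& alphabet,
    const std::vector<int32_t>& indices) {
  CompactedDictionary result;
  result.indices.reserve(indices.size());

  // Maps an index into the original alphabet to its index in the compacted one.
  std::unordered_map<int32_t, int32_t> oldToNew;

  for (int32_t oldIndex : indices) {
    auto it = oldToNew.find(oldIndex);
    if (it == oldToNew.end()) {
      // First time this value is referenced: copy it into the new alphabet.
      const auto newIndex = static_cast<int32_t>(result.alphabet.size());
      result.alphabet.push_back(alphabet.at(oldIndex));
      it = oldToNew.emplace(oldIndex, newIndex).first;
    }
    result.indices.push_back(it->second);
  }
  return result;
}
```

For example, with a five-entry alphabet and indices that touch only two entries, the compacted alphabet holds just those two values, and the remapped indices still decode to the same sequence of values as before.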
1 parent 07ece26, commit 032358e
Showing 2 changed files with 184 additions and 39 deletions.