Deserialization performance is poor for large products #44

henryem · 2019-06-15T02:42:51Z

Currently, as noted in the project page, we use an O(N*M) algorithm for deserializing products with N fields repeated a total of M times. In profiling the deserialization of a complex deeply-nested object, I found that we end up spending a huge amount of time in the resulting calls to CodedInputStream.skipField.

One way to eliminate this is to (1) create a map from field index to parser, (2) loop through fields, passing each to its appropriate parser, and (3) finally build the results of each parser. I have prepared a PR that does this, which I'll be submitting soon (once my previous PR, which is a dependency, is merged). My PR also reduces memory pressure for deserializing nested objects by eliminating allocation of extra byte buffers in all cases except coproduct parsing.

The text was updated successfully, but these errors were encountered:

henryem mentioned this issue Jul 7, 2019

Improve product deserialization performance #46

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deserialization performance is poor for large products #44

Deserialization performance is poor for large products #44

henryem commented Jun 15, 2019 •

edited

Loading

Deserialization performance is poor for large products #44

Deserialization performance is poor for large products #44

Comments

henryem commented Jun 15, 2019 • edited Loading

henryem commented Jun 15, 2019 •

edited

Loading