You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are trying to optimize features preprocessing step for the real-time inference, where latency is critical. We can cache some intermediate data for building tensor more efficiently, but for that purposes we need a way to extract categorical features mapping, as well as continuous feature conversion rules from the trained NVTabular workflow. Is there a way doing it?
Thanks!
The text was updated successfully, but these errors were encountered:
We are setting up online inference where features need to be preprocessed in real-time. We just need to preprocess one to few rows of data, and passing it through NVT transform() function takes too long.
We are looking to instead extract the categorical features mapping that NVT workflow has fitted to as well as the statistics that NVT collected in for the Normalize operator for each of the continuous variables (please assume all the continuous variables are simply passed through Normalize operator).
We are aware that the index mapping for categorical features can be retrieved by looking at the parquet files in the categories/ folder of the saved workflow. However, the difficulty comes with extracting the statistics learned for the continuous variables. From a quick glance around, it doesn't seem like these statistics are saved in a separate file, and I'm guessing they are pickled together in the workflow. We are looking to be able to do something similar to the following with an already fitted workflow:
We are trying to optimize features preprocessing step for the real-time inference, where latency is critical. We can cache some intermediate data for building tensor more efficiently, but for that purposes we need a way to extract categorical features mapping, as well as continuous feature conversion rules from the trained NVTabular workflow. Is there a way doing it?
Thanks!
The text was updated successfully, but these errors were encountered: