You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when the model (it is transformer based XLNet model that we are training) is loaded and saved again, the argument names disappear (become 'args_0, args_0_1 etc) and the saved model becomes invalid. Because of that we cannot export the reloaded model using Merlin Systems, therefore cannot generate the config files to serve on Triton.
Steps/Code to reproduce bug
Please run these two gists in order to repro the issue:
One quick solution to that is to add model block line right before loading the saved model. Something like that:
_ = mm.XLNetBlock(d_model=32, n_head=2, n_layer=2) # replace XLNetBlock with `GPT2Block` if you use GPT2 block in model training.
loaded_model = tf.keras.models.load_model('/workspace/saved_model/')
Bug description
when the model (it is transformer based XLNet model that we are training) is loaded and saved again, the argument names disappear (become 'args_0, args_0_1 etc) and the saved model becomes invalid. Because of that we cannot export the reloaded model using Merlin Systems, therefore cannot generate the config files to serve on Triton.
Steps/Code to reproduce bug
Please run these two gists in order to repro the issue:
Expected behavior
Environment details
The text was updated successfully, but these errors were encountered: