Model Export to liteRT #2405

pctablet505 · 2025-09-17T09:25:59Z

Added support for model export to keras-hub models.

This PR requires keras-team/keras#21674 as prerequisite, the export feature in keras.
Then it is built on top of that.

Simple Demo

Complete Numeric verification tests multiple models for numeric verifications.

Verified models:

llama3.2_1b
gemma3_1b
gpt2_base_en
resnet_50_imagenet
efficientnet_b0_ra_imagenet
densenet_121_imagenet
mobilenet_v3_small_100_imagenet
dfine_nano_coco
retinanet_resnet50_fpn_coco
deeplab_v3_plus_resnet50_pascalvoc

This reverts commit 62d2484.

This reverts commit de830b1.

export working 1st commit

Refactored exporter and registry logic for better type safety and error handling. Improved input signature methods in config classes by extracting sequence length logic. Enhanced LiteRT exporter with clearer verbose handling and stricter error reporting. Registry now conditionally registers LiteRT exporter and extends export method only if dependencies are available.

gemini-code-assist

Summary of Changes

Hello @pctablet505, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a comprehensive and extensible framework for exporting Keras-Hub models to various formats, with an initial focus on LiteRT. The system is designed to seamlessly integrate with Keras-Hub's model architecture, particularly by addressing the unique challenge of handling dictionary-based model inputs during the export process. This enhancement significantly improves the deployability of Keras-Hub models by providing a standardized and robust export pipeline, alongside crucial compatibility fixes for TensorFlow's SavedModel/TFLite export mechanisms.

Highlights

New Model Export Framework: Introduced a new, extensible framework for exporting Keras-Hub models, designed to support various formats and model types.
LiteRT Export Support: Added specific support for exporting Keras-Hub models to the LiteRT format, verified for models like gemma3, llama3.2, and gpt2.
Registry-Based Configuration: Implemented an ExporterRegistry to manage and retrieve appropriate exporter configurations and exporters based on model type and target format.
Input Handling for Keras-Hub Models: Developed a KerasHubModelWrapper to seamlessly convert Keras-Hub's dictionary-based inputs to the list-based inputs expected by the underlying Keras LiteRT exporter.
TensorFlow Export Compatibility: Added compatibility shims (_get_save_spec and _trackable_children) to Keras-Hub Backbone models to ensure proper functioning with TensorFlow's SavedModel and TFLite export utilities.
Automated Export Method Extension: The Task class in Keras-Hub models is now automatically extended with an export method, simplifying the model export process for users.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a significant new feature: model exporting to liteRT. The implementation is well-structured, using a modular and extensible registry pattern. However, there are several areas that require attention. The most critical issue is the complete absence of tests for the new export functionality, which is a direct violation of the repository's style guide stating that testing is non-negotiable. Additionally, I've identified a critical bug in the error handling logic within the lite_rt.py exporter that includes unreachable code. There are also several violations of the style guide regarding the use of type hints in function signatures across all new files. I've provided specific comments and suggestions to address these points, which should help improve the robustness, maintainability, and compliance of this new feature.

keras_hub/src/export/lite_rt.py

keras_hub/src/export/base.py

keras_hub/src/export/configs.py

keras_hub/src/export/lite_rt.py

keras_hub/src/export/registry.py

keras_hub/src/models/__init__.py

Introduces the keras_hub.api.export submodule and updates the main API to expose it. The new export module imports various exporter configs and functions from the internal export package, making them available through the public API.

Added ImageClassifierExporterConfig, ImageSegmenterExporterConfig, and ObjectDetectorExporterConfig to the export API. Improved input shape inference and dummy input generation for image-related exporter configs. Refactored LiteRTExporter to better handle model type checks and input signature logic, with improved error handling for input mapping.

Moved the 'import keras' statement to the top of the module and removed redundant local imports within class methods. This improves code clarity and avoids repeated imports.

Deleted the debug_object_detection.py script, which was used for testing object detection model outputs and export issues. This cleanup removes unused debugging code from the repository.

Renames all references of 'LiteRT' to 'Litert' across the codebase, including file names, class names, and function names. Updates exporter registry and API imports to use the new 'litert' naming. Also improves image model exporter configs to dynamically determine input dtype from the model, enhancing flexibility for different input types. Adds support for ImageSegmenter model type detection in the exporter registry.

Refactored InputSpec definitions in exporter configs for improved readability by placing each argument on a separate line. Updated import path in litert.py to import from keras.src.export.litert instead of keras.src.export.litert_exporter.

divyashreepathihalli · 2025-10-08T21:09:06Z

@pctablet505 can you update the Colab to use the changes from this PR? and keep the demo short where you load a model and export it, and reload and verify numerics?

divyashreepathihalli · 2025-10-08T21:10:02Z

Also if you have addressed the Gemini comments, please respond on it. Is this PR dependent on the PR in Keras repo? - keras-team/keras#21674

Simplifies and unifies input signature and dummy input generation for text and image models by removing redundant helper methods and centralizing logic. Updates model building in KerasHubExporter to use input signatures and improves error handling. Refactors LiteRT exporter to use the new parameterized input signature and model building approach, reducing code duplication and improving maintainability.

Removed redundant registry initialization in export_model and clarified model building comments in KerasHubExporter. Switched to using warnings.warn for import errors in models/__init__.py instead of print statements for better error reporting.

pctablet505 · 2025-10-09T05:50:34Z

@gemini-code-assist please review it again

gemini-code-assist

Code Review

This is a significant and valuable addition to KerasHub, enabling model exportation to LiteRT. The overall structure is well-designed, following a modular and extensible pattern similar to Hugging Face's Optimum.

I've reviewed this pull request, focusing on correctness, maintainability, and adherence to the repository's style guide. The style guide was instrumental in this review, particularly the sections on Testing and Validation (lines 401-496) and Docstrings and Type Hinting (lines 364-398).

My main findings are:

Critical Issue: Missing Tests
- The pull request introduces substantial new functionality for model exporting across several new files (base.py, configs.py, litert.py, registry.py), but it lacks corresponding tests.
- The repository style guide is explicit that "Testing is a non-negotiable part of every contribution" (line 403) and "Every .py file containing logic...must have a corresponding _test.py file" (line 406).
- Please add comprehensive unit tests for the new export logic, covering different model types, configurations, and edge cases. This is crucial to ensure the robustness and correctness of this feature.
Other Findings
- I've also left several inline comments regarding a bug in model type detection, incorrect dtype handling, and violations of the docstring style guide. Please address these to improve code quality and consistency.

keras_hub/src/export/configs.py

keras_hub/src/export/registry.py

keras_hub/src/export/base.py

keras_hub/src/export/litert.py

Refined dtype extraction logic in image and object model exporter configs to better handle different dtype representations. Updated LiteRT exporter to use Keras io_utils for progress messages and improved verbose flag handling. Added ObjectDetector and ImageSegmenter to export registry model type checks. Enhanced docstrings for clarity and consistency in base exporter classes.

gemini-code-assist

Code Review

This pull request introduces a significant new feature: support for exporting Keras-Hub models to the LiteRT format. The implementation is well-designed and modular, with base classes for exporters and configurations, a registry for discoverability, and a specific LiteRTExporter. The feature is nicely integrated by extending the export method on Task models, providing a seamless user experience. The accompanying tests are thorough, covering unit tests with mocks and crucial integration tests with production models that include numerical verification. I have identified a couple of minor issues: one related to redundant code in an __init__.py file and another concerning a misleading docstring. These are straightforward to address. Overall, this is a high-quality contribution that adds valuable functionality.

keras_hub/api/export/__init__.py

keras_hub/src/export/configs.py

Corrected the docstring in _is_model_compatible to refer to text classifier instead of image classifier.

keras_hub/src/export/litert_export_design.md

divyashreepathihalli · 2025-10-27T22:28:10Z

keras_hub/src/export/litert_models_test.py

@@ -0,0 +1,536 @@
+"""Tests for LiteRT export with specific production models.


better way is to add a standardized test like this - https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/tests/test_case.py#L414

enable it for all models

we can not run the test for all models on keras-hub, some are in multiple gigabytes, and that would require too much memory that won't be available. the system with 50GB crashes for 7b models.

Refactored LiteRT export tests to use a standardized helper method for model export and numerical accuracy verification, reducing code duplication and improving maintainability. Removed direct file and interpreter management in favor of a unified approach, tightened numerical accuracy thresholds, and ensured proper resource cleanup. Updated test cases to dynamically determine input shapes and handle model-specific requirements, improving robustness and reliability of export validation.

Adds support for specifying per-output numerical thresholds and input ranges in LiteRT export tests. Refactors test utilities to handle output mapping, threshold configuration, and input preparation for improved accuracy and flexibility across model types.

pctablet505 · 2025-10-28T13:44:27Z

@gemini-code-assist review

gemini-code-assist

Code Review

This is an excellent and comprehensive pull request that adds a much-needed model export functionality to LiteRT. The design is well-thought-out, using a registry and adapter pattern to handle various model architectures and their specific requirements. The code is well-structured and accompanied by extensive tests, including unit, integration, and end-to-end tests with production models. The detailed design document is also a great addition. I have one suggestion to improve the code by replacing a private TensorFlow API with a public one.

keras_hub/src/models/backbone.py

Replaces class-based test cases with pytest parameterized functions for CausalLM, ImageClassifier, ObjectDetector, and ImageSegmenter LiteRT export tests. This improves test readability, reduces code duplication, and ensures each model configuration is tested independently with clearer output. Cleans up helper methods and consolidates numerical verification logic.

pctablet505 · 2025-10-28T16:51:34Z

@gemini-code-assist review

gemini-code-assist

Code Review

This pull request introduces a comprehensive framework for exporting Keras-Hub models to the LiteRT format. The changes are well-structured, introducing base classes for exporters and configurations, a registry for extensibility, and specific implementations for various model types (CausalLM, ImageClassifiers, etc.). The integration is handled cleanly by extending the Task class's export method. The pull request also includes extensive tests, from unit tests for the new components to end-to-end numerical verification with production models, which is excellent. I've included a couple of suggestions to improve the maintainability and robustness of the new testing utilities and export registry.

keras_hub/src/tests/test_case.py

keras_hub/src/export/registry.py

Removed the export registry and related initialization logic, replacing it with a direct model type detection via `get_exporter_config`. The `Task` class now provides its own `export` method for specialized Keras-Hub model export, supporting dictionary inputs and LiteRT export. Cleaned up imports and removed registry-related tests and files.

pctablet505 · 2025-10-28T17:08:09Z

@gemini-code-assist review

gemini-code-assist

Code Review

This pull request introduces a significant new feature: exporting Keras-Hub models to the LiteRT format. The implementation is well-structured, following an exporter pattern with base classes, model-specific configurations, and a dedicated LiteRT exporter. The use of adapter classes to handle Keras-Hub's dictionary-based inputs is a clever solution. The changes are accompanied by a comprehensive suite of tests, including unit tests, integration tests, and end-to-end numerical verification for a wide range of production models, which is excellent. The addition of run_litert_export_test to the base TestCase will be very valuable for future contributions. Overall, this is a high-quality contribution that greatly enhances the utility of Keras-Hub. I have found one critical issue in the convenience function export_litert that needs to be addressed.

keras_hub/src/export/litert.py

…o export

Updates LiteRT exporter and related configs to support dynamic input shapes by default for text and image models, allowing runtime resizing via TFLite's interpreter.resize_tensor_input(). Removes static sequence length inference, adapts input signature logic, and updates tests to verify dynamic shape support and runtime resizing. Also improves multimodal model handling and input mapping for TFLite export.

Adds tests to verify that SignatureDef preserves input names for ImageClassifier and CausalLM models. Refactors test utilities to use SignatureDef for input/output mapping, ensuring meaningful names and robust output verification. Updates numerical accuracy checks to compare outputs by name using SignatureDef, and adds validation for expected input/output names in exported models.

Consolidates LiteRT input preparation, inference, and output verification into clearer helper methods. Improves handling of dynamic shapes, input/output name matching via SignatureDef, and output comparison logic. Updates docstrings and argument names for consistency and readability. bug fix in inference: - there was a bug, that used corrupted the results of invoke, during getting the SignatureDef fixed it.

Updated test utilities to use TFLite's signature runner for inference, simplifying input handling and output extraction. Also updated model creation in numerical accuracy tests to use explicit Input layers for clarity and consistency.

divyashreepathihalli · 2025-11-04T19:39:09Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a robust and well-tested feature for exporting Keras-Hub models to the LiteRT format. The implementation is well-structured, with clear separation of concerns into base classes, model-specific configurations, and the exporter itself. The addition of an export() method to the Task class provides a convenient user-facing API. The necessary compatibility shims for TensorFlow export are well-documented, including warnings about private API usage. The test coverage is excellent, including both unit tests with mocks and end-to-end tests with real model presets, which ensures the reliability of this new feature. My feedback includes a few minor suggestions to improve code clarity and maintainability in the API and task modules.

keras_hub/api/__init__.py

keras_hub/api/export/__init__.py

keras_hub/src/models/task.py

pctablet505 and others added 11 commits September 1, 2025 19:11

Update backbone.py

087b9b2

Update backbone.py

de830b1

Update task.py

62d2484

Revert "Update task.py"

3b71125

This reverts commit 62d2484.

Revert "Update backbone.py"

3d453ff

This reverts commit de830b1.

export

92b1254

export working 1st commit

refactoring

e46241d

refactor

6e970e2

Update registry.py

15ad9f3

Merge branch 'keras-team:master' into export

901c233

github-actions bot added the Gemma Gemma model specific issues label Sep 17, 2025

gemini-code-assist bot reviewed Sep 17, 2025

View reviewed changes

pctablet505 added 9 commits September 22, 2025 11:06

reformat

442fdd3

Add export submodule to keras_hub API

5446e2a

Introduces the keras_hub.api.export submodule and updates the main API to expose it. The new export module imports various exporter configs and functions from the internal export package, making them available through the public API.

reformat

5c31d88

now supporting export for objectDetectors

3290d42

Refactor: move keras import to module level

8df5a75

Moved the 'import keras' statement to the top of the module and removed redundant local imports within class methods. This improves code clarity and avoids repeated imports.

Remove debug_object_detection.py script

759d223

Deleted the debug_object_detection.py script, which was used for testing object detection model outputs and export issues. This cleanup removes unused debugging code from the repository.

pctablet505 added 2 commits October 9, 2025 10:59

gemini-code-assist bot reviewed Oct 9, 2025

View reviewed changes

gemini-code-assist bot reviewed Oct 27, 2025

View reviewed changes

keras_hub/api/export/__init__.py Show resolved Hide resolved

keras_hub/src/export/configs.py Outdated Show resolved Hide resolved

pctablet505 added 3 commits October 27, 2025 12:54

Fix docstring in TextClassifierExporterConfig

ec37ac4

Corrected the docstring in _is_model_compatible to refer to text classifier instead of image classifier.

Update base.py

911eb96

Create litert_export_design.md

51b99b1

divyashreepathihalli reviewed Oct 27, 2025

View reviewed changes

pctablet505 added 2 commits October 28, 2025 14:47

gemini-code-assist bot reviewed Oct 28, 2025

View reviewed changes

keras_hub/src/models/backbone.py Show resolved Hide resolved

pctablet505 added 4 commits October 28, 2025 19:41

Update litert_models_test.py

4adeadf

Delete litert_export_design.md

00f49ca

Update litert_models_test.py

5fa0498

gemini-code-assist bot reviewed Oct 28, 2025

View reviewed changes

keras_hub/src/tests/test_case.py Outdated Show resolved Hide resolved

keras_hub/src/export/registry.py Outdated Show resolved Hide resolved

gemini-code-assist bot reviewed Oct 28, 2025

View reviewed changes

keras_hub/src/export/litert.py Outdated Show resolved Hide resolved

pctablet505 and others added 4 commits October 28, 2025 22:42

Update litert.py

519c3b6

Merge branch 'keras-team:master' into export

2dcbf23

Update task.py

0136c34

Merge branch 'export' of https://github.com/pctablet505/keras-hub int…

f8bd6fa

…o export

pctablet505 requested a review from divyashreepathihalli October 29, 2025 06:22

pctablet505 added 5 commits October 29, 2025 12:01

Update test_case.py

14cffe0

Refactor TFLite inference to use signature runner

d43de36

Updated test utilities to use TFLite's signature runner for inference, simplifying input handling and output extraction. Also updated model creation in numerical accuracy tests to use explicit Input layers for clarity and consistency.

gemini-code-assist bot reviewed Nov 4, 2025

View reviewed changes

keras_hub/api/__init__.py Show resolved Hide resolved

keras_hub/api/export/__init__.py Show resolved Hide resolved

keras_hub/src/models/task.py Show resolved Hide resolved

		@@ -0,0 +1,536 @@
		"""Tests for LiteRT export with specific production models.

Model Export to liteRT #2405

Are you sure you want to change the base?

Model Export to liteRT #2405

Conversation

pctablet505 commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

divyashreepathihalli commented Oct 8, 2025

Uh oh!

divyashreepathihalli commented Oct 8, 2025

Uh oh!

pctablet505 commented Oct 9, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

divyashreepathihalli Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pctablet505 Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

pctablet505 commented Oct 28, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

pctablet505 commented Oct 28, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

pctablet505 commented Oct 28, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

divyashreepathihalli commented Nov 4, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

pctablet505 commented Sep 17, 2025 •

edited

Loading

divyashreepathihalli Oct 27, 2025 •

edited

Loading