add et export with gguf with test #245
Conversation
Approving with mixed emotions.
model_to_pte = model
model_to_dso = model
else:
    if output_pte_path:
This is very kludgy; I would prefer to export to int4 and then handle it from there. Basing front-end decisions on the backend is very bad practice, because we're going to end up in a world of hurt.
Kimish and I had discussed doing a transform from int4 -> a8w4dq. Right now we just get a dequantized model.
Please plan to land that ASAP, Kimish?
cc: @kimishpatel
with torch.no_grad():
    if output_pte_path:
        output_pte_path = str(os.path.abspath(output_pte_path))
        print(f">{output_pte_path}<")
        if executorch_export_available:
            print(f"Exporting model using Executorch to {output_pte_path}")
-           export_model_et(model, builder_args.device, args.output_pte_path, args)
+           export_model_et(model_to_pte, builder_args.device, args.output_pte_path, args)
I don't like this at all :( But we're out of runway, so I will approve for now.
@@ -68,7 +90,7 @@ def main(args):
    if output_dso_path:
        output_dso_path = str(os.path.abspath(output_dso_path))
        print(f"Exporting model using AOT Inductor to {output_dso_path}")
-       export_model_aoti(model, builder_args.device, output_dso_path, args)
+       export_model_aoti(model_to_dso, builder_args.device, output_dso_path, args)
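The pattern in these two hunks is the same: a pre-selected model variant (model_to_pte or model_to_dso) is routed to the matching backend exporter based on which output path was requested. A minimal, self-contained sketch of that dispatch (exporter bodies are stubbed here; the real ones live in torchchat and take additional arguments):

```python
import os

def export_model_et(model, device, output_path, args=None):
    # Stub standing in for the ExecuTorch (.pte) exporter.
    return f"pte:{output_path}"

def export_model_aoti(model, device, output_path, args=None):
    # Stub standing in for the AOT Inductor (.dso) exporter.
    return f"dso:{output_path}"

def export(model_to_pte, model_to_dso, device,
           output_pte_path=None, output_dso_path=None):
    """Route each requested output format to its backend exporter."""
    results = []
    if output_pte_path:
        output_pte_path = str(os.path.abspath(output_pte_path))
        results.append(export_model_et(model_to_pte, device, output_pte_path))
    if output_dso_path:
        output_dso_path = str(os.path.abspath(output_dso_path))
        results.append(export_model_aoti(model_to_dso, device, output_dso_path))
    return results
```

The reviewer's objection is that model_to_pte and model_to_dso are chosen upstream based on which backend is in play, i.e. a front-end decision driven by backend capabilities.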
Ditto.
Force-pushed f8884e6 to bc92599
Force-pushed bc92599 to 9563191
* add et export with gguf with test
* fix generate too
* add gguf path to generate
ET does not support _weight_int4pack_mm, so this adds gguf_kwargs that can be passed to the builder to control whether the GGUF file should be loaded with load_as_quantized. If load_as_quantized=False, the GGUF weights are converted to floating point.
Also adds a test for torchchat export + generate with a gguf file to et.yml.