Lce benchmark and interpreter flags #717

Merged: 26 commits into larq:main on Mar 10, 2022

Conversation

@simonmaurer (Contributor) commented Mar 7, 2022

In general, this PR allows registering the different kernel implementations of LceBconv2D (lce_ops_register.h) via additional parameters in LceInterpreter and flags in the lce_benchmark_model binary.

What do these changes do?

lce_benchmark_model

  • added two command-line flags, use_reference_bconv and use_indirect_bgemm, backed by global variables that select which kernel implementation is registered
  • instead of manually parsing the flags in lce_benchmark_main.cc, the built-in Flags of TF's BenchmarkModel are used: LceBenchmarkTfLiteModel is added as a child class of BenchmarkTfLiteModel, the global variables are passed in by reference on instantiation, and they are set after the overridden Run() method is called (see the sketch below)
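
For illustration, a minimal sketch of what such a child class could look like; the class and flag names follow the PR description, but the constructor, members, and flag-registration details are assumptions rather than the merged code:

```cpp
// Hypothetical sketch of LceBenchmarkTfLiteModel (not the merged code).
#include "tensorflow/lite/tools/benchmark/benchmark_tflite_model.h"

using tflite::benchmark::BenchmarkParam;
using tflite::benchmark::BenchmarkTfLiteModel;
using tflite::benchmark::CreateFlag;

class LceBenchmarkTfLiteModel : public BenchmarkTfLiteModel {
 public:
  // The globals from lce_benchmark_main.cc are passed in by reference.
  LceBenchmarkTfLiteModel(bool& use_reference_bconv, bool& use_indirect_bgemm)
      : use_reference_bconv_(use_reference_bconv),
        use_indirect_bgemm_(use_indirect_bgemm) {
    // Register the extra parameters with the built-in flag machinery.
    params_.AddParam("use_reference_bconv",
                     BenchmarkParam::Create<bool>(false));
    params_.AddParam("use_indirect_bgemm",
                     BenchmarkParam::Create<bool>(false));
  }

  std::vector<tflite::Flag> GetFlags() override {
    auto flags = BenchmarkTfLiteModel::GetFlags();
    flags.push_back(CreateFlag<bool>("use_reference_bconv", &params_,
                                     "use the reference bconv kernel"));
    flags.push_back(CreateFlag<bool>("use_indirect_bgemm", &params_,
                                     "use the indirect BGEMM bconv kernel"));
    return flags;
  }

  TfLiteStatus Run(int argc, char** argv) {
    // The base class parses the command line; afterwards the parsed values
    // are copied back into the referenced globals.
    TfLiteStatus status = BenchmarkTfLiteModel::Run(argc, argv);
    use_reference_bconv_ = params_.Get<bool>("use_reference_bconv");
    use_indirect_bgemm_ = params_.Get<bool>("use_indirect_bgemm");
    return status;
  }

 private:
  bool& use_reference_bconv_;
  bool& use_indirect_bgemm_;
};
```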

LceInterpreter

  • added a parameter use_indirect_bgemm to register the optimized indirect BGEMM kernel via Register_BCONV_2D_OPT_INDIRECT_BGEMM() in lce_ops_register.h
  • added a parameter use_xnnpack in interpreter_wrapper_lite.cc that applies BuiltinOpResolver when true and BuiltinOpResolverWithoutDefaultDelegates otherwise (see the sketch below)
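
A rough sketch of how the resolver selection could look; the function shape here is an assumption, the actual logic lives in interpreter_wrapper_lite.cc:

```cpp
// Hypothetical sketch of the use_xnnpack resolver choice (not verbatim code).
// The LCE custom ops (e.g. via Register_BCONV_2D_OPT_INDIRECT_BGEMM() when
// use_indirect_bgemm is set) are then registered on the returned resolver.
#include <memory>

#include "tensorflow/lite/kernels/register.h"

std::unique_ptr<tflite::MutableOpResolver> MakeOpResolver(bool use_xnnpack) {
  if (use_xnnpack) {
    // The default resolver applies the default delegates, including XNNPack.
    return std::make_unique<tflite::ops::builtin::BuiltinOpResolver>();
  }
  // Without default delegates, XNNPack is not silently applied under the hood.
  return std::make_unique<
      tflite::ops::builtin::BuiltinOpResolverWithoutDefaultDelegates>();
}
```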

Related issue number

#711, #713

- added two command-line flags use_reference_bconv/use_indirect_bgemm as global variables in lce_benchmark_main.cc to register the respective kernels
- implemented LceBenchmarkTfLiteModel as a child class of BenchmarkTfLiteModel to use the built-in flags instead of manually parsing them in lce_benchmark_main.cc
- modified lce_benchmark_main.cc to use LceBenchmarkTfLiteModel; the global flags are set when the overridden Run() method is called, by passing them in as internal references
- added build options for lce_benchmark_tflite_model.h, same as in TFLite's benchmark_tflite_model.h

@Tombana (Collaborator) left a comment

Looks great, thanks a lot!

@simonmaurer (Contributor, Author) commented Mar 8, 2022

@Tombana my pleasure. I was also thinking about overriding ValidateParams() to throw an error/warning when both use_reference_bconv and use_indirect_bgemm are set to true (the same way BenchmarkTfLiteModel/BenchmarkModel do), and similarly in the LceInterpreter. What do you think? We could do it either in BenchmarkTfLiteModel/LceInterpreter or when registering in lce_ops_register.h.

On the other hand, there's a platform check that silently registers the regular bconv kernel if the optimized BGEMM kernel is not available. It might also be valuable to throw a warning for the caller there.

@lgeiger (Member) left a comment

This looks good to me. But can you explain your use case a bit? I'm curious why you need to turn off XNNPack in the interpreter.

```diff
@@ -40,11 +42,17 @@ def __init__(
     flatbuffer_model: bytes,
     num_threads: int = 1,
     use_reference_bconv: bool = False,
     use_indirect_bgemm: bool = False,
     use_xnnpack: bool = False,
```
@lgeiger (Member) commented on the diff

I think this changes the default, or am I missing something? That's fine by me, but can you explain the reasoning behind it?

@simonmaurer (Contributor, Author) replied Mar 8, 2022

Hi @lgeiger, you're correct. Following the discussion on #713, I implemented it like this.
A) One reason to include this flag in the LceInterpreter is to make it reflect the behavior of the benchmark binary more closely.
B) The second reason is dynamic allocation, i.e. models with allocation_type == kTfLiteDynamic. We had models like this, and setting use_xnnpack=true in benchmark_model throws an error for them. The same would be true for the LceInterpreter, since under the hood XNNPack is "silently" applied.

@Tombana (Collaborator) commented Mar 8, 2022

> @Tombana my pleasure. I was also thinking about overriding ValidateParams() to throw an error/warning when both use_reference_bconv and use_indirect_bgemm are set to true (the same way BenchmarkTfLiteModel/BenchmarkModel do), and similarly in the LceInterpreter. What do you think? We could do it either in BenchmarkTfLiteModel/LceInterpreter or when registering in lce_ops_register.h.

That sounds good, yes. It's probably easiest to do it in lce_ops_register.h; then all uses are covered. As for the output mechanism, it seems that by default the Interpreter uses DefaultErrorReporter() (from #include "tensorflow/lite/stderr_reporter.h"), so it's probably cleanest to use that instead of calling printf directly.
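
A rough sketch of that reporter-based variant (the function name and placement are assumptions):

```cpp
// Hypothetical sketch using DefaultErrorReporter(), as suggested above.
#include "tensorflow/lite/core/api/error_reporter.h"
#include "tensorflow/lite/stderr_reporter.h"

void WarnOnConflictingFlags(bool use_reference_bconv, bool use_indirect_bgemm) {
  if (use_reference_bconv && use_indirect_bgemm) {
    // DefaultErrorReporter() returns a static reporter that writes to stderr.
    TF_LITE_REPORT_ERROR(
        tflite::DefaultErrorReporter(),
        "Only one of use_reference_bconv and use_indirect_bgemm should be set.");
  }
}
```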

> On the other hand, there's a platform check that silently registers the regular bconv kernel if the optimized BGEMM kernel is not available. It might also be valuable to throw a warning for the caller there.

It seems that the TFLite conv kernels don't have any warnings there either, so I think we can leave it as it is.

@simonmaurer (Contributor, Author) commented Mar 9, 2022

Perfect. I first added this as you suggested, with DefaultErrorReporter(), which returns a static pointer that can report an error for this use case. Since it should be a warning, we're now using TFLITE_LOG(WARN) instead, if y'all are okay with that (see the sketch below).
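
A sketch of the TFLITE_LOG(WARN) variant; the function shape is an assumption, the real check lives in lce_ops_register.h:

```cpp
// Hypothetical sketch of the warning check (not the merged code).
#include "tensorflow/lite/tools/logging.h"

inline void CheckKernelFlags(bool use_reference_bconv,
                             bool use_indirect_bgemm) {
  if (use_reference_bconv && use_indirect_bgemm) {
    TFLITE_LOG(WARN) << "Both use_reference_bconv and use_indirect_bgemm are "
                     << "set; only one bconv kernel implementation is used.";
  }
}
```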

@simonmaurer (Contributor, Author) commented Mar 9, 2022

@Tombana @lgeiger any hints on why including "@org_tensorflow//tensorflow/lite/tools/logging" as a dependency only works for tf_cc_binary() in the BUILD of lce_benchmark_model but not for interpreter_wrapper_lite in pybind_extension()?

@Tombana (Collaborator) commented Mar 9, 2022

> @Tombana @lgeiger any hints on why including "@org_tensorflow//tensorflow/lite/tools/logging" as a dependency only works for tf_cc_binary() in the BUILD of lce_benchmark_model but not for interpreter_wrapper_lite in pybind_extension()?

I'm not sure; I think you might need ..../tools:logging instead of ..../tools/logging.
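
For reference, a hypothetical BUILD excerpt along those lines; the target layout is illustrative, only the :logging dependency is the point here:

```python
# Hypothetical BUILD excerpt: ":logging" names a target inside the tools
# package, whereas ".../tools/logging" points at a "logging" subpackage.
pybind_extension(
    name = "interpreter_wrapper_lite",
    srcs = ["interpreter_wrapper_lite.cc"],
    deps = [
        "@org_tensorflow//tensorflow/lite/tools:logging",
        # ... remaining deps unchanged ...
    ],
)
```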

@Tombana (Collaborator) commented Mar 9, 2022

I think the PR is finished, right @simonmaurer? Then it can be merged.

@simonmaurer (Contributor, Author) commented

yessir 💪

@Tombana merged commit 4cb8e72 into larq:main on Mar 10, 2022
@CNugteren added the "feature" (New feature or request) label on Apr 20, 2022