Convert to Keras 3: Knowledge Distillation example #18493

Conversation
This reverts commit 97f6559.
@fchollet, would love to hear your feedback.
Codecov Report: All modified lines are covered by tests ✅

Additional details and impacted files:

@@            Coverage Diff             @@
##           master   #18493      +/-   ##
==========================================
- Coverage   77.40%   73.77%    -3.64%
==========================================
  Files         331      331
  Lines       31972    31984       +12
  Branches     6241     6246        +5
==========================================
- Hits        24749    23596     -1153
- Misses       5646     6857     +1211
+ Partials     1577     1531       -46

Flags with carried forward coverage won't be shown. ☔ View full report in Codecov by Sentry.
Formatted diff:

--- a/examples/keras_io/tensorflow/vision/knowledge_distillation.py
+++ b/examples/keras_io/tensorflow/vision/knowledge_distillation.py
@@ -1,12 +1,11 @@
"""
Title: Knowledge Distillation
Author: [Kenneth Borup](https://twitter.com/Kennethborup)
+Converted to Keras 3: [Md Awsafur Rahman](https://awsaf49.github.io)
Date created: 2020/09/01
Last modified: 2020/09/01
Description: Implementation of classical Knowledge Distillation.
-Accelerator: GPU
"""
-
"""
## Introduction to Knowledge Distillation
@@ -29,12 +28,16 @@
## Setup
"""
+import os
+
+os.environ["KERAS_BACKEND"] = "tensorflow"
+
+import keras
+from keras import layers
+from keras import ops
import tensorflow as tf
-from tensorflow import keras
-from tensorflow.keras import layers
import numpy as np
-
"""
## Construct `Distiller()` class
@@ -50,8 +53,10 @@
- An optimizer for the student and (optional) metrics to evaluate performance
In the `train_step` method, we perform a forward pass of both the teacher and student,
-calculate the loss with weighting of the `student_loss` and `distillation_loss` by `alpha` and
-`1 - alpha`, respectively, and perform the backward pass. Note: only the student weights are updated,
+calculate the loss with weighting of the `student_loss` and `distillation_loss` by
+`alpha` and
+`1 - alpha`, respectively, and perform the backward pass. Note: only the student weights
+are updated,
and therefore we only calculate the gradients for the student weights.
In the `test_step` method, we evaluate the student model on the provided dataset.
@@ -111,8 +116,8 @@ def train_step(self, data):
# as 1/T^2, multiply them by T^2 when using both hard and soft targets.
distillation_loss = (
self.distillation_loss_fn(
- tf.nn.softmax(teacher_predictions / self.temperature, axis=1),
- tf.nn.softmax(student_predictions / self.temperature, axis=1),
+ ops.softmax(teacher_predictions / self.temperature, axis=1),
+ ops.softmax(student_predictions / self.temperature, axis=1),
)
* self.temperature**2
)
@@ -168,7 +173,7 @@ def test_step(self, data):
[
keras.Input(shape=(28, 28, 1)),
layers.Conv2D(256, (3, 3), strides=(2, 2), padding="same"),
- layers.LeakyReLU(alpha=0.2),
+ layers.LeakyReLU(negative_slope=0.2),
layers.MaxPooling2D(pool_size=(2, 2), strides=(1, 1), padding="same"),
layers.Conv2D(512, (3, 3), strides=(2, 2), padding="same"),
layers.Flatten(),
@@ -182,7 +187,7 @@ def test_step(self, data):
[
keras.Input(shape=(28, 28, 1)),
layers.Conv2D(16, (3, 3), strides=(2, 2), padding="same"),
- layers.LeakyReLU(alpha=0.2),
+ layers.LeakyReLU(negative_slope=0.2),
layers.MaxPooling2D(pool_size=(2, 2), strides=(1, 1), padding="same"),
layers.Conv2D(32, (3, 3), strides=(2, 2), padding="same"),
layers.Flatten(),
@@ -198,7 +203,8 @@ def test_step(self, data):
## Prepare the dataset
The dataset used for training the teacher and distilling the teacher is
-[MNIST](https://keras.io/api/datasets/mnist/), and the procedure would be equivalent for any other
+[MNIST](https://keras.io/api/datasets/mnist/), and the procedure would be equivalent for
+any other
dataset, e.g. [CIFAR-10](https://keras.io/api/datasets/cifar10/), with a suitable choice
of models. Both the student and teacher are trained on the training set and evaluated on
the test set.
@@ -284,4 +290,4 @@ def test_step(self, data):
You should expect the teacher to have accuracy around 97.6%, the student trained from
scratch should be around 97.6%, and the distilled student should be around 98.1%. Remove
or try out different seeds to use different weight initializations.
-"""
\ No newline at end of file
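
For context, the loss computation that this diff converts to `keras.ops` boils down to the standalone sketch below. The random logits, labels, and hyperparameter values are made up for illustration; only the temperature-scaled softmax and the `alpha` / `1 - alpha` weighting mirror the example.

import os

os.environ["KERAS_BACKEND"] = "tensorflow"

import keras
from keras import ops
import numpy as np

# Stand-in logits for the teacher and student forward passes (illustrative only).
teacher_predictions = keras.random.normal((8, 10))
student_predictions = keras.random.normal((8, 10))
labels = np.random.randint(0, 10, size=(8,))

alpha = 0.1        # weight on the hard-label student loss (illustrative value)
temperature = 10   # softens both probability distributions (illustrative value)

student_loss_fn = keras.losses.SparseCategoricalCrossentropy(from_logits=True)
distillation_loss_fn = keras.losses.KLDivergence()

student_loss = student_loss_fn(labels, student_predictions)
# Soft targets scale as 1/T^2, hence the T^2 factor (as in the example).
distillation_loss = (
    distillation_loss_fn(
        ops.softmax(teacher_predictions / temperature, axis=1),
        ops.softmax(student_predictions / temperature, axis=1),
    )
    * temperature**2
)
loss = alpha * student_loss + (1 - alpha) * distillation_loss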
Thanks for the PR!
Very nice!
This PR will add the Knowledge Distillation example from keras.io/examples. Sadly, this example is not backend agnostic, presumably due to tf.GradientTape(), which is TensorFlow-specific.
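
As a minimal sketch of why the example stays tied to the TensorFlow backend (this is not the PR's Distiller class, and the distillation term is omitted for brevity): the custom train_step records the forward pass on a tf.GradientTape and applies gradients manually, an API the JAX and PyTorch backends do not provide.

import os

os.environ["KERAS_BACKEND"] = "tensorflow"

import tensorflow as tf
import keras


class MinimalStudentTrainer(keras.Model):
    """Stripped-down stand-in for the example's Distiller: hard-label loss only."""

    def __init__(self, student):
        super().__init__()
        self.student = student
        self.loss_fn = keras.losses.SparseCategoricalCrossentropy(from_logits=True)

    def train_step(self, data):
        x, y = data
        # tf.GradientTape is the TensorFlow-only piece: gradients are recorded
        # and applied by hand, so this train_step cannot run on other backends.
        with tf.GradientTape() as tape:
            student_predictions = self.student(x, training=True)
            loss = self.loss_fn(y, student_predictions)
        gradients = tape.gradient(loss, self.student.trainable_variables)
        self.optimizer.apply_gradients(
            zip(gradients, self.student.trainable_variables)
        )
        return {"loss": loss}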