CTC API for JAX #18952
Conversation
- Refactor sparse labels into the main `ctc_batch_cost` function for TF
Codecov Report: All modified and coverable lines are covered by tests ✅
Additional details and impacted files:
@@            Coverage Diff             @@
##           master   #18952      +/-   ##
==========================================
+ Coverage   79.55%   79.58%   +0.02%
==========================================
  Files         337      337
  Lines       35056    35116      +60
  Branches     6872     6879       +7
==========================================
+ Hits        27890    27947      +57
- Misses       5587     5588       +1
- Partials     1579     1581       +2

Flags with carried forward coverage won't be shown. ☔ View full report in Codecov by Sentry.
Thanks for the PR!
keras/ops/nn.py
Outdated
@@ -1823,7 +1823,7 @@ def ctc_loss(target, output, target_length, output_length, mask_index=0):
     Args:
         target: A tensor of shape `(batch_size, target_max_length)` containing
             the true labels in integer format.
-        output: A tensor of shape `(batch_size, output_max_length, num_classes)`
+        output: A tensor of shape `(output_max_length, batch_size, num_classes)`
That seems very counterintuitive given that all output tensors in Keras start with the batch dimension. Surely we should transpose?
Definitely, this is fixed now. Thanks!
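For context, here is a minimal sketch of the convention being settled in this exchange (names and shapes are illustrative, not the PR's actual code): the public op stays batch-first, and any backend that needs time-major input transposes internally.

```python
import numpy as np

# Illustrative shapes only: batch-first logits, as Keras ops expect.
batch_size, max_time, num_classes = 4, 50, 20
output = np.zeros((batch_size, max_time, num_classes), dtype="float32")

# A backend that needs time-major input (e.g. for a scan over time)
# transposes internally, so the public API stays batch-first.
output_time_major = np.transpose(output, (1, 0, 2))
assert output_time_major.shape == (max_time, batch_size, num_classes)
```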
keras/backend/tensorflow/nn.py
Outdated
@@ -819,6 +819,7 @@ def ctc_loss(
     target = tf.cast(target, dtype="int32")
     output = tf.convert_to_tensor(output)
     output = tf.cast(output, dtype="float32")
+    output = tf.transpose(output, perm=(1, 0, 2))
You can use the `logits_time_major=False` arg in `tf.nn.ctc_loss` to avoid this transposition.
Right, done, thanks!
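For reference, a minimal sketch of that suggestion (shapes and values are illustrative; passing `blank_index=0` here is an assumption mirroring the op's `mask_index=0` default):

```python
import tensorflow as tf

batch_size, max_time, num_classes, max_label_len = 2, 10, 5, 4

# Batch-first logits: no manual transpose is needed when
# logits_time_major=False is passed to tf.nn.ctc_loss.
output = tf.random.normal((batch_size, max_time, num_classes))
target = tf.random.uniform(
    (batch_size, max_label_len), minval=1, maxval=num_classes, dtype=tf.int32
)

loss = tf.nn.ctc_loss(
    labels=target,
    logits=output,
    label_length=tf.fill((batch_size,), max_label_len),
    logit_length=tf.fill((batch_size,), max_time),
    logits_time_major=False,  # accepts (batch, time, classes) directly
    blank_index=0,
)
```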
    batch_size, _, _ = output.shape
    batch_size, max_target_length = target.shape

    output = output.transpose((1, 0, 2))
Is this necessary for the scan op on the first dimension?
Yes AFAIK, since we're scanning along the time dimension and scan runs along the leading axis.
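A small sketch of why the transpose is needed (illustrative, not the PR's code): `jax.lax.scan` iterates over the leading axis of its input, so time must come first before scanning over timesteps.

```python
import jax
import jax.numpy as jnp

batch_size, max_time, num_classes = 2, 10, 5
output = jnp.zeros((batch_size, max_time, num_classes))

# lax.scan consumes xs along axis 0, one slice per step, so the
# logits must be time-major before scanning over timesteps.
output = output.transpose((1, 0, 2))  # -> (time, batch, classes)

def step(carry, logits_t):
    # logits_t has shape (batch, classes): one timestep for the whole batch.
    return carry, logits_t.sum()

_, per_step = jax.lax.scan(step, 0.0, output)
assert per_step.shape == (max_time,)
```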
LGTM, thank you!
CTC loss implementation for JAX. I also included a documentation fix for the op.
Thank you!
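A hedged usage sketch of the op this PR enables on the JAX backend (shapes are illustrative, and it assumes `keras.ops.ctc_loss` is the public path for the `ctc_loss` defined in `keras/ops/nn.py`):

```python
import numpy as np
import keras

batch_size, max_time, num_classes, max_label_len = 2, 10, 5, 4

# Batch-first logits and integer labels, with 0 reserved as the
# blank (mask) class per the op's mask_index=0 default.
output = np.random.randn(batch_size, max_time, num_classes).astype("float32")
target = np.random.randint(1, num_classes, (batch_size, max_label_len))

loss = keras.ops.ctc_loss(
    target=target,
    output=output,
    target_length=np.full((batch_size,), max_label_len),
    output_length=np.full((batch_size,), max_time),
)
print(loss.shape)  # expected: one loss value per batch sample
```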