Support online cls model prediction #769

Open
wants to merge 3 commits into base: main

Conversation


@zhangjunlongtech commented Nov 15, 2024

Thank you for your contribution to the MindOCR repo.
Before submitting this PR, please make sure:

Motivation

Currently, the mobilenet_v3 classification (CLS) model only supports offline inference with MindSpore Lite. This PR adds online mobilenet_v3 inference for the text direction classification (CLS) task and adds predict_cls.py for everyone to use, as sketched below.
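
As a rough sketch of how the new online path could be driven from Python (a hedged illustration only: the `TextClassifier` name, import path, and constructor arguments below are assumptions, not the confirmed interface added by this PR):

```python
# Hypothetical usage sketch: TextClassifier and its constructor/call signature
# are assumptions for illustration, not the confirmed interface of predict_cls.py.
import cv2

from tools.infer.text.predict_cls import TextClassifier  # assumed import path


def classify_direction(image_paths):
    """Classify text direction (e.g. 0 or 180 degrees) for cropped text images."""
    classifier = TextClassifier(algo="MV3")        # assumed constructor argument
    images = [cv2.imread(p) for p in image_paths]
    # Assumed to return one (angle, confidence) pair per input image.
    return classifier(images)


if __name__ == "__main__":
    print(classify_direction(["path/to/word_crop.png"]))  # e.g. [('180', 1.0)]
```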

Test Plan

Run the following command and check the output files under ./inference_results:

python tools/infer/text/predict_cls.py  --image_dir {path_to_img or dir_to_imgs} --rec_algorithm MV3

We can also run in single-image mode by setting --cls_batch_mode False:

python tools/infer/text/predict_cls.py  --image_dir {path_to_img or dir_to_imgs} --rec_algorithm MV3 --cls_batch_mode False

The target classification image is CRNN_t2 (see the image attached to the PR).

The cls task output should look like this:

mindocr INFO - Init classification model: MV3 --> cls_mobilenet_v3_small_100_model. Model weights loaded from pretrained url
mindocr INFO - num images for cls: 1
mindocr INFO - CLS img idx range: [0, 1)
mindocr INFO - All cls res: [('180', 1.0)]
mindocr INFO - Done! Text angle classification results saved in ./inference_results
mindocr INFO - Time cost: 6.98498272895813, FPS: 0.14316427667805537
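
Each result is an (angle, confidence) pair such as ('180', 1.0), so a typical downstream step is to rotate the crop before recognition when the classifier is confident the text is upside down. A minimal sketch of that consumption (the helper name and the 0.9 threshold are illustrative assumptions, not part of this PR):

```python
# Minimal sketch of consuming CLS results; the (angle, score) format is taken
# from the log above, while the threshold value is an illustrative assumption.
import cv2
import numpy as np


def maybe_rotate(img: np.ndarray, angle: str, score: float, thresh: float = 0.9) -> np.ndarray:
    """Rotate a text crop by 180 degrees when the classifier is confident it is upside down."""
    if angle == "180" and score >= thresh:
        return cv2.rotate(img, cv2.ROTATE_180)
    return img


img = cv2.imread("path/to/word_crop.png")
img = maybe_rotate(img, "180", 1.0)  # returns the crop rotated by 180 degrees
```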

@zhangjunlongtech zhangjunlongtech changed the title Add online prediction of text direction classification (CLS) model Support online cls model prediction Nov 15, 2024
@@ -238,6 +238,56 @@ Evaluation of the text spotting inference results on Ascend 910 with MindSpore 2
2. Unless indicated otherwise, all experiments are run with `--det_limit_type`="min" and `--det_limit_side`=720.
3. SVTR is run in mixed precision mode (amp_level=O2) since it is optimized for O2.

## Text Direction Classification
Collaborator

This doesn't need to be presented separately; just add it in the e2e section.
