-
Notifications
You must be signed in to change notification settings - Fork 9.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run LSTM recognition in multiple threads #4275
base: main
Are you sure you want to change the base?
Changes from 1 commit
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -169,8 +169,11 @@ bool Tesseract::init_tesseract_lang_data(const std::string &arg0, | |
tessedit_ocr_engine_mode == OEM_TESSERACT_LSTM_COMBINED) { | ||
#endif // ndef DISABLED_LEGACY_ENGINE | ||
if (mgr->IsComponentAvailable(TESSDATA_LSTM)) { | ||
lstm_recognizer_ = new LSTMRecognizer(language_data_path_prefix.c_str()); | ||
ASSERT_HOST(lstm_recognizer_->Load(this->params(), lstm_use_matrix ? language : "", mgr)); | ||
for (int i = 0; i < lstm_num_threads; ++i) { | ||
lstm_recognizers_.push_back(new LSTMRecognizer(language_data_path_prefix.c_str())); | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Fine for me. Upd.: There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I was just following the existing design pattern of tesseract of using |
||
lstm_recognizers_.back()->Load(this->params(), lstm_use_matrix ? language : "", mgr); | ||
} | ||
lstm_recognizer_ = lstm_recognizers_[0]; | ||
} else { | ||
tprintf("Error: LSTM requested, but not present!! Loading tesseract.\n"); | ||
tessedit_ocr_engine_mode.set_value(OEM_TESSERACT_ONLY); | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -439,6 +439,10 @@ Tesseract::Tesseract() | |
"lstm_choice_mode. Note that lstm_choice_mode must be set to a " | ||
"value greater than 0 to produce results.", | ||
this->params()) | ||
, INT_INIT_MEMBER(lstm_num_threads, 1, | ||
"Sets the number of threads used by the LSTM recognizer. The " | ||
"default value is 1.", | ||
this->params()) | ||
, double_MEMBER(lstm_rating_coefficient, 5, | ||
"Sets the rating coefficient for the lstm choices. The smaller the " | ||
"coefficient, the better are the ratings for each choice and less " | ||
|
@@ -477,7 +481,10 @@ Tesseract::~Tesseract() { | |
for (auto *lang : sub_langs_) { | ||
delete lang; | ||
} | ||
delete lstm_recognizer_; | ||
for (int i = 0; i < lstm_recognizers_.size(); ++i) { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Thanks for your review comments. I have addressed this now. |
||
delete lstm_recognizers_[i]; | ||
} | ||
lstm_recognizers_.clear(); | ||
lstm_recognizer_ = nullptr; | ||
} | ||
|
||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What about static thread pool here?
If code is called multiple times, how is std::async performance compared to static thread pool?
Considering async will create a new thread each time.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good question.
As
stweil
has clarified in his latest comment, this multiple lstm threads is not meant for mass production. This feature is probably meant for consumer-end devices running a single page OCR once in a while, but with least possible latency. In such cases thread creation/deletion is not a big overhead. But if we really come across use-cases where thread creation/deletion overhead becomes significant, we could look at replacing this with a thread pool at that point.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The default case (lstm_num_threads == 1) must not create a new thread.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, the default case (lstm_num_threads == 1) does not create a new thread. As you see, the loop starts with
i = 1
, and hence does not execute whenlstm_num_threads == 1
.