Replies: 5 comments 6 replies
-
I’d be willing to feed data if it is automated
\\Greg
…________________________________
From: Nicolas Liaudat ***@***.***>
Sent: Tuesday, February 11, 2025 2:30:49 PM
To: jomjol/AI-on-the-edge-device ***@***.***>
Cc: Subscribed ***@***.***>
Subject: [jomjol/AI-on-the-edge-device] Real user images dataset (Discussion #3551)
Hi,
I ask if someone is ready to upload an image of your counter every 12 hours to a github repo?
For the past month, I've been trying to improve the accuracy of an AI model to process all the digits in a single step, without having to specify the position of each digit.
This process is already implemented by seedstudio<https://wiki.seeedstudio.com/Train-Water-Meter-Digits-Recognition-Model-with-SenseCAP-A1101/> (but doesn't work very well).
I'm getting average results due to a lack of good usable data.
Would you be willing to send the RAW images and crop positions of your counters at regular intervals in order to improve the model?
This would also be very useful for improving the current models, which are trained on a majority of images with a dark background.
My model doesn't deal with decimals, so I think detecting a 9 instead of a 6 is complicated enough without introducing 9.1. What's more, it doesn't provide much accuracy, since the aim is to have a minimum error rate with 1 decimal place (1 liter).
Ps. : I can write the api to upload to github, huggingface.co or as you want.
—
Reply to this email directly, view it on GitHub<#3551>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AGEO6Z66A3GQA4QW3Q2YZ3T2PJFWTAVCNFSM6AAAAABW53RVI6VHI2DSMVQWIX3LMV43ERDJONRXK43TNFXW4OZXHE2DQNBXGM>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
For what it's worth, there are a few techniques that I usually employ to train a model to "ignore" dark/light background and just look for digits. I have only done this on a larger model, so I'm not sure whether this technique will produce a good result on small models that run on an ESP32.
When we actually use the model, we want to convert the input image to grey scale and normalize the brightness before feeding it to the model. |
Beta Was this translation helpful? Give feedback.
-
I an reading an LCD panel. I feel like my picture might be important
[cid:37fa6f04-ecb3-4e41-937f-9737cfa313ff]
\\Greg
…________________________________
From: Nicolas Liaudat ***@***.***>
Sent: Thursday, February 13, 2025 12:35
To: jomjol/AI-on-the-edge-device ***@***.***>
Cc: uSlackr ***@***.***>; Comment ***@***.***>
Subject: Re: [jomjol/AI-on-the-edge-device] Real user images dataset (Discussion #3551)
Thank you for the advices.
What I need is a lot more of images and especially images with the whole eight digits of the counter.
I want to pass the image in one shot in the ocr AI engine.
For grayscale transformation, I works only on custom ocr model.
If using pretrained models with fine tuning, the result with grayscale images is degraded compared to RGB input.
PS. : The collect and send images part in ai-on-the-edge will be very difficult after deep analysis.
Some has already tried without success actually. (#3326<#3326>, #1330<#1330>, #3378<#3378>)
I mean the best way to go is perhaps an home assistant component or a docker, but it will limit the number of users
—
Reply to this email directly, view it on GitHub<#3551 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AGEO6Z2B4YDNQ26YEDKE26L2PTJWTAVCNFSM6AAAAABW53RVI6VHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTEMJZGA4TCNA>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
I have done some testing on webhooks and it may works if the provided image where not AlgRoi, but Alg (Raw after alignement). The define ALGROI_LOAD_FROM_MEM_AS_JPG takes memory and I do not want to implement an second ALG_LOAD_FROM_MEM_AS_JPG to have only the raw image as it could be excessif memory usage. For memories , here is the python version of php webhook (https://jomjol.github.io/AI-on-the-edge-device-docs/Webhook/) :
|
Beta Was this translation helpful? Give feedback.
-
final component : https://github.com/nliaudat/aioted_manager |
Beta Was this translation helpful? Give feedback.
-
Hi,
I ask if someone is ready to upload an image of your counter every 12 hours to a github repo?
For the past month, I've been trying to improve the accuracy of an AI model to process all the digits in a single step, without having to specify the position of each digit.
This process is already implemented by seedstudio (but doesn't work very well).
I'm getting average results due to a lack of good usable data.
Would you be willing to send the RAW images and crop positions of your counters at regular intervals in order to improve the model?
This would also be very useful for improving the current models, which are trained on a majority of images with a dark background.
My model doesn't deal with decimals, so I think detecting a 9 instead of a 6 is complicated enough without introducing 9.1. What's more, it doesn't provide much accuracy, since the aim is to have a minimum error rate with 1 decimal place (1 liter).
Ps. : I can write the api to upload to github, huggingface.co or as you want.
Beta Was this translation helpful? Give feedback.
All reactions