
Label detection #13502

Open
1 task done
Uddeshya1052 opened this issue Jan 30, 2025 · 6 comments
Labels
detect (Object Detection issues, PRs), question (Further information is requested)

Comments

@Uddeshya1052

Search before asking

Question

I am using YOLO to detect labels and then extract the text within the detected regions. However, I’m facing an issue with background color variations. If the background color of the label changes, the model struggles to detect it. I don’t have enough images with different background colors to train the model.

Would it be a good approach to train the model using grayscale images to generalize for any background color? Or are there alternative techniques or preprocessing steps that could help improve detection robustness in this scenario? Any suggestions or ideas would be greatly appreciated.
Thank you!

Additional

No response

Uddeshya1052 added the question (Further information is requested) label on Jan 30, 2025
UltralyticsAssistant added the detect (Object Detection issues, PRs) label on Jan 30, 2025
@UltralyticsAssistant
Member

👋 Hello @Uddeshya1052, thank you for your interest in YOLOv5 🚀! Please visit our ⭐️ Tutorials to get started, where you can find quickstart guides for simple tasks like Custom Data Training all the way to advanced concepts like Hyperparameter Evolution.

If this is a custom training ❓ Question, your approach to generalizing detection (e.g., experimenting with grayscale images or other preprocessing steps) is valid and worth investigating. However, to provide more targeted assistance, please share more details about your dataset, training setup, and any preprocessing techniques you've already tried. Additionally, please review our Tips for Best Training Results.

For now, here are a few suggestions to improve robustness:

  1. Augmentation Techniques: YOLOv5 already offers powerful augmentation options out of the box. Ensure you are leveraging augmentations like hsv_h, hsv_s, and hsv_v for color variance. You can modify these in the training configuration.
  2. Dataset Expansion Ideas: You might generate synthetic images with varied backgrounds using tools like Albumentations or Photoshop. Adding diverse data can greatly improve generalization.
  3. Grayscale Approach: Converting your dataset to grayscale before training could reduce dependency on color features. Experimenting here could be insightful.
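As a rough sketch of the grayscale approach (a hypothetical helper, not part of YOLOv5), one way is to convert each image to single-channel grayscale and replicate it back to three channels, so the model's expected input shape is unchanged:

```python
import numpy as np

def gray3(img_bgr):
    """Convert an HxWx3 BGR uint8 image to grayscale, then
    replicate the single channel back to 3 channels so the
    image still matches the model's 3-channel input."""
    b, g, r = img_bgr[..., 0], img_bgr[..., 1], img_bgr[..., 2]
    # Standard luminosity weights (same as OpenCV's BGR2GRAY)
    gray = (0.114 * b + 0.587 * g + 0.299 * r).astype(np.uint8)
    return np.stack([gray, gray, gray], axis=-1)
```

Applied to every training image (and at inference), this removes color cues while keeping the rest of the pipeline untouched.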

Requirements

Python>=3.8.0 with all requirements.txt installed including PyTorch>=1.8. To get started:

git clone https://github.com/ultralytics/yolov5  # clone
cd yolov5
pip install -r requirements.txt  # install

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Status

YOLOv5 CI

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training, validation, inference, export and benchmarks on macOS, Windows, and Ubuntu every 24 hours and on every commit.

This is an automated response, but no worries 😊—an Ultralytics engineer will assist you further as soon as possible!

@pderrenger
Member

@Uddeshya1052 for improved robustness against background variations in YOLOv5, we recommend:

  1. Leveraging YOLOv5's built-in augmentations (hsv_h, hsv_s, hsv_v, set in the hyperparameter YAML passed to train.py via --hyp) to simulate color variations
  2. Adding background images (0-10% of dataset) per our training tips guide
  3. Generating synthetic training data with varied backgrounds using tools like Photoshop/Python

Grayscale conversion alone typically isn't sufficient. Focus on data diversity through augmentation. If you need more specific guidance, please share your dataset statistics and example training mosaics from runs/train/exp/train_batch*.jpg.

@Uddeshya1052
Author

Uddeshya1052 commented Feb 4, 2025

@pderrenger Thank you for your suggestion. I tried these settings, but unfortunately, the performance decreased. I don't have a dataset with different colors. How can I still solve this issue? My application is equipment labeling detection, where labels are pasted on devices with different background colors.

@pderrenger
Member

@Uddeshya1052 for equipment label detection with variable backgrounds, consider these steps:

  1. Use controlled color augmentation (reduce HSV gains to 0.1-0.2 range in hyp.yaml)
  2. Generate synthetic label images with Python's PIL/CV2 by pasting labels onto random-colored backgrounds
  3. Implement edge detection preprocessing to focus on label contours
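A minimal sketch of suggestion 2, pasting a label crop onto a random solid-color background and emitting a YOLO-format box (a hypothetical numpy-only helper for clarity; a real pipeline would also blend edges and vary lighting):

```python
import numpy as np

def paste_on_random_bg(label, bg_h=640, bg_w=640, rng=None):
    """Paste a label crop at a random position on a random
    solid-color background. Returns the composite image and a
    YOLO-format box (cx, cy, w, h), all normalized to [0, 1]."""
    if rng is None:
        rng = np.random.default_rng()
    color = rng.integers(0, 256, size=3, dtype=np.uint8)
    bg = np.broadcast_to(color, (bg_h, bg_w, 3)).copy()
    h, w = label.shape[:2]
    y = int(rng.integers(0, bg_h - h + 1))
    x = int(rng.integers(0, bg_w - w + 1))
    bg[y:y + h, x:x + w] = label
    box = ((x + w / 2) / bg_w, (y + h / 2) / bg_h, w / bg_w, h / bg_h)
    return bg, box
```

Running this over your existing label crops with many random background colors gives a cheap synthetic dataset without collecting new photos.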

You can test grayscale conversion as a temporary inference preprocessing step without retraining:

gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
img = cv2.merge([gray, gray, gray])  # stack back to 3 channels

For specific implementation help, please share sample images from your runs/train/exp/train_batch*.jpg mosaics.

@Uddeshya1052
Author

[Three sample images attached]

@pderrenger,

Here are a few sample images I’m working with for detection. As you can see, the labels come in different colors, such as yellow and white, and we also have similar variations in other colors.

For training, I have used the following settings in the hyp.yaml file:

fl_gamma: 0.0 # focal loss gamma (EfficientDet default gamma=1.5)
hsv_h: 0.015 # image HSV-Hue augmentation (fraction)
hsv_s: 0.7 # image HSV-Saturation augmentation (fraction)
hsv_v: 0.4 # image HSV-Value augmentation (fraction)
degrees: 0.0 # image rotation (+/- deg)
translate: 0.1 # image translation (+/- fraction)
scale: 0.5 # image scale (+/- gain)
shear: 0.0 # image shear (+/- deg)
perspective: 0.0 # image perspective (+/- fraction), range 0-0.001
flipud: 0.0 # image flip up-down (probability)
fliplr: 0.5 # image flip left-right (probability)
mosaic: 1.0 # image mosaic (probability)
mixup: 0.0 # image mixup (probability)

These are the same settings used in hyp.scratch-low.yaml.
Let me know if you have any suggestions or if you need more details.

@pderrenger
Member

Thanks for sharing the samples and settings. Try slightly lowering your hsv_s and hsv_v values to moderate the color augmentation, and consider synthetic background generation to further increase variability. Also make sure you're using the latest YOLOv5 release.
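As a concrete illustration of lowering the saturation and value gains (the numbers below are hypothetical starting points, not tuned for this dataset), the relevant lines in the hyp YAML could be moderated like so:

```yaml
hsv_h: 0.015  # hue augmentation left as-is
hsv_s: 0.3    # reduced from 0.7 to moderate saturation shifts
hsv_v: 0.2    # reduced from 0.4 to moderate brightness shifts
```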
