Python: Custom text image compression

Python: This is an exercise on how to compress text images using custom code to process the image and compress it using the walsh hadamard function.

Open an image file (color) which contain text (using cv2).
Convert to grayscale using a custom implementation.
Treshold the grayscale to get a binary picture.
Segment vertically, finding not empty rows and building the linked list (finding the bounding box of text rows).
Running through the list, segmenting horizontally, seeking empty pixel columns, so finding the bounding box of individual chars, inserting them into, "replacing" the row segment.
Cuting out empy rows or columns from the characters bounding box.
Transforming each character's picture (the bounding box part of the original picture) to a 64x64 binary image, and storing this in the linked list element.
Applying Hadamard-Walsh tranformation to each 64x64 picture, storing the result of the transform into the array.
The program should finally compare H-W transform value to a pre-built "database" of various characters with various fonts (the English ABC with maybe Ariel, Times New Roman, and something else) (Not done but it work on various fonts).

4. Solution

5. Challenges

The program is Slow, this depends on the machine proccessing power BUT, overall is slow.
The program uses sequential modification on images, this is not optimal, but due to the goal of the task being a "learning" task, this is acceptable.
The program is not optimal, the processing of an image can take several minutes depending on the size. For this task a 8 million pixel images was tested and took around 2 minutes of proccessing.
Is not using object oriented programming, even knowing this is not hard to change I DON'T recommend using OOP, because this paradigm reduces proccessing speed.
The programm is not capable to proccess phone pictures, specially when peper sound is in the image. I tested som clean phone images and seems to work but is not entirely accurate. This might require a more complex solution like a gauss transform.

6. Annotations

Overall the program is working great, docstring was used.
The App.py code includes some statements used to show the proff of work.
I recommend an IDE with intellisense such as vscode to check the code, this is so you can see the documentation with the pop ups and save time navigating through the code.
Due to the paradigm used, the docstring includes wich functions are being used. E.g. inside foo(), foo1() and foo2() are being used, this is specified in the docstring of foo() for a better comprehension.
The HW transform is done with binary matrix of single value cell, not pixels.

7. Code flow

I'm going to try on explain how the code flow works.

Getting the row segments

sequential horizontal search until a black pixel is found.
starting from the pixel founded searchs horizontally until it finds a blank pixel row. This blank row is the bottom part of the row.
while the step 2 is happening the program finds the left smalles pixel and the right greater pixel.
When step 2 is done, the bounds are saved and the algorithm looks for the next starting pixel row until it reach the end of the image.
segment the original image acording to the segments obtained.
time is saved by stopping the search at the momment a black pixel apears, this redices time searching.

Getting the char segmentes

Sequential vertical search until a black pixel is found.
Starting from the pixel founded searchs vertical until it finds a blank pixel column. This blank column is the right part of the char.
While the step 2 is happening the program finds the top smalles pixel and the bottom greater pixel.
When step 2 is done, the bounds are saved and the algorithm looks for the next starting pixel column until it reach the end of the image.
Segment the original image acording to the segments obtained.
Time is saved by stopping the search at the momment a black pixel apears, this reduces time searching.

Walsh Hadamard function

Creates the binary matrixes with 0 and 1 acording to the binary images.
Creates the HW matrix.
Being H = HW matrix & M the binary matrix & a option scalar sometimes used as S = 1 / len(H).
for each m in M's: M = H * M * H [* scalar].
for each m in M's: sumColumns(M).

8. Conclusion

The program resulted in a modular program, you can get an HW array of chars from an image by calling the main function and giving it the path to your image, I recommend using complete paths NOT relative paths.
The program is using full docstring showing params, returns, and function being usen inside a function.
The program succesfully gets chars without blank spaces or fixel sizes,this is done by a custom code.
The program succesfully applies the HW transform to the 64x64 matrixes, the Hadamard Walsh matrix is being generated inside the code as custom.
All images convertions and modifications are made custom pixel by pixel.
The program needs only a single path to and image and it will get the HW vectors.
Dependencies between functions are minimal and said functions are modular, following modularity rules.

9. Additional information

The code is as specified, it can be improve but this means using non custom functions.

I would love to see the program running in a more powered machine.

Made by freelancer -josucano on fiverr.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
images		images
src		src
work		work
LICENSE		LICENSE
README.md		README.md
idea.jpg		idea.jpg
workflow.jpg		workflow.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python: Custom text image compression

Contents

1. Run

2. Problem

3. Rules

4. Solution

5. Challenges

6. Annotations

7. Code flow

Getting the row segments

Getting the char segmentes

Walsh Hadamard function

8. Conclusion

9. Additional information

About

Languages

License

tecompufiv/custom-text-image-compress

Folders and files

Latest commit

History

Repository files navigation

Python: Custom text image compression

Contents

1. Run

2. Problem

3. Rules

4. Solution

5. Challenges

6. Annotations

7. Code flow

Getting the row segments

Getting the char segmentes

Walsh Hadamard function

8. Conclusion

9. Additional information

About

Topics

Resources

License

Stars

Watchers

Forks

Languages