Added Test Email with non UTF-8 Characters which breaks execution. #4

zaghadon · 2024-04-19T13:33:20Z

In response to dunnkers#3 (comment) on dunnkers#3

zaghadon · 2024-04-19T14:11:41Z

Hi @dunnkers I find something interesting here, when I run grep -axv '.*' ./* It outputs rightly the eml file with the non UTF-8 character found, and I if I try to run the globally installed package on the folder where it is it'll fail with the error message, but then when I copy it to the test_emails/ folder and run the test, it'll work successfully after which running grep -axv '.*' ./* would also outputs nothing anymore.

Is the eml file getting transformed to utf-8 during test?

Meanwhile, when I paste all the eml files in the test_emails/ folder I get 17 failed tests, and I can't put all of that into the repository.

dunnkers · 2024-04-23T08:44:27Z

Hi @zaghadon ! I just executed the CI pipeline for this branch and it all seems to pass:

https://github.com/dunnkers/eml-to-html/actions/runs/8754777933?pr=4

.. same locally. So that executes the CI on ubuntu, macOS and Windows. Any idea how we can still reproduce the issue? And great you're so actively contributing !

zaghadon · 2024-04-24T11:51:49Z

Hello @dunnkers I just created a test_email_3.eml which I stuffed with characters of different encodings and it indeed failed the test on local. However, I checked the CI Pipeline logs and don't think it reached the pytest workflow. Could you try this to see how this is replicated?

…, closest to the previous deprecated version. See Manifest for Available Releases -> https://raw.githubusercontent.com/actions/python-versions/main/versions-manifest.json

zaghadon · 2024-05-02T22:09:27Z

Hello @dunnkers I finally checkedout a branch to try to fix the breaking of the workflow at Python Installation, after several tries, I discovered the error was coming from the latest MacOS, I couldn't troubleshoot it, so I removed MacOS in the GitHub Action and The tests passed with the existing tests.

Then I merged it into the latest add-test-email-for-#2 branch. You can check the recent workflow runs to see that execution breaks at Test with message:

FAILED test_eml_to_html.py::test_eml_to_html[test_emails/test_email_3.eml] - UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe1 in position 159: invalid continuation byte

Review this updates and merge to try it with the initial Pull Request that contains fixes at #3

zaghadon · 2024-05-28T23:57:03Z

@dunnkers Hi Jereon, I just wanted to bring this again to your notice, if you have a free time, kindly review.

dunnkers · 2024-06-23T20:15:02Z

@dunnkers Hi Jereon, I just wanted to bring this again to your notice, if you have a free time, kindly review.

Hi @zaghadon your continuous effort is much appreciated. I hadn't had much time to look at this. My main thought here is that when we change the reading encoding to latin-1, that the reading process keeps supporting other encodings as well. Perhaps we can make the code such that we support both? By either detecting which encoding we should use or by just trying utf-8 first and then falling back to latin-1 in case of an error.

How's that sound? Curious to hear your thoughts. In either case- have a great day still ☀️.

zaghadon added 2 commits April 19, 2024 14:32

Added Test Email with non UTF-8 Characters which breaks execution.

bef7053

In response to dunnkers#3 (comment) on dunnkers#3

Breaking Email confirmed.

17cefed

Finally made one that failed the test.

4446b0f

zaghadon and others added 5 commits May 2, 2024 22:14

Bumped the Python Version the next available version on the Maninfest…

89b6306

…, closest to the previous deprecated version. See Manifest for Available Releases -> https://raw.githubusercontent.com/actions/python-versions/main/versions-manifest.json

Retry with any 3.x stable dist

64f1552

Retry with the most 3.7.x stable dist

caf8dfc

Removed MacOS

329e22f

Merge branch 'fix-python-version' into add-test-email-for-dunnkers#2

733a2a9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Test Email with non UTF-8 Characters which breaks execution. #4

Added Test Email with non UTF-8 Characters which breaks execution. #4

zaghadon commented Apr 19, 2024

zaghadon commented Apr 19, 2024

dunnkers commented Apr 23, 2024

zaghadon commented Apr 24, 2024

zaghadon commented May 2, 2024

zaghadon commented May 28, 2024

dunnkers commented Jun 23, 2024

Added Test Email with non UTF-8 Characters which breaks execution. #4

Are you sure you want to change the base?

Added Test Email with non UTF-8 Characters which breaks execution. #4

Conversation

zaghadon commented Apr 19, 2024

zaghadon commented Apr 19, 2024

dunnkers commented Apr 23, 2024

zaghadon commented Apr 24, 2024

zaghadon commented May 2, 2024

zaghadon commented May 28, 2024

dunnkers commented Jun 23, 2024